User:Hmerlino/Books/Text Mining
Appearance
The Wikimedia Foundation's book rendering service has been withdrawn. Please upload your Wikipedia book to one of the external rendering services. |
You can still create and edit a book design using the Book Creator and upload it to an external rendering service:
|
This user book is a user-generated collection of Wikipedia articles that can be easily saved, rendered electronically, and ordered as a printed book. If you are the creator of this book and need help, see Help:Books (general tips) and WikiProject Wikipedia-Books (questions and assistance). Edit this book: Book Creator · Wikitext Order a printed copy from: PediaPress [ About ] [ Advanced ] [ FAQ ] [ Feedback ] [ Help ] [ WikiProject ] [ Recent Changes ] |
Text Mining[edit]
Introducction[edit]
- INTRODUCTION
- Text mining
- General Architecture for Text Engineering
- Unstructured data
- Document-term matrix
- Bag-of-words model
- Vector space model
- Tf–idf
- Generalized vector space model
- Information retrieval
- Okapi BM25
- Rocchio algorithm
- Inverted index
- Web crawler
- Concept map
- Metadata
- Language model
- Hidden Markov model
- Baum–Welch algorithm
- Viterbi algorithm
- CLUSTERING HIGH DIMENSIONAL DATA
- Document clustering
- Clustering high-dimensional data
- Biclustering
- Mixture model
- INFORMATION EXTRACTION AND NLP
- Information extraction
- Knowledge extraction
- Natural language processing
- Part of speech
- Part-of-speech tagging
- Named-entity recognition
- Automatic summarization
- Sentiment analysis
- OpenNLP
- UIMA
- DIMENSIONALITY REDUCTION AND MODELING
- Principal component analysis
- Curse of dimensionality
- Singular value decomposition
- Latent variable
- Latent semantic analysis
- Probabilistic latent semantic analysis
- Latent Dirichlet allocation
- Factor analysis
- Non-negative matrix factorization
- Regularization (mathematics)
- TEXT CLASSIFICATION
- Naive Bayes spam filtering
- Naive Bayes classifier
- Logistic regression
- String kernel