Text properties, operations and preprocessing

By Sabib

 Important Questions

Tokenization, Text Normalization, Stop-word removal, Morphological Analysis, Word Stemming (Porter Algorithm), Case folding, Lemmatization, Word statistics (Zipf’s law, Heaps’Law), Index term selection, Inverted indices, Positional Inverted index, Natural Language Processing in Information Retrieval, Basic NLP tasks – POS tagging; shallow parsing

Important Questions
Comments
Subscribe
Notify of
0 Comments
Oldest
Newest Most Voted