Originally Written on:- 11th May, 2019. Natural Language Processing There are a few concepts that are absolutely essential for NLP. Only a few has been discussed in this blog post This blog consists of a few parts:- stemming and lemmatization TF-IDF in NLP Cosine Similarity. Stemming Stemming algorithms work by cutting off the end or the beginning of the word, taking into account a list of common prefixes and suffixes that can be found in an inflected word. This indiscriminate cutting can be successful in some occasions, but not always, and that is why we affirm that this approach presents some limitations. Below we illustrate the method with examples in both English and Spanish. Stemming refers to a crude heuristic process that chops off the end of words in the hope of achieving this goal correctly most of the time and often includes the removal of derivational affixes. Overstemming and Understemming However, because ...