Category Archives: Natural Language Processing

Advanced approximate sentence matching in Python

In our last post, we went over a range of options for approximate sentence matching in Python, an important step in many natural language processing and machine learning applications. To begin, we defined terms like: tokens: a word, number, or other “discrete” unit of text. stems: words...

Fuzzy match sentences in Python

Let’s imagine you have a sentence of interest.  You’d like to find all occurrences of this sentence within a corpus of text.  How would you go about this? The most obvious answer is to look for exact matches of the sentence.  You’d search through every sentence of your corpus,...
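
As a rough sketch of that exact-match baseline: the snippet below splits a corpus into sentences and returns the positions of the sentences identical to the one you care about. The helper names `split_sentences` and `find_exact_matches` are hypothetical, chosen here for illustration, and the regex splitter is a deliberately naive assumption (it will trip over abbreviations like "Dr."); a real project would likely use a proper sentence tokenizer.

```python
import re


def split_sentences(text):
    """Naively split text into sentences after '.', '!' or '?' plus whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def find_exact_matches(sentence, corpus):
    """Return the indices of corpus sentences that equal `sentence` exactly."""
    target = sentence.strip()
    return [i for i, s in enumerate(split_sentences(corpus)) if s == target]


corpus = "The cat sat on the mat. The dog barked. The cat sat on the mat."
matches = find_exact_matches("The cat sat on the mat.", corpus)
print(matches)
```

Note that the comparison is case- and punctuation-sensitive, so "the cat sat on the mat." would not match anything here. That brittleness is what makes exact matching only a starting point.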