Corpus Natural Language Data NLP - Natural Language Processing gensim - It is an open source library for unsupervised topic modeling and natural language processing, using modern statistical machine learning. Scikit-learn - SciPy Toolkit. It features various classification, regression and clustering algorithms including support vector machines. NLTK - The Natural Language ToolKit, is a suite of ..