kavgan/nlp-in-practice

Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.

/ 100

Emerging

This project provides practical code examples and tools to help you analyze and understand large amounts of text. You can feed in documents, articles, or other textual data, and it will help you extract key phrases, summarize topics, classify content, or visualize important words. This is designed for data analysts, researchers, or anyone who needs to make sense of unstructured text.

1,183 stars. No commits in the last 6 months.

Use this if you need to process and gain insights from text data, such as identifying important keywords, classifying documents, or preparing text for further analysis.

Not ideal if you are looking for a ready-to-use, off-the-shelf application with a graphical interface for text analysis.

text-analytics content-analysis data-science natural-language-processing document-classification

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 25 / 25

How are scores calculated?

Stars

1,183

Forks

788

Language

Jupyter Notebook

License

—

Higher-rated alternatives

nltk/nltk

NLTK Source

explosion/spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

undertheseanlp/underthesea

Underthesea - Vietnamese NLP Toolkit

stanfordnlp/stanza

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many...

flairNLP/flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Explore NLP Tools

All categories Trending NLP directory Insights