TF-IDF Text Analysis NLP Tools

Tools and implementations for TF-IDF vectorization, text classification, and document analysis using term frequency-inverse document frequency methods. This category does NOT include other text representation techniques (word2vec, BERT), general machine learning frameworks, or domain-specific applications like sentiment analysis or fake news detection.

21 TF-IDF text analysis tools are tracked. One scores above 50 (Established tier). The highest-rated is textvec/textvec at 58/100 with 197 stars.

Get all 21 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=tfidf-text-analysis&limit=20"

Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
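The same request can be made from a script. The following is a minimal Python sketch using only the standard library. It reuses the URL and query parameters from the curl command above, including `limit=20`. The function names `build_url` and `fetch_projects` are hypothetical, the response schema is not documented here so the parsed JSON is returned unchanged, and the sketch uses the keyless tier because the way a key is supplied is not shown on this page.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="nlp", subcategory="tfidf-text-analysis", limit=20):
    """Assemble the dataset URL with the same query parameters as the curl command."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """Download and parse the JSON body. No response schema is assumed."""
    request = Request(url, headers={"Accept": "application/json"})
    with urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_projects(build_url())` performs one request against the daily quota and returns whatever JSON structure the service sends back; inspect it before relying on any particular field name.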

| # | Tool | Description | Score | Tier |
|---|------|-------------|-------|------|
| 1 | textvec/textvec | Text vectorization tool to outperform TFIDF for classification tasks | 58 | Established |
| 2 | DigitalPebble/behemoth | Behemoth is an open source platform for large scale document analysis based... | 48 | Emerging |
| 3 | cooperability/BMX-bookmark-extractor | Better brain. Knowledge management tool. Stop saving things you'll never... | 44 | Emerging |
| 4 | nasa-jpl-memex/memex-gate | General Architecture for Text Engineering | 43 | Emerging |
| 5 | NISH1001/tag-generator | A simple tool to generate tags for the given text (document) using TF-IDF. | 42 | Emerging |
| 6 | paradite/tf-idf-keyword | :mag_right: Get keywords from a piece of text using tf-idf | 34 | Emerging |
| 7 | go-nlp/tfidf | tfidf provides TF-IDF functionality | 32 | Emerging |
| 8 | pemagrg1/Magic-Of-TFIDF | TFIDF being the most basic and simple topic in NLP, there's alot that can be... | 27 | Experimental |
| 9 | AsadiAhmad/TF-IDF-Model | Retrieve Information from Text Documents with TF-IDF model and dimention... | 21 | Experimental |
| 10 | wasiahmad/mining_wikipedia | Extract mentions and category taxonomy from Wikipedia | 20 | Experimental |
| 11 | GhariebML/NLP_Text_Representation_Techniques | A comprehensive notebook demonstrating various text representation... | 19 | Experimental |
| 12 | adityabisht02/Research-Paper-Finder-Based-On-Similarity | A fullstack application which can be used to get the most similar research... | 18 | Experimental |
| 13 | krrish-v/mark_importer | Provide a category for all the imported bookmarks, makes easy to manage by... | 18 | Experimental |
| 14 | wangyuhsin/tfidf-text-summarization | This repository contains Python scripts for performing TF-IDF (Term... | 17 | Experimental |
| 15 | karrarkazuya/KTP-java | A simple yet smart search in texts library. it will give you in percent how... | 17 | Experimental |
| 16 | jiayao99/tfidf-text-classification | A tutorial on using TF-IDF for text classification | 17 | Experimental |
| 17 | aneessaheba/hadoop-news-analytics | Distributed word frequency analysis on 5,000 HuffPost news headlines using... | 14 | Experimental |
| 18 | meanderinghuman/tfidf-news-classifier | 📰 TF-IDF News Classifier: Zero-training, pure-Python tool that uses TF-IDF +... | 11 | Experimental |
| 19 | Hasnat-Aarif-Aslam/NLP-Foundation-Tokens-Ngrams-BoW-TF-IDF-TFIDF | Comprehensive guide to text preprocessing and vectorization techniques for... | 11 | Experimental |
| 20 | eskutcheon/OnetabAutosorter | tool I wrote in a day or two awhile back using KeyBERT to parse groups of... | 11 | Experimental |
| 21 | craigtrim/tfidf-zones | TF IDF Zones | 11 | Experimental |