Text Similarity Matching NLP Tools
Tools for computing similarity scores between text passages, phrases, or documents using embeddings, distance metrics, or statistical methods. Does NOT include semantic search engines, plagiarism detection systems, or applications that use similarity as a component of larger NLP tasks.
There are 53 text similarity matching tools tracked. 2 score above 50 (established tier). The highest-rated is shibing624/similarity at 61/100 with 1,571 stars.
Get all 53 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=text-similarity-matching&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
shibing624/similarity
similarity: Text similarity calculation Toolkit for Java.... |
|
Established |
| 2 |
eBay/Sequence-Semantic-Embedding
Tools and recipes to train deep learning models and build services for NLP... |
|
Established |
| 3 |
RandolphVI/Text-Pairs-Relation-Classification
About Text Pairs (Sentence Level) Classification (Similarity Modeling) Based... |
|
Emerging |
| 4 |
MartinoMensio/spacy-universal-sentence-encoder
Google USE (Universal Sentence Encoder) for spaCy |
|
Emerging |
| 5 |
piotrmaciejbednarski/text-similarity-node
High-performance and memory efficient native C++ text similarity algorithms... |
|
Emerging |
| 6 |
brianrisk/simphile-text-similarity-nlp
Python Text Similarity NLP Libray |
|
Emerging |
| 7 |
nullnull/simstring
A Python implementation of the SimString, a simple and efficient algorithm... |
|
Emerging |
| 8 |
likejazz/Siamese-LSTM
Siamese LSTM for evaluating semantic similarity between sentences of the... |
|
Emerging |
| 9 |
jinglescode/textual-similarity-universal-sentence-encoder
Extract embeddings and group sentences with universal sentence encoder... |
|
Emerging |
| 10 |
zqhZY/semanaly
semantic analysis using word2vector, doc2vector,lstm and other method.... |
|
Emerging |
| 11 |
Neurotech-HQ/pysimilar
A python library for computing the similarity between two string(text) based... |
|
Emerging |
| 12 |
brunoarine/findlike
Command-line tool that finds lexically similar documents in relation to a... |
|
Emerging |
| 13 |
vishnumani2009/siamese-text-similarity
Tensorflow based implementation of deep siamese LSTM network for sentence... |
|
Emerging |
| 14 |
google/unisim
UniSim is a package for efficient similarity computation, fuzzy matching,... |
|
Emerging |
| 15 |
amansrivastava17/bns-short-text-similarity
📖 Use Bi-normal Separation to find document vectors which is used to... |
|
Emerging |
| 16 |
rameshjes/Semantic-Textual-Similarity
Natural Language Processing using NLTK and Spacy |
|
Emerging |
| 17 |
adrianmfi/git-semantic-similarity
Search git commit messages by semantic similarity with embeddings from... |
|
Emerging |
| 18 |
qiangzi11hao/Semantic-Textual-Similarity
è¯ä¹‰ç›¸ä¼¼åº¦ |
|
Emerging |
| 19 |
caiyinqiong/Semantic-Retrieval-Models
A curated list of awesome papers for Semantic Retrieval (TOIS Accepted:... |
|
Emerging |
| 20 |
kenneth-lange/java-nlp-text-similarity
Measure the similarity between different text documents. |
|
Emerging |
| 21 |
eu90h/semantic-dictionary
A Python dictionary that uses semantic similarity for key matching instead... |
|
Emerging |
| 22 |
ocramz/ncd-tree
text similarity search trees based on Normalized Compression Distance |
|
Emerging |
| 23 |
ankushbhatia2/Document_similarity_research_notebook
Jupyter notebook for my research in Document Similarity. |
|
Emerging |
| 24 |
victormurcia/PyUMLS_Similarity
This package computes a variety of similarity metrics between concepts... |
|
Emerging |
| 25 |
vukbatanovic/STSFineGrain
A collection of Semantic Textual Similarity (STS) models and a framework for... |
|
Emerging |
| 26 |
goliasz/pio-template-text-similarity
Text similarity based on Word2Vec vectors. |
|
Experimental |
| 27 |
PedroMTQ/UniFunc
Tool for similarity analysis of protein function annotations. |
|
Experimental |
| 28 |
Aditya1001001/similarity-and-embedding-app
Learn about text similarity measures & text embedding methods. |
|
Experimental |
| 29 |
khalidbelk/jaccard
🧬 Calculate the similarity index between two texts |
|
Experimental |
| 30 |
Aghabeiki/sentences2vector
Get Similarity of two sentences based gensim word2vec trained model. |
|
Experimental |
| 31 |
mforstenhaeusler/Siamese-LSTM-for-Semantic-Similarity-PyTorch
This repositpory entails an implementation of a Deep Learning Pipeline that... |
|
Experimental |
| 32 |
sauravjoshi23/SiameseNet-Loss
Text Classification Using Siamese Neural Networks - Contrastive Loss,... |
|
Experimental |
| 33 |
nurhaankhaled/SentenceToSentenceSemanticSimilarity
First rank winner at the Natural Language Processing competition FCIS-ASU 2021-2022. |
|
Experimental |
| 34 |
hiun/siamese-nn
Tensorflow Implementation of Siamese Neural Network for Learning Sentence Similarity |
|
Experimental |
| 35 |
DavidHarar/Siamese-Networks-for-name-nickname-similarity
Using siamese networks to learn a similarity metric between names and nicknames. |
|
Experimental |
| 36 |
chigwell/compario
A new package that uses large language models and pattern matching to... |
|
Experimental |
| 37 |
FDSRashid/Semantic-Similarity-Arabic
A package which implements various ways to measure semantic similarity of... |
|
Experimental |
| 38 |
adrianmfi/semantic-similarity-cli
semantic-similarity-cli is a command-line tool leveraging NLP models to... |
|
Experimental |
| 39 |
VanekPetr/text-similarity-ranking
Algorithm to rank text similarity between set of strings and given inputs |
|
Experimental |
| 40 |
mrpawan-gupta/TextTo
The Repository Contains The CPP Program to Calculate the Cosine Similarity... |
|
Experimental |
| 41 |
plain-jane-gray/parse-PDF-NLP-ML
Parses apart a PDF file into separate documents and then uses Natural... |
|
Experimental |
| 42 |
likithponnanna/text-similarity-scorer
This repo contains files to code that deploys an API endpoint for a user to... |
|
Experimental |
| 43 |
t-ski/string-similarity-algorithms
Common string similarity algorithm implementations. |
|
Experimental |
| 44 |
SirapopChu/DSI314_Keyword-similarity-matching-to-assess-the-potential-of-Pathum-Thani-provincial-
The project “Keyword Similarity Matching to Assess the Potential of Pathum... |
|
Experimental |
| 45 |
plandes/spanmatch
Unsupervised Position-Based Semantic Matching |
|
Experimental |
| 46 |
tonyfan08/NLP_TEXT_NER_SIMILARITY
AI NLP Competition |
|
Experimental |
| 47 |
tos-kamiya/d2vg
A Doc2Vec grep. On your desktop. |
|
Experimental |
| 48 |
asgaardlab/21-markos-test_case_similarity_technique-code
Repository with the source code of our technique to analyze a test suite and... |
|
Experimental |
| 49 |
ash-sha/Semantic-Textual-Similarity-NLP
Measuring similarity of a sentence for Biomedical Texts |
|
Experimental |
| 50 |
kaledhoshme123/Semantic-Similarity-using-TimeDistributed-LSTM
The following notebook, reviews the methodology by which we can build a... |
|
Experimental |
| 51 |
shamil8/nlp-similarity
Gensim & Word2Vec for task similarity, with odds-based date estimation for... |
|
Experimental |
| 52 |
tamsinrogers/IdentifierSimilarity
A Python tool that uses Namesake to further explore the impact of natural... |
|
Experimental |
| 53 |
tos-kamiya/stng
An off-the-shelf grep-like tool that performs semantic similarity search |
|
Experimental |