Text Similarity Matching NLP Tools

Tools for computing similarity scores between text passages, phrases, or documents using embeddings, distance metrics, or statistical methods. The category does NOT include semantic search engines, plagiarism detection systems, or applications that use similarity as a component of larger NLP tasks.

There are 53 text similarity matching tools tracked. Two score 50 or above (Established tier). The highest-rated is shibing624/similarity at 61/100 with 1,571 stars.

Get all 53 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=text-similarity-matching&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
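For scripted access, the same request can be made from Python's standard library. This is a minimal sketch, not an official client: the helper names `build_url` and `fetch_projects` are hypothetical, the shape of the JSON response is not documented here so it is returned unmodified, and whether `limit` accepts values above the 20 shown in the sample URL is an assumption.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="nlp", subcategory="text-similarity-matching", limit=20):
    """Assemble the request URL with the query parameters from the curl example."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=10):
    """Fetch and decode the JSON payload (hypothetical helper).

    The response structure is an assumption: the parsed JSON is returned
    as-is, without relying on any particular field names.
    """
    with urlopen(build_url(limit=limit), timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_projects()` counts against the daily request quota, so caching the result locally is sensible when experimenting.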

| # | Tool | Description | Score | Tier |
|---|------|-------------|-------|------|
| 1 | shibing624/similarity | similarity: Text similarity calculation Toolkit for Java.... | 61 | Established |
| 2 | eBay/Sequence-Semantic-Embedding | Tools and recipes to train deep learning models and build services for NLP... | 50 | Established |
| 3 | RandolphVI/Text-Pairs-Relation-Classification | About Text Pairs (Sentence Level) Classification (Similarity Modeling) Based... | 48 | Emerging |
| 4 | MartinoMensio/spacy-universal-sentence-encoder | Google USE (Universal Sentence Encoder) for spaCy | 46 | Emerging |
| 5 | piotrmaciejbednarski/text-similarity-node | High-performance and memory efficient native C++ text similarity algorithms... | 45 | Emerging |
| 6 | brianrisk/simphile-text-similarity-nlp | Python Text Similarity NLP Library | 43 | Emerging |
| 7 | nullnull/simstring | A Python implementation of the SimString, a simple and efficient algorithm... | 42 | Emerging |
| 8 | likejazz/Siamese-LSTM | Siamese LSTM for evaluating semantic similarity between sentences of the... | 41 | Emerging |
| 9 | jinglescode/textual-similarity-universal-sentence-encoder | Extract embeddings and group sentences with universal sentence encoder... | 41 | Emerging |
| 10 | zqhZY/semanaly | semantic analysis using word2vector, doc2vector, lstm and other method.... | 40 | Emerging |
| 11 | Neurotech-HQ/pysimilar | A python library for computing the similarity between two string(text) based... | 39 | Emerging |
| 12 | brunoarine/findlike | Command-line tool that finds lexically similar documents in relation to a... | 38 | Emerging |
| 13 | vishnumani2009/siamese-text-similarity | Tensorflow based implementation of deep siamese LSTM network for sentence... | 38 | Emerging |
| 14 | google/unisim | UniSim is a package for efficient similarity computation, fuzzy matching,... | 36 | Emerging |
| 15 | amansrivastava17/bns-short-text-similarity | 📖 Use Bi-normal Separation to find document vectors which is used to... | 33 | Emerging |
| 16 | rameshjes/Semantic-Textual-Similarity | Natural Language Processing using NLTK and Spacy | 33 | Emerging |
| 17 | adrianmfi/git-semantic-similarity | Search git commit messages by semantic similarity with embeddings from... | 33 | Emerging |
| 18 | qiangzi11hao/Semantic-Textual-Similarity | Semantic similarity | 32 | Emerging |
| 19 | caiyinqiong/Semantic-Retrieval-Models | A curated list of awesome papers for Semantic Retrieval (TOIS Accepted:... | 32 | Emerging |
| 20 | kenneth-lange/java-nlp-text-similarity | Measure the similarity between different text documents. | 32 | Emerging |
| 21 | eu90h/semantic-dictionary | A Python dictionary that uses semantic similarity for key matching instead... | 32 | Emerging |
| 22 | ocramz/ncd-tree | text similarity search trees based on Normalized Compression Distance | 31 | Emerging |
| 23 | ankushbhatia2/Document_similarity_research_notebook | Jupyter notebook for my research in Document Similarity. | 31 | Emerging |
| 24 | victormurcia/PyUMLS_Similarity | This package computes a variety of similarity metrics between concepts... | 31 | Emerging |
| 25 | vukbatanovic/STSFineGrain | A collection of Semantic Textual Similarity (STS) models and a framework for... | 30 | Emerging |
| 26 | goliasz/pio-template-text-similarity | Text similarity based on Word2Vec vectors. | 29 | Experimental |
| 27 | PedroMTQ/UniFunc | Tool for similarity analysis of protein function annotations. | 29 | Experimental |
| 28 | Aditya1001001/similarity-and-embedding-app | Learn about text similarity measures & text embedding methods. | 29 | Experimental |
| 29 | khalidbelk/jaccard | 🧬 Calculate the similarity index between two texts | 29 | Experimental |
| 30 | Aghabeiki/sentences2vector | Get Similarity of two sentences based on a gensim word2vec trained model. | 27 | Experimental |
| 31 | mforstenhaeusler/Siamese-LSTM-for-Semantic-Similarity-PyTorch | This repository entails an implementation of a Deep Learning Pipeline that... | 26 | Experimental |
| 32 | sauravjoshi23/SiameseNet-Loss | Text Classification Using Siamese Neural Networks - Contrastive Loss,... | 25 | Experimental |
| 33 | nurhaankhaled/SentenceToSentenceSemanticSimilarity | First rank winner at the Natural Language Processing competition FCIS-ASU 2021-2022. | 24 | Experimental |
| 34 | hiun/siamese-nn | Tensorflow Implementation of Siamese Neural Network for Learning Sentence Similarity | 22 | Experimental |
| 35 | DavidHarar/Siamese-Networks-for-name-nickname-similarity | Using siamese networks to learn a similarity metric between names and nicknames. | 21 | Experimental |
| 36 | chigwell/compario | A new package that uses large language models and pattern matching to... | 21 | Experimental |
| 37 | FDSRashid/Semantic-Similarity-Arabic | A package which implements various ways to measure semantic similarity of... | 21 | Experimental |
| 38 | adrianmfi/semantic-similarity-cli | semantic-similarity-cli is a command-line tool leveraging NLP models to... | 19 | Experimental |
| 39 | VanekPetr/text-similarity-ranking | Algorithm to rank text similarity between set of strings and given inputs | 19 | Experimental |
| 40 | mrpawan-gupta/TextTo | The Repository Contains The CPP Program to Calculate the Cosine Similarity... | 19 | Experimental |
| 41 | plain-jane-gray/parse-PDF-NLP-ML | Parses apart a PDF file into separate documents and then uses Natural... | 19 | Experimental |
| 42 | likithponnanna/text-similarity-scorer | This repo contains files to code that deploys an API endpoint for a user to... | 19 | Experimental |
| 43 | t-ski/string-similarity-algorithms | Common string similarity algorithm implementations. | 18 | Experimental |
| 44 | SirapopChu/DSI314_Keyword-similarity-matching-to-assess-the-potential-of-Pathum-Thani-provincial- | The project “Keyword Similarity Matching to Assess the Potential of Pathum... | 17 | Experimental |
| 45 | plandes/spanmatch | Unsupervised Position-Based Semantic Matching | 17 | Experimental |
| 46 | tonyfan08/NLP_TEXT_NER_SIMILARITY | AI NLP Competition | 17 | Experimental |
| 47 | tos-kamiya/d2vg | A Doc2Vec grep. On your desktop. | 12 | Experimental |
| 48 | asgaardlab/21-markos-test_case_similarity_technique-code | Repository with the source code of our technique to analyze a test suite and... | 12 | Experimental |
| 49 | ash-sha/Semantic-Textual-Similarity-NLP | Measuring similarity of a sentence for Biomedical Texts | 11 | Experimental |
| 50 | kaledhoshme123/Semantic-Similarity-using-TimeDistributed-LSTM | The following notebook reviews the methodology by which we can build a... | 10 | Experimental |
| 51 | shamil8/nlp-similarity | Gensim & Word2Vec for task similarity, with odds-based date estimation for... | 10 | Experimental |
| 52 | tamsinrogers/IdentifierSimilarity | A Python tool that uses Namesake to further explore the impact of natural... | 10 | Experimental |
| 53 | tos-kamiya/stng | An off-the-shelf grep-like tool that performs semantic similarity search | 10 | Experimental |
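The tier labels in the listing appear to follow score bands. The sketch below encodes cut-offs inferred only from the rows shown here (lowest Established score is 50, Emerging runs from 48 down to 30, Experimental starts at 29); the exact boundaries used by the site are not stated, so the thresholds and the function name `tier_for` are assumptions.

```python
# Assumed cut-offs, inferred from the listing rather than documented by the site.
ESTABLISHED_MIN = 50  # lowest score observed with the Established label
EMERGING_MIN = 30     # lowest score observed with the Emerging label


def tier_for(score):
    """Map a 0-100 quality score to a tier label under the assumed cut-offs."""
    if score >= ESTABLISHED_MIN:
        return "Established"
    if score >= EMERGING_MIN:
        return "Emerging"
    return "Experimental"
```

A score of 49 does not occur in the listing, so its placement under Emerging here is a guess rather than an observed fact.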