Semantic Similarity Measurement Embedding Tools

Tools and benchmarks for measuring semantic similarity between text units (words, sentences, documents) using embeddings and NLP methods. Includes evaluation datasets, comparison studies, and similarity calculation implementations. Does NOT include clustering applications, search systems, or downstream tasks like recommendation or matching.

There are 31 semantic similarity measurement tools tracked. The highest-rated is Garrafao/LSCDetection at 42/100 with 31 stars.

Get all 31 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=embeddings&subcategory=semantic-similarity-measurement&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 Garrafao/LSCDetection

Data Sets and Models for Evaluation of Lexical Semantic Change Detection

42
Emerging
2 RepoAnalysis/RepoSim

This repository contains experiments on comparing the similarity of Python...

32
Emerging
3 cod3licious/simec

Similarity Encoder (SimEc) Neural Network Framework for learning low...

31
Emerging
4 jorge-martinez-gil/uwsd

Context-Aware Semantic Similarity Measurement for Unsupervised Word Sense...

29
Experimental
5 cr1m5onk1ng/text_similarity

A nlp library for text similarity based on Transformer models

29
Experimental
6 taskswithcode/sentence_similarity_app

App to compare state-of-the-art models for sentence similarity task

29
Experimental
7 pabvald/semantic-similarity

Comparison of methods based on pre-trained Word2Vec, GloVe and FastText...

28
Experimental
8 Reilly-ConceptsCognitionLab/SemanticDistance

Computes Pairwise Semantic Distance Between Tokens (ngrams, words, turns) in...

27
Experimental
9 ymoslem/Sentence-Similarity

Sentence Similarity Approaches

26
Experimental
10 albertrial/SemEval-2012-task-6

Semantic Textual Similarity: task which consists in evaluating the degree of...

24
Experimental
11 kunal4040/hybrid-search-eval

🔍 Benchmark embedding models in hybrid search with Weaviate. Evaluate MRR@K,...

23
Experimental
12 paulbricman/semantica

Extending conceptual thinking with semantic embeddings.

23
Experimental
13 MatthewPaver/sentence-similarity-analysis

Semantic sentence similarity demonstration using transformer-based embedding...

21
Experimental
14 arclabs561/decksage

Card similarity and deck operations for trading card games (Magic, Pokemon, Yu-Gi-Oh)

21
Experimental
15 anebz/eu-sim

Exploring semantic similarities between contextualized embeddings

19
Experimental
16 bloomberg/semantic-similarity-covariance-shrinkage

Code release for Semantic Similarity Covariance Matrix Shrinkage

19
Experimental
17 joaquimgomez/BachelorsThesis-TextSimilarityMeasures

Code and models used in my Bachelor’s Degree Thesis about large text...

18
Experimental
18 natylaza89/semantic-similarity-llm-dating-app

Semantic Similarity LLM Dating App using Python 3.12, FastAPI, WebSockets,...

17
Experimental
19 Juancinho/similitud-palabras

Es una implementación en python para visualizar la idea de los embeddings de...

17
Experimental
20 colindeseroux/semantop

🧠Semantop is base of word2vec to create a french semantics game

14
Experimental
21 manasRK/semantica

All you need for text preprocessing for NLP

13
Experimental
22 xelandr3/jabruuuhtix

Real-time multiplayer word game inspired by Cémantix, based on semantic...

13
Experimental
23 LironOhana/sentence-similarity-embeddings

M.Sc. assignment — Sentence similarity with embeddings (STS Benchmark;...

13
Experimental
24 omrylcn/semeval

A modular toolkit for evaluating semantic embeddings

13
Experimental
25 lavis-nlp/german_legal_sentences

A dataset of semantically related sentence pairs in the German legal domain

13
Experimental
26 cwc09262/psalms-nlp-research

This is a research specific repository based on a foundation built from the...

13
Experimental
27 asteriscuz/semantic-similarity-calculator

Semantic similarity calculator using sentence embeddings

13
Experimental
28 ocha221/semantic-tagging-tools

simple tag expander/merger based on what's found within a tagged dataset

11
Experimental
29 taskswithcode/semantic_search_app

App to compare state-of-the-art models for sentence similarity task

11
Experimental
30 d1pankarmedhi/texteval

🏋️ Evaluate sentence similarities with standard metrics for NLP related tasks

11
Experimental
31 taskswithcode/semantic_clustering_app

About App to compare state-of-the-art models for semantic clustering task

11
Experimental