nullnull/simstring
A Python implementation of the SimString, a simple and efficient algorithm for approximate string matching.
This helps you quickly find text strings that are similar to a given input, even if they have slight differences or typos. You provide a list of texts and a query text, and it returns all the texts from your list that closely match the query. This is designed for software developers building applications that need to process and match text efficiently.
125 stars. No commits in the last 6 months.
Use this if you are building an application that requires fast and accurate retrieval of similar strings from a large collection, such as for autocomplete, data cleaning, or fuzzy search features.
Not ideal if you need to compare entire documents or paragraphs, or if your application requires advanced semantic understanding of text rather than just character or word similarity.
Stars
125
Forks
17
Language
Python
License
MIT
Category
Last pushed
Oct 24, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/nullnull/simstring"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
shibing624/similarity
similarity: Text similarity calculation Toolkit for Java. 文本相似度计算工具包,java编写,可用于文本相似度计算、情感分析等任务,开箱即用。
eBay/Sequence-Semantic-Embedding
Tools and recipes to train deep learning models and build services for NLP tasks such as text...
RandolphVI/Text-Pairs-Relation-Classification
About Text Pairs (Sentence Level) Classification (Similarity Modeling) Based on Neural Network.
MartinoMensio/spacy-universal-sentence-encoder
Google USE (Universal Sentence Encoder) for spaCy
piotrmaciejbednarski/text-similarity-node
High-performance and memory efficient native C++ text similarity algorithms for Node.js