ocramz/ncd-tree

text similarity search trees based on Normalized Compression Distance

Emerging

This Haskell library is for developers who need to measure how similar pieces of text or other data sequences are. It builds an index from a collection of documents or data, then lets you query that index to quickly find the items most similar to a given input. Similarity is based on Normalized Compression Distance, which compares how well two items compress together versus separately, so the library suits applications that compare data by its underlying structure without interpreting the content itself.
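To make the idea concrete, here is a minimal sketch of Normalized Compression Distance itself, NCD(x, y) = (C(xy) - min(C(x), C(y))) / max(C(x), C(y)), where C is a compressed size. It is not the ncd-tree API: the names `compressedSize`, `ncd`, and `nearest` are hypothetical, gzip (from the `zlib` package) is an assumed choice of compressor, and the query is a brute-force linear scan rather than a tree search.

```haskell
import qualified Codec.Compression.GZip as GZip        -- assumption: the `zlib` package
import qualified Data.ByteString.Lazy.Char8 as BL
import Data.List (sortOn)

-- Compressed size in bytes, used as a computable stand-in for
-- the information content of the input.
compressedSize :: BL.ByteString -> Double
compressedSize = fromIntegral . BL.length . GZip.compress

-- NCD(x, y) = (C(xy) - min(C(x), C(y))) / max(C(x), C(y))
-- Values near 0 suggest similar inputs; values near 1 suggest unrelated ones.
ncd :: BL.ByteString -> BL.ByteString -> Double
ncd x y = (cxy - min cx cy) / max cx cy
  where
    cx  = compressedSize x
    cy  = compressedSize y
    cxy = compressedSize (x <> y)

-- Hypothetical helper: the k documents closest to the query,
-- found by scanning every document (no index involved).
nearest :: Int -> BL.ByteString -> [BL.ByteString] -> [BL.ByteString]
nearest k query = take k . sortOn (ncd query)

main :: IO ()
main = do
  let docs = map BL.pack
        [ "the quick brown fox jumps over the lazy dog"
        , "the quick brown fox leaps over the lazy cat"
        , "import qualified Data.Map as Map"
        ]
  -- Print the two documents ranked closest to the query.
  mapM_ BL.putStrLn (nearest 2 (BL.pack "a quick brown fox jumps over a dog") docs)
```

A linear scan like this costs one compression per stored document for every query, which is the cost an index is meant to reduce. On very short inputs the compressor's fixed header overhead dominates, so the distances are only meaningful for reasonably long documents.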

Use this if you are a Haskell developer building an application that needs to quickly find similar text snippets, code fragments, or data sequences without extensive feature engineering.

Not ideal if you do not work in Haskell, or if your application requires a precise, exhaustive search rather than an approximate one.

text-similarity sequence-matching information-retrieval data-indexing haskell-development

No Package No Dependents

Maintenance 6 / 25

Adoption 5 / 25

Maturity 13 / 25

Community 7 / 25

Language

Haskell

License

BSD-3-Clause

Higher-rated alternatives

shibing624/similarity

similarity: a text similarity calculation toolkit written in Java. It can be used for tasks such as text similarity calculation and sentiment analysis, and works out of the box.

eBay/Sequence-Semantic-Embedding

Tools and recipes to train deep learning models and build services for NLP tasks such as text...

RandolphVI/Text-Pairs-Relation-Classification

About Text Pairs (Sentence Level) Classification (Similarity Modeling) Based on Neural Network.

MartinoMensio/spacy-universal-sentence-encoder

Google USE (Universal Sentence Encoder) for spaCy

piotrmaciejbednarski/text-similarity-node

High-performance and memory efficient native C++ text similarity algorithms for Node.js
