princeton-nlp/SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

/ 100

Established

This tool helps you understand how semantically similar different pieces of text are, even if they use different words. You input sentences or short phrases, and it outputs numerical representations (embeddings) and similarity scores. This is useful for anyone who needs to automatically group, retrieve, or compare text, such as researchers analyzing surveys or businesses categorizing customer feedback.

3,644 stars. Used by 1 other package. No commits in the last 6 months. Available on PyPI.

Use this if you need to quickly find or group similar sentences from a large collection of text, like identifying related customer inquiries or similar scientific abstracts.

Not ideal if you need to analyze relationships between very long documents or require fine-grained analysis of grammatical structure beyond semantic meaning.

text-analysis information-retrieval customer-feedback content-categorization research-analysis

Stale 6m

Maintenance 0 / 25

Adoption 11 / 25

Maturity 25 / 25

Community 22 / 25

How are scores calculated?

Stars

3,644

Forks

534

Language

Python

License

MIT

Compare

SimCSE and RankCSE

Related tools

n-waves/multifit

The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model...

yxuansu/SimCTG

[NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation

alibaba-edu/simple-effective-text-matching

Source code of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".

Shark-NLP/OpenICL

OpenICL is an open-source framework to facilitate research, development, and prototyping of...

alibaba-edu/simple-effective-text-matching-pytorch

A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer...

Explore NLP Tools

All categories Trending NLP directory Insights