princeton-nlp/SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
This tool helps you understand how semantically similar different pieces of text are, even if they use different words. You input sentences or short phrases, and it outputs numerical representations (embeddings) and similarity scores. This is useful for anyone who needs to automatically group, retrieve, or compare text, such as researchers analyzing surveys or businesses categorizing customer feedback.
3,644 stars. Used by 1 other package. No commits in the last 6 months. Available on PyPI.
Use this if you need to quickly find or group similar sentences from a large collection of text, like identifying related customer inquiries or similar scientific abstracts.
Not ideal if you need to analyze relationships between very long documents or require fine-grained analysis of grammatical structure beyond semantic meaning.
Stars
3,644
Forks
534
Language
Python
License
MIT
Category
Last pushed
Oct 16, 2024
Commits (30d)
0
Reverse dependents
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/princeton-nlp/SimCSE"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
n-waves/multifit
The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model...
yxuansu/SimCTG
[NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation
alibaba-edu/simple-effective-text-matching
Source code of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".
Shark-NLP/OpenICL
OpenICL is an open-source framework to facilitate research, development, and prototyping of...
alibaba-edu/simple-effective-text-matching-pytorch
A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer...