princeton-nlp/SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

58 / 100 (Established)

SimCSE measures how semantically similar pieces of text are, even when they use different words. Given sentences or short phrases, it produces numerical representations (embeddings) and similarity scores. This is useful for anyone who needs to group, retrieve, or compare text automatically, such as researchers analyzing survey responses or businesses categorizing customer feedback.
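To illustrate how embeddings turn into similarity scores, here is a minimal sketch using cosine similarity, a common choice for comparing sentence embeddings. The three-dimensional vectors are made-up stand-ins for real model output (real embeddings have hundreds of dimensions), and the helper names `cosine_similarity` and `rank_by_similarity` are hypothetical, not part of the SimCSE package.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Assumption: toy vectors invented for illustration only.
# In practice these would come from a sentence-embedding model.
toy_embeddings = {
    "How do I reset my password?": [0.9, 0.1, 0.0],
    "I forgot my login credentials.": [0.8, 0.2, 0.1],
    "What are your opening hours?": [0.0, 0.2, 0.9],
}


def rank_by_similarity(query, embeddings):
    """Rank every other sentence by its similarity to the query, best first."""
    query_vec = embeddings[query]
    scored = [
        (cosine_similarity(query_vec, vec), text)
        for text, vec in embeddings.items()
        if text != query
    ]
    return sorted(scored, reverse=True)


ranked = rank_by_similarity("How do I reset my password?", toy_embeddings)
for score, text in ranked:
    print(f"{score:.3f}  {text}")
```

With these toy vectors, the login-related sentence ranks above the opening-hours sentence even though it shares no keywords with the query, which is the behavior the description refers to.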

3,644 stars. Used by 1 other package. No commits in the last 6 months. Available on PyPI.

Use this if you need to quickly find or group similar sentences from a large collection of text, like identifying related customer inquiries or similar scientific abstracts.

Not ideal if you need to compare very long documents or need fine-grained analysis of grammatical structure rather than semantic meaning.

text-analysis information-retrieval customer-feedback content-categorization research-analysis
Stale 6m
Maintenance 0 / 25
Adoption 11 / 25
Maturity 25 / 25
Community 22 / 25


Stars: 3,644
Forks: 534
Language: Python
License: MIT
Last pushed: Oct 16, 2024
Commits (30d): 0
Reverse dependents: 1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/princeton-nlp/SimCSE"
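The same request can be made from Python with only the standard library. This is a sketch under assumptions: the URL pattern (`quality/<category>/<owner>/<repo>`) is inferred from the single example above, the response is assumed to be JSON, and the function names are hypothetical.

```python
import json
import urllib.request

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL (path layout inferred from the curl example)."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10):
    """Fetch the quality record; assumes the endpoint returns a JSON body."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "princeton-nlp", "SimCSE")` would issue the same GET request as the curl command. The response fields are not documented here, so inspect the returned object before relying on any key.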

Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.