alibaba/SimCSE-with-CARDS

Source code for SIGIR 2022 paper.

27
/ 100
Experimental

This project helps machine learning engineers and researchers improve the quality of sentence embeddings for natural language processing tasks. By applying 'case-switched' positive examples and carefully selected 'hard negative' examples during model training, it enhances how well models understand the meaning of sentences. The result is better performance on tasks like semantic similarity and natural language inference, benefiting anyone building or evaluating NLP systems.

No commits in the last 6 months.

Use this if you are a machine learning engineer or researcher looking to build more robust and accurate sentence embedding models for natural language processing applications.

Not ideal if you are an end-user without a technical background in machine learning and NLP, as this is a developer-focused tool for model training.

natural-language-processing machine-learning-engineering text-analytics semantic-search model-training
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 6 / 25
Maturity 16 / 25
Community 5 / 25

How are scores calculated?

Stars

16

Forks

1

Language

Python

License

Apache-2.0

Last pushed

Apr 25, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/alibaba/SimCSE-with-CARDS"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.