alibaba/SimCSE-with-CARDS

Source code for SIGIR 2022 paper.

Experimental

This project helps machine learning engineers and researchers improve sentence embeddings for natural language processing tasks. During model training it adds 'case-switched' positive examples and carefully selected 'hard negative' examples, so that the resulting embeddings better reflect sentence meaning. The result is better performance on tasks such as semantic similarity and natural language inference.
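The two ideas named above can be illustrated with a minimal, standard-library sketch. It is not the repository's code: the function names, the rule of flipping only the first letter of a word, the probability `p`, and the temperature of 0.05 are all assumptions chosen for illustration, and the loss is a generic InfoNCE-style contrastive loss over plain Python lists rather than a real encoder.

```python
import math
import random


def switch_first_letter_case(sentence, p=0.5, rng=None):
    """Hypothetical augmentation: flip the case of the first letter of
    each word with probability p, leaving the meaning unchanged."""
    rng = rng or random.Random()
    words = []
    for word in sentence.split():
        if word[0].isalpha() and rng.random() < p:
            word = word[0].swapcase() + word[1:]
        words.append(word)
    return " ".join(words)


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def contrastive_loss(anchor, positive, negatives, temperature=0.05):
    """InfoNCE-style loss: small when the anchor is close to the positive
    and far from every negative."""
    logits = [cosine(anchor, positive) / temperature]
    logits += [cosine(anchor, n) / temperature for n in negatives]
    max_logit = max(logits)  # subtract the max for numerical stability
    log_sum = max_logit + math.log(sum(math.exp(x - max_logit) for x in logits))
    return log_sum - logits[0]


if __name__ == "__main__":
    # The augmented sentence serves as a positive for the original one.
    print(switch_first_letter_case("the cat sat on the mat", rng=random.Random(0)))
    # A negative that lies close to the anchor yields a larger loss than a
    # distant one, which is why hard negatives give a stronger training signal.
    anchor = [1.0, 0.0]
    print(contrastive_loss(anchor, [1.0, 0.0], [[-1.0, 0.0]]))
    print(contrastive_loss(anchor, [1.0, 0.0], [[0.9, 0.1]]))
```

In a real training setup the vectors would come from the sentence encoder, and the hard negatives would be chosen by whatever selection procedure the project uses; the sketch only shows why both kinds of example change the loss.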

No commits in the last 6 months.

Use this if you are a machine learning engineer or researcher looking to build more robust and accurate sentence embedding models for natural language processing applications.

Not ideal if you are an end-user without a technical background in machine learning and NLP, as this is a developer-focused tool for model training.

natural-language-processing machine-learning-engineering text-analytics semantic-search model-training

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 5 / 25

Language: Python

License: Apache-2.0

Higher-rated alternatives

princeton-nlp/SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

n-waves/multifit

The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model...

yxuansu/SimCTG

[NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation

alibaba-edu/simple-effective-text-matching

Source code of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".

Shark-NLP/OpenICL

OpenICL is an open-source framework to facilitate research, development, and prototyping of...
