OpenMatch/COCO-DR
[EMNLP 2022] This is the code repo for our EMNLP'22 paper "COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning".
This project helps improve the accuracy of searching for relevant documents or passages, even when the query data is very different from the data used to train the search model. It takes a collection of documents and queries, and outputs a more robust search model that can find better matches across various types of text. This is designed for anyone building advanced search or recommendation systems who needs precise results without extensive fine-tuning.
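To make the retrieval step concrete, here is a minimal sketch of how a dense retriever typically ranks passages: queries and passages are mapped to vectors, and passages are ordered by inner product with the query vector. This is a generic illustration, not COCO-DR's own code. The three-dimensional embeddings are invented toy values, and `rank_passages` is a hypothetical helper name; in a real system the vectors would come from a trained encoder.

```python
from typing import Dict, List, Tuple


def dot(u: List[float], v: List[float]) -> float:
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def rank_passages(
    query_vec: List[float],
    passage_vecs: Dict[str, List[float]],
    top_k: int = 3,
) -> List[Tuple[str, float]]:
    """Score every passage against the query and return the top_k, best first."""
    scored = [(pid, dot(query_vec, vec)) for pid, vec in passage_vecs.items()]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:top_k]


# Toy embeddings (assumed values, not produced by any model).
passages = {
    "p1": [0.9, 0.1, 0.0],
    "p2": [0.1, 0.8, 0.1],
    "p3": [0.0, 0.2, 0.9],
}
query = [1.0, 0.0, 0.0]

ranking = rank_passages(query, passages, top_k=2)
for passage_id, score in ranking:
    print(passage_id, score)
```

A model that is robust to distribution shift aims to keep this ranking useful even when the passages and queries come from a domain the encoder was not trained on.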
No commits in the last 6 months.
Use this if you need to build a text search system that performs well on diverse or unexpected document collections without needing to retrain the model for each new domain.
Not ideal if your search needs are basic and don't involve significant shifts in the type of text being queried or searched.
Stars: 50
Forks: 4
Language: Python
License: MIT
Category:
Last pushed: Oct 12, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/OpenMatch/COCO-DR"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
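The same request can be made from Python with only the standard library. The sketch below assumes the endpoint returns JSON, which the listing does not state, and it assumes the path segments mean domain, owner, and repository, which is inferred from the curl URL above. `quality_url` and `fetch_quality` are hypothetical helper names.

```python
import json
import urllib.request

# Base path copied from the curl example; anything beyond it is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(domain: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository."""
    return f"{BASE_URL}/{domain}/{owner}/{repo}"


def fetch_quality(domain: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository (assumes a JSON response body)."""
    with urllib.request.urlopen(quality_url(domain, owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


url = quality_url("nlp", "OpenMatch", "COCO-DR")
print(url)
# To perform the network call (counts against the daily request limit):
# data = fetch_quality("nlp", "OpenMatch", "COCO-DR")
```

The network call is left commented out so the snippet runs offline; uncomment it to retrieve the data.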
Higher-rated alternatives
princeton-nlp/SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
n-waves/multifit
The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model...
yxuansu/SimCTG
[NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation
alibaba-edu/simple-effective-text-matching
Source code of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".
Shark-NLP/OpenICL
OpenICL is an open-source framework to facilitate research, development, and prototyping of...