texttron/hyde

HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels

33
/ 100
Emerging

This project helps information retrieval specialists and researchers improve search accuracy for complex queries without needing to manually label documents for relevance. It takes a natural language query and generates a hypothetical document to help a search engine find more precise results, even in situations where no prior examples of relevant documents exist. The output is a significantly better set of search results for a given query.

573 stars. No commits in the last 6 months.

Use this if you need to perform highly accurate searches on large document collections, especially when dealing with new or specialized topics where human-labeled relevance judgments are scarce or impossible to obtain.

Not ideal if your search needs are simple and can be met with traditional keyword-based search or if you have ample labeled data for training a supervised retrieval model.

information-retrieval zero-shot-search document-search semantic-search content-discovery
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 15 / 25

How are scores calculated?

Stars

573

Forks

41

Language

Jupyter Notebook

License

Last pushed

Dec 06, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/texttron/hyde"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.