CLUEbenchmark/QBQTC

QBQTC: 大规模搜索匹配数据集

29
/ 100
Experimental

This dataset helps search engine developers evaluate how well their search algorithms match user queries to relevant page titles. You input pairs of search queries and titles, and the dataset provides a label indicating how relevant the title is to the query (from 'poor' to 'very relevant'). It's designed for engineers who are building and improving search engines or similar information retrieval systems.

No commits in the last 6 months.

Use this if you are a search engine engineer or data scientist looking for a large-scale, high-quality dataset to train and benchmark learning-to-rank models for query-title relevance.

Not ideal if you need to build a search engine from scratch or are looking for a dataset in a domain other than general web search.

search-engine-optimization information-retrieval query-relevance machine-learning-engineering ranking-systems
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 8 / 25
Community 12 / 25

How are scores calculated?

Stars

86

Forks

9

Language

Python

License

Last pushed

Dec 12, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/CLUEbenchmark/QBQTC"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.