CLUEbenchmark/QBQTC
QBQTC: 大规模搜索匹配数据集
This dataset helps search engine developers evaluate how well their search algorithms match user queries to relevant page titles. You input pairs of search queries and titles, and the dataset provides a label indicating how relevant the title is to the query (from 'poor' to 'very relevant'). It's designed for engineers who are building and improving search engines or similar information retrieval systems.
No commits in the last 6 months.
Use this if you are a search engine engineer or data scientist looking for a large-scale, high-quality dataset to train and benchmark learning-to-rank models for query-title relevance.
Not ideal if you need to build a search engine from scratch or are looking for a dataset in a domain other than general web search.
Stars
86
Forks
9
Language
Python
License
—
Category
Last pushed
Dec 12, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/CLUEbenchmark/QBQTC"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
patrickfrank1/chesspos
Embedding based chess position search and embedding learning for chess positions
alexklibisz/elastiknn
Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using...
freedmand/semantra
Multi-tool for semantic search
koursaros-ai/nboost
NBoost is a scalable, search-api-boosting platform for deploying transformer models to improve...
Mubelotix/SimRepo
Shows similar repositories in the sidebar