CLUEbenchmark/QBQTC

QBQTC: A Large-Scale Search Matching Dataset

Experimental

This dataset helps search engine developers evaluate how well their ranking algorithms match user queries to relevant page titles. Each example pairs a search query with a page title and carries a label rating how relevant the title is to the query, on a scale from 'poor' to 'very relevant'. It is designed for engineers who build and improve search engines or similar information retrieval systems.
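As a rough illustration of the query, title, and label structure described above, the sketch below parses a few records into tuples. The JSON Lines layout, the field names (`query`, `title`, `label`), the integer label scale, and the sample rows are all assumptions made for illustration; check the repository's actual files before relying on them.

```python
import json
from collections import Counter

# Hypothetical sample in JSON Lines form. The field names and the integer
# label scale are assumptions, not taken from the dataset's real files.
SAMPLE = """\
{"query": "python read csv", "title": "How to read a CSV file in Python", "label": 2}
{"query": "python read csv", "title": "CSV file format overview", "label": 1}
{"query": "python read csv", "title": "Best hiking trails near Seattle", "label": 0}
"""


def parse_pairs(text):
    """Parse JSON Lines text into (query, title, label) tuples."""
    pairs = []
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        rec = json.loads(line)
        pairs.append((rec["query"], rec["title"], int(rec["label"])))
    return pairs


pairs = parse_pairs(SAMPLE)
# Count how many examples fall under each relevance label.
label_counts = Counter(label for _, _, label in pairs)
```

Inspecting `label_counts` on the real data is a quick way to see how balanced the relevance grades are before training anything.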

No commits in the last 6 months.

Use this if you are a search engine engineer or data scientist looking for a large-scale, high-quality dataset to train and benchmark learning-to-rank models for query-title relevance.
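To make the benchmarking use case concrete, here is a minimal sketch that scores a toy ranker with NDCG, a common ranking metric. NDCG is chosen here for illustration and is not necessarily the metric the dataset's authors report; the `overlap_score` baseline, the tuple layout, and the sample rows are hypothetical.

```python
import math
from collections import defaultdict


def dcg(labels):
    """Discounted cumulative gain for labels listed in ranked order."""
    return sum((2 ** rel - 1) / math.log2(rank + 2) for rank, rel in enumerate(labels))


def ndcg(labels_in_ranked_order):
    """DCG normalized by the DCG of the ideal (label-sorted) ordering."""
    ideal = dcg(sorted(labels_in_ranked_order, reverse=True))
    return dcg(labels_in_ranked_order) / ideal if ideal > 0 else 0.0


def overlap_score(query, title):
    """Toy baseline scorer: fraction of query words found in the title."""
    q = set(query.lower().split())
    t = set(title.lower().split())
    return len(q & t) / len(q) if q else 0.0


def mean_ndcg(examples, score_fn):
    """Group (query, title, label) examples by query and average NDCG."""
    by_query = defaultdict(list)
    for query, title, label in examples:
        by_query[query].append((score_fn(query, title), label))
    values = []
    for scored in by_query.values():
        # Rank titles by descending model score, then read off their labels.
        ranked = [label for _, label in sorted(scored, key=lambda p: p[0], reverse=True)]
        values.append(ndcg(ranked))
    return sum(values) / len(values) if values else 0.0


# Hypothetical examples; real data would come from the dataset files.
examples = [
    ("python read csv", "How to read a CSV file in Python", 2),
    ("python read csv", "CSV file format overview", 1),
    ("python read csv", "Best hiking trails near Seattle", 0),
]
score = mean_ndcg(examples, overlap_score)
```

Swapping `overlap_score` for a trained model's scoring function keeps the evaluation loop unchanged, which makes it easy to compare a learned ranker against a simple lexical baseline.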

Not ideal if you need to build a search engine from scratch or are looking for a dataset in a domain other than general web search.

search-engine-optimization information-retrieval query-relevance machine-learning-engineering ranking-systems

No License · Stale 6m · No Package · No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 8 / 25

Community 12 / 25

Language

Python

License

—

Higher-rated alternatives

patrickfrank1/chesspos

Embedding based chess position search and embedding learning for chess positions

alexklibisz/elastiknn

Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using...

freedmand/semantra

Multi-tool for semantic search

koursaros-ai/nboost

NBoost is a scalable, search-api-boosting platform for deploying transformer models to improve...

Mubelotix/SimRepo

Shows similar repositories in the sidebar
