Michael-JB/bm25
A BM25 embedder, scorer, and search engine, written in Rust.
This project helps you build a keyword search engine or relevance scorer for text documents. You provide a collection of documents, and it processes them to create numerical representations, allowing you to quickly find documents most relevant to a specific search query. It's designed for software developers who need to integrate efficient text search capabilities into their applications.
56 stars and 291,616 monthly downloads.
Use this if you are a developer building an application that needs fast, in-memory keyword search or document-to-query relevance scoring.
Not ideal if you need a full-fledged, out-of-the-box search solution for end-users without writing code, or if you need to rank results based on semantic meaning beyond keywords.
Stars
56
Forks
7
Language
Rust
License
MIT
Category
Last pushed
Mar 09, 2026
Monthly downloads
291,616
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/Michael-JB/bm25"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
jeanCarloMachado/PythonSearch
A minimalistic search engine for productivity that stores documents as code
neuml/codequestion
🔎 Semantic search for developers
chnsh/deep-semantic-code-search
Deep Semantic Code Search aims to explore a joint embedding space for code and description...
aws-samples/tabular-column-semantic-search
Code accompanying AWS blog post "Build a Semantic Search Engine for Tabular Columns with...
AstraBert/SenTrEv
Simple customizable evaluation for text retrieval performance of Sentence Transformers embedders on PDFs