ejaasaari/lemur

LEMUR reduces multi-vector retrieval for late interaction models such as ColBERT into regular single-vector retrieval.

/ 100

Emerging

LEMUR helps developers who are building search and retrieval systems to make them much faster. It takes collections of document embeddings and query embeddings, and outputs a ranked list of relevant documents. This is useful for anyone working with large text datasets who needs to quickly find the most relevant documents for a given query.

Use this if you are a developer building a search system that uses late interaction models like ColBERT and need to significantly speed up your retrieval process.

Not ideal if you do not have an AVX-512 compatible CPU or are not comfortable working with Python development tools.

information-retrieval search-engine-development natural-language-processing document-ranking

No Package No Dependents

Maintenance 10 / 25

Adoption 7 / 25

Maturity 11 / 25

Community 10 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Featured in

Embeddings Are Easier Than Whatever You're Doing Instead

Higher-rated alternatives

FlagOpen/FlagEmbedding

Retrieval and Retrieval-augmented LLMs

qdrant/fastembed

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

Blaizzy/mlx-embeddings

MLX-Embeddings is the best package for running Vision and Language Embedding models locally on...

Merck/Sapiens

Sapiens is a human antibody language model based on BERT.

amansrivastava17/embedding-as-service

One-Stop Solution to encode sentence to fixed length vectors from various embedding techniques

Explore Embedding Tools

All categories Trending Embeddings directory Insights