XiangpengHao/pq-vector
Vector search using only Parquet and DataFusion
pq-vector runs vector similarity search over large datasets. It takes a Parquet file containing vector embeddings as input, embeds an index in that file, and then quickly retrieves the vectors most similar to a given query. It suits backend or data engineers building applications that require real-time similarity search.
Use this if you are a Rust developer working with DataFusion and need to implement fast, scalable vector search directly on Parquet files without managing separate index services.
Not ideal if you need a high-level API for vector search and are not comfortable with Rust or DataFusion, or if your vectors are not stored in Parquet files.
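To make the retrieval step concrete, here is a minimal Rust sketch of what "the most similar vectors to a given query" means: an exhaustive top-k scan by squared Euclidean distance. This is not pq-vector's API or its index. The function names `squared_l2` and `top_k`, the choice of distance metric, and the in-memory `Vec<Vec<f32>>` layout are all assumptions made for illustration. An embedded index exists to avoid scanning every vector the way this baseline does.

```rust
/// Squared Euclidean distance between two equal-length vectors.
/// (Hypothetical helper; the metric is an assumption for this sketch.)
fn squared_l2(a: &[f32], b: &[f32]) -> f32 {
    a.iter().zip(b).map(|(x, y)| (x - y) * (x - y)).sum()
}

/// Return (row index, squared distance) for the `k` vectors closest to
/// `query`, nearest first. This is a brute-force scan over every vector.
fn top_k(vectors: &[Vec<f32>], query: &[f32], k: usize) -> Vec<(usize, f32)> {
    let mut scored: Vec<(usize, f32)> = vectors
        .iter()
        .enumerate()
        .map(|(i, v)| (i, squared_l2(v, query)))
        .collect();
    // total_cmp gives a total order on f32, so sorting never panics on NaN.
    scored.sort_by(|a, b| a.1.total_cmp(&b.1));
    scored.truncate(k);
    scored
}

fn main() {
    // Toy 2-dimensional "embeddings"; real ones would be read from Parquet.
    let vectors = vec![
        vec![0.0, 0.0],
        vec![1.0, 1.0],
        vec![5.0, 5.0],
        vec![0.9, 1.2],
    ];
    let query = [1.0, 1.0];
    for (idx, dist) in top_k(&vectors, &query, 2) {
        println!("row {idx}: squared distance {dist}");
    }
}
```

The scan costs time proportional to the number of vectors times their dimension on every query, which is the cost an index stored alongside the data is meant to reduce.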
Stars: 54
Forks: 3
Language: Rust
License: not listed
Category:
Last pushed: Feb 11, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/vector-db/XiangpengHao/pq-vector"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
Higher-rated alternatives
databendlabs/databend
Data Agent Ready Warehouse : One for Analytics, Search, AI, Python Sandbox. — rebuilt from...
oceanbase/oceanbase
The Fastest Distributed Database for Transactional, Analytical, and AI Workloads.
matrixorigin/matrixone
MySQL-compatible HTAP database with Git for Data, vector search, and fulltext search....
ArcadeData/arcadedb
ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB...
datalevin/datalevin
A simple, fast and versatile Datalog database