amikos-tech/chromadb-data-pipes
ChromaDB Data Pipes 🖇️ - The easiest way to get data into and out of ChromaDB
This project helps you move information in and out of your ChromaDB database easily. You can bring in data from local files like PDFs, Hugging Face datasets, or even other ChromaDB instances, and it outputs organized data within ChromaDB or exported to files and Hugging Face. This tool is designed for anyone managing large text-based datasets in ChromaDB, especially those building or maintaining applications like chatbots or search engines.
No commits in the last 6 months. Available on PyPI.
Use this if you need to transfer large datasets, re-embed existing documents, back up your collections, or share data with others who use ChromaDB.
Not ideal if you are working with multimodal data (like images or audio) or if your primary need is complex real-time data streaming.
Stars
20
Forks
3
Language
Python
License
MIT
Category
Last pushed
Oct 22, 2024
Commits (30d)
0
Dependencies
14
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/amikos-tech/chromadb-data-pipes"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
treeverse/dvc
🦉 Data Versioning and ML Experiments
runpod/runpod-python
🐍 | Python library for RunPod API and serverless worker SDK.
microsoft/vscode-jupyter
VS Code Jupyter extension
4paradigm/OpenMLDB
OpenMLDB is an open-source machine learning database that provides a feature platform computing...
uber/petastorm
Petastorm library enables single machine or distributed training and evaluation of deep learning...