activeloopai/deeplake

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

65
/ 100
Established

This tool helps AI engineers and data scientists manage, store, and query the vast amounts of unstructured data (like images, videos, audio, and text) needed for training AI models or building LLM applications. It takes in raw AI data and outputs structured, searchable datasets that can be streamed directly to machine learning frameworks. It's designed for anyone working with large, diverse AI datasets who needs efficient data handling.

9,033 stars. Used by 1 other package. Available on PyPI.

Use this if you need a scalable, efficient way to store, version, query, and stream diverse AI data (like embeddings, images, or video) for training deep learning models or powering LLM-based applications.

Not ideal if your primary need is for a traditional relational database for structured tabular data without deep learning or LLM application requirements.

AI data management machine learning engineering LLM application development deep learning training data versioning
Maintenance 10 / 25
Adoption 11 / 25
Maturity 25 / 25
Community 19 / 25

How are scores calculated?

Stars

9,033

Forks

709

Language

C++

License

Apache-2.0

Last pushed

Feb 16, 2026

Commits (30d)

0

Dependencies

3

Reverse dependents

1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/vector-db/activeloopai/deeplake"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.