AstraBert/ingest-anything

From data to vector database effortlessly

/ 100

Established

This tool helps you prepare diverse, non-PDF files like documents, code, or web content for use in AI applications. It takes your raw files or URLs and transforms them into a structured format (embeddings) stored in a vector database, which is crucial for building powerful search or question-answering systems. It's designed for AI developers or data scientists who need to easily populate vector databases with various data types.

No commits in the last 6 months. Available on PyPI.

Use this if you are building an AI application and need a streamlined way to get various types of data—beyond just PDFs and Markdown—into a vector database for tasks like RAG.

Not ideal if you primarily work with existing PDFs or Markdown files, or if you don't need to use a vector database for your application.

AI development vector database management data preparation LLM application development information retrieval

Stale 6m

Maintenance 2 / 25

Adoption 9 / 25

Maturity 24 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Related tools

pixeltable/pixeltable

Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.

activeloopai/deeplake

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store,...

superlinked/VectorHub

VectorHub is a free, open-source learning website for people (software developers to senior ML...

hhblaze/DBreeze

C# .NET NOSQL ( key value, object store embedded TextSearch SemanticSearch Vector layer ) ACID...

TileDB-Inc/TileDB-Vector-Search

Cloud-native vector similarity search and storage with efficient, serverless scale-out

Explore Vector Databases

All categories Trending Vector Database directory Insights