AstraBert/ingest-anything

From data to vector database effortlessly

Score: 50 / 100 (Established)

This tool prepares diverse, non-PDF files such as documents, code, or web content for use in AI applications. It takes raw files or URLs, converts them into embeddings, and stores those embeddings in a vector database, which is the foundation of semantic search and question-answering systems. It is designed for AI developers and data scientists who need to populate vector databases with many data types.
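To make the files-to-embeddings-to-vector-database flow concrete, here is a minimal, dependency-free sketch of the general idea. It is not ingest-anything's API: `chunk`, `embed`, and `InMemoryVectorStore` are hypothetical names, the hashed bag-of-words "embedding" is a toy stand-in for a real embedding model, and the in-memory list stands in for a real vector database.

```python
import hashlib
import math
from collections import Counter

DIM = 64  # toy embedding size, chosen arbitrarily for this sketch


def chunk(text: str, size: int = 40) -> list[str]:
    """Split text into chunks of at most `size` whitespace-separated words."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]


def embed(text: str) -> list[float]:
    """Toy hashed bag-of-words vector, L2-normalised (assumption: stands in for a real model)."""
    vec = [0.0] * DIM
    for word, count in Counter(text.lower().split()).items():
        idx = int(hashlib.md5(word.encode("utf-8")).hexdigest(), 16) % DIM
        vec[idx] += count
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


class InMemoryVectorStore:
    """Hypothetical stand-in for a vector database: stores (vector, text) rows."""

    def __init__(self) -> None:
        self.rows: list[tuple[list[float], str]] = []

    def add(self, text: str) -> None:
        # Ingestion step: chunk the raw text, embed each chunk, store it.
        for piece in chunk(text):
            self.rows.append((embed(piece), piece))

    def search(self, query: str, k: int = 1) -> list[str]:
        # Retrieval step: rank stored chunks by dot product with the query vector
        # (equal to cosine similarity here because all vectors are normalised).
        q = embed(query)
        ranked = sorted(
            self.rows,
            key=lambda row: -sum(a * b for a, b in zip(q, row[0])),
        )
        return [text for _, text in ranked[:k]]


if __name__ == "__main__":
    store = InMemoryVectorStore()
    store.add("cats purr and sleep all day")
    store.add("rust compilers check memory safety")
    print(store.search("cats purr and sleep all day", k=1))
```

In a real pipeline the same three steps apply, with a trained embedding model in place of `embed` and a persistent vector database in place of the list.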

No commits in the last 6 months. Available on PyPI.

Use this if you are building an AI application and need a streamlined way to get various types of data (beyond just PDFs and Markdown) into a vector database for tasks like retrieval-augmented generation (RAG).

Not ideal if you primarily work with existing PDFs or Markdown files, or if you don't need to use a vector database for your application.

Tags: AI development, vector database management, data preparation, LLM application development, information retrieval
Stale 6m
Maintenance 2 / 25
Adoption 9 / 25
Maturity 24 / 25
Community 15 / 25


Stars: 89
Forks: 12
Language: Python
License: MIT
Last pushed: May 17, 2025
Commits (30d): 0
Dependencies: 5

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/vector-db/AstraBert/ingest-anything"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
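The same request can be made from Python with only the standard library. This is a sketch under two assumptions that the page does not confirm: that the path follows the pattern `quality/<category>/<owner>/<repo>` (inferred from the single curl example above), and that the endpoint returns JSON. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (assumed pattern: <category>/<owner>/<repo>)."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository, assuming the body is JSON."""
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Counts against the 100 requests/day keyless limit.
    data = fetch_quality("vector-db", "AstraBert", "ingest-anything")
    print(json.dumps(data, indent=2))
```

Because the keyless tier is limited to 100 requests/day, caching the response locally is worth considering if you poll more than a few repositories.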