osllmai/inDox
The Indox Ecosystem offers integrated AI tools for data workflows. Our four components (IndoxArcg, IndoxMiner, IndoxJudge, and IndoxGen) enhance AI applications with advanced retrieval, extraction, evaluation, and generation capabilities, supporting multiple document formats and LLM providers.
This suite of AI tools helps you manage your data workflows, especially when working with large language models (LLMs). It takes various document formats like PDFs and HTML, processes them, and then helps extract structured information, evaluate the performance of your AI models, or generate new, synthetic datasets. Data scientists, AI engineers, and researchers working with text and AI models would find this valuable.
Available on PyPI.
Use this if you need integrated tools to streamline the entire lifecycle of developing and deploying AI solutions involving document analysis and LLMs, from data preparation to model evaluation and data augmentation.
Not ideal if you are looking for a simple, single-purpose tool for a specific task like basic text summarization or a pre-trained model for direct use without custom data processing or evaluation needs.
Stars
19
Forks
2
Language
Jupyter Notebook
License
AGPL-3.0
Category
Last pushed
Feb 05, 2026
Commits (30d)
0
Dependencies
17
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/osllmai/inDox"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
joungminsung/OpenDocuments
Self-hosted open-source RAG platform that unifies organizational documents and answers natural...
PT-Perkasa-Pilar-Utama/ppu-pdf
Pdf utilities for text extraction in digital and convert scanned pdf into canvas.
pega2077/ai_file_manager
AIFileManager--AI based file manager. Auto tag,classify,rag your documents,images,videos
Harry-027/DocuMind
A document based RAG application
kbrisso/byte-vision
Byte-Vision is a privacy-first document intelligence platform that transforms static documents...