MigoXLab/dingo
Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool
Dingo helps AI developers and data engineers ensure the quality of their AI training data and deployed AI systems. It takes various datasets, including LLM training data and RAG system inputs, and identifies issues like low quality text, hallucinations, and problematic special characters. The output is a detailed quality report, allowing users to improve their AI's performance.
658 stars. Actively maintained with 30 commits in the last 30 days.
Use this if you need to systematically assess and improve the quality of your machine learning datasets, LLM fine-tuning data, or the outputs of your Retrieval-Augmented Generation (RAG) systems.
Not ideal if you are looking for a no-code solution and prefer a visual interface without any programming, as the open-source version primarily uses code.
Stars
658
Forks
67
Language
JavaScript
License
Apache-2.0
Category
Last pushed
Mar 13, 2026
Commits (30d)
30
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/MigoXLab/dingo"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
NVIDIA-NeMo/Curator
Scalable data pre processing and curation toolkit for LLMs
data-prep-kit/data-prep-kit
Open source project for data preparation for GenAI applications
TheDataStation/pneuma
LLM-Powered Data Discovery System for Tabular Data
cleanlab/cleanlab-studio
Client interface to Cleanlab Studio
jpmorganchase/CodeQuest
CodeQUEST is a generalizable framework which leverages LLMs to iteratively evaluate and enhance...