ModelEngine-Group/DataMate
DataMate is an enterprise-level data processing platform designed for model fine-tuning and RAG retrieval.
This platform helps AI/ML teams prepare raw data for use in large language models. It provides a visual, drag-and-drop interface for collecting, cleaning, synthesizing, and annotating data, then evaluating its quality. The output is high-quality, structured data ready for fine-tuning models or enhancing retrieval-augmented generation (RAG) systems. It's designed for data scientists, ML engineers, and data annotators.
333 stars.
Use this if you need an all-in-one solution to manage and process diverse datasets for training or improving large language models within an enterprise setting.
Not ideal if you are looking for a simple, lightweight tool for basic data manipulation or if your primary need is general-purpose analytics rather than AI model preparation.
Stars
333
Forks
40
Language
TypeScript
License
—
Category
Last pushed
Mar 13, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/ModelEngine-Group/DataMate"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
ItzCrazyKns/Vane
Vane is an AI-powered answering engine.
ConardLi/easy-dataset
A powerful tool for creating datasets for LLM fine-tuning 、RAG and Eval
xuwei95/ezdata
基于python和llm大模型开发的数据处理和任务调度系统。...
DS4SD/deepsearch-toolkit
Interact with the Deep Search platform for new knowledge explorations and discoveries
mithun50/TreeDex
Tree-based, vectorless document RAG framework. Connect any LLM via URL/API key.