hikariming/pindata

PinData is a modern, open-source dataset management platform designed specifically for large language model (LLM) training workflows.

30 / 100 Emerging

PinData helps organizations transform their diverse raw data, like documents and reports, into organized knowledge and high-quality datasets for AI applications. It takes various enterprise files and structured data as input, processing them to produce structured knowledge bases and ready-to-use training datasets. This platform is ideal for data managers, AI solution architects, researchers, and professional service providers who work with large volumes of enterprise information.

No commits in the last 6 months.

Use this if you need to unify, process, and structure large volumes of enterprise data (documents, reports, manuals) into a coherent knowledge base or high-quality training datasets for AI models.

Not ideal if your primary need is simply data storage without extensive processing, AI-driven structuring, or dataset generation for large language models.

enterprise-data-management AI-transformation knowledge-asset-creation research-data-management business-intelligence
No License · Stale 6m · No Package · No Dependents
Maintenance 2 / 25
Adoption 8 / 25
Maturity 7 / 25
Community 13 / 25


Stars 44
Forks 6
Language TypeScript
License none
Last pushed Jul 07, 2025
Commits (30d) 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/hikariming/pindata"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
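
The curl request above can also be issued from a script. The following is a minimal sketch using only the Python standard library. The helper names `quality_url` and `fetch_quality` are hypothetical, and the code assumes the endpoint returns a JSON body; the response schema is not documented here, so the result is returned as-is rather than unpacked into specific fields.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual names cannot break the URL.
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality data for a repository.

    Assumption: the response body is JSON. urlopen raises
    urllib.error.HTTPError on 4xx/5xx responses, which would include
    any error returned once the daily request limit is exceeded.
    """
    request = urllib.request.Request(
        quality_url(owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request):
# data = fetch_quality("hikariming", "pindata")
```

Because the keyless tier allows 100 requests per day, a script that polls many repositories would likely want to cache results locally rather than refetch on every run.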