thunlp/UltraChat

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Score: 43 / 100 (Emerging)

UltraChat provides a large collection of human-like, multi-round dialogues for training conversational AI. It turns raw text data into structured, diverse conversations that can be used to teach AI models to interact more naturally. It is primarily useful to AI researchers and developers who build and improve large language models meant to sustain extended, nuanced discussions.

2,802 stars. No commits in the last 6 months.

Use this if you need a high-quality, extensive dataset of multi-turn conversations to train or fine-tune large language models for more engaging and informative chat interactions.

Not ideal if you are looking for a simple, out-of-the-box chatbot for immediate deployment without any model training or development.

conversational-ai natural-language-processing large-language-models ai-model-training
Stale (6m), No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 17 / 25
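
The four category scores above add up to the headline 43 / 100. The short check below is a sketch that assumes the overall score is a plain sum of the four categories, each capped at 25; the listing itself does not state the formula, so treat that as an assumption rather than the documented method.

```python
# Category scores as listed above, each out of 25.
category_scores = {
    "Maintenance": 0,
    "Adoption": 10,
    "Maturity": 16,
    "Community": 17,
}

# Assumption: the overall score is the unweighted sum of the categories.
total = sum(category_scores.values())
max_total = 25 * len(category_scores)

print(f"{total} / {max_total}")
```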


Stars: 2,802
Forks: 135
Language: Python
License: MIT
Last pushed: Mar 13, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/thunlp/UltraChat"

Open to everyone: 100 requests/day with no key needed, or 1,000 requests/day with a free key.
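
For scripted use, the same request can be made from Python with only the standard library. This is a minimal sketch: the URL is copied from the curl command above, while the helper names `quality_url` and `fetch_quality` are hypothetical, the idea that other `owner/repo` pairs follow the same path pattern is an assumption, and so is the expectation that the response body is JSON. No API-key handling is shown because the listing does not say how a key is passed.

```python
import json
import urllib.request

# Base URL taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def quality_url(owner: str, repo: str) -> str:
    """Build the report URL for an owner/repo pair (path pattern assumed)."""
    return f"{BASE_URL}/{owner}/{repo}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """GET the report and decode it, assuming the body is JSON."""
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call (performs a real network request and counts against
# the daily request limit):
# report = fetch_quality("thunlp", "UltraChat")
# print(json.dumps(report, indent=2))
```

The example call is left commented out so that importing or pasting the snippet does not spend one of the 100 keyless daily requests.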