thunlp/UltraChat
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
UltraChat provides a vast collection of human-like multi-round dialogues, perfect for training advanced conversational AI. It takes raw text data and transforms it into structured, diverse conversations that can be used to teach AI models how to interact more naturally. This is primarily useful for AI researchers and developers focused on building and improving large language models capable of engaging in extended, nuanced discussions.
2,802 stars. No commits in the last 6 months.
Use this if you need a high-quality, extensive dataset of multi-turn conversations to train or fine-tune large language models for more engaging and informative chat interactions.
Not ideal if you are looking for a simple, out-of-the-box chatbot for immediate deployment without any model training or development.
Stars
2,802
Forks
135
Language
Python
License
MIT
Category
Last pushed
Mar 13, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/thunlp/UltraChat"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
openai/openai-cookbook
Examples and guides for using the OpenAI API
rgbkrk/dangermode
Execute IPython & Jupyter from the comforts of chat.openai.com
CogStack/OpenGPT
A framework for creating grounded instruction based datasets and training conversational domain...
Declipsonator/GPTZzzs
Large language model detection evasion through grammar and vocabulary modifcation.
flypythoncom/python
python is all you need !