chatopera/insuranceqa-corpus-zh
:helicopter: 保险行业语料库,聊天机器人
This project provides a comprehensive collection of insurance-related questions and high-quality answers in Chinese, sourced from real-world user inquiries and professional experts. It serves as a ready-to-use dataset for building intelligent systems. Data is provided in two forms: raw translated QA pairs and a pre-processed version with tokenization and stop-word removal, suitable for direct machine learning integration. This resource is ideal for AI researchers, chatbot developers, and data scientists working on natural language processing in the insurance sector.
1,049 stars. No commits in the last 6 months.
Use this if you need a high-quality, domain-specific dataset of insurance questions and answers to train and evaluate natural language processing models, especially for chatbot development or answer selection tasks.
Not ideal if you are looking for a general-purpose language corpus outside the insurance domain or if you prefer to collect and annotate your data from scratch.
Stars
1,049
Forks
343
Language
Python
License
—
Category
Last pushed
May 26, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/chatopera/insuranceqa-corpus-zh"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
deeppavlov/DeepPavlov
An open source library for deep learning end-to-end dialog systems and chatbots.
FreeBirdsCrew/AI_ChatBot_Python
AI ChatBot using Python Tensorflow and Natural Language Processing (NLP) along side TFLearn
pochih/RL-Chatbot
🤖 Deep Reinforcement Learning Chatbot
Conchylicultor/DeepQA
My tensorflow implementation of "A neural conversational model", a Deep learning based chatbot
RasaHQ/rasa_core
Rasa Core is now part of the Rasa repo: An open source machine learning framework to automate...