chatopera/insuranceqa-corpus-zh

:helicopter: Insurance industry corpus, chatbots

53 / 100 (Established)

This project provides a collection of insurance-related questions and answers in Chinese, sourced from real-world user inquiries and professional experts. The data comes in two forms: the raw translated QA pairs, and a pre-processed version with tokenization and stop-word removal already applied, suitable for direct use in machine learning pipelines. It is aimed at AI researchers, chatbot developers, and data scientists working on natural language processing in the insurance sector.

1,049 stars. No commits in the last 6 months.

Use this if you need a high-quality, domain-specific dataset of insurance questions and answers to train and evaluate natural language processing models, especially for chatbot development or answer selection tasks.

Not ideal if you are looking for a general-purpose language corpus outside the insurance domain or if you prefer to collect and annotate your data from scratch.

insurance chatbot-development natural-language-processing customer-service-automation question-answering
Stale 6m, No Package, No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25
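
As a quick arithmetic check, the four category scores listed above add up to the headline figure of 53 / 100. The sketch below only reproduces that sum. Whether the site actually derives the overall score as a plain sum of the four categories is an assumption, and the variable names are illustrative.

```python
# Category scores as listed on this page, each out of 25.
category_scores = {
    "Maintenance": 2,
    "Adoption": 10,
    "Maturity": 16,
    "Community": 25,
}

# Assumption: the overall score is the plain sum of the four categories.
total = sum(category_scores.values())
max_total = 25 * len(category_scores)

print(f"{total} / {max_total}")
```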

Stars: 1,049
Forks: 343
Language: Python
License:
Last pushed: May 26, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/chatopera/insuranceqa-corpus-zh"

Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
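
The same request as the curl command above can be made from Python using only the standard library. This is a minimal sketch: the helper names `build_quality_url` and `fetch_quality` are hypothetical, and because the response format is not documented on this page, the body is returned as plain text rather than parsed.

```python
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL in the same shape as the curl example."""
    parts = (category, owner, repo)
    return BASE_URL + "/" + "/".join(quote(part, safe="") for part in parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0) -> str:
    """Send a GET request and return the response body as text.

    Assumption: the body is UTF-8 text; its structure is not specified here.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling `fetch_quality("ml-frameworks", "chatopera", "insuranceqa-corpus-zh")` issues the same request as the curl command. No key is sent, so the 100 requests/day limit applies.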