CLUEbenchmark/CLUE

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

/ 100

Emerging

This project offers a comprehensive benchmark for evaluating and comparing the performance of Chinese language understanding models. It provides standardized datasets for various tasks, pre-trained models, and a public leaderboard. This helps researchers and AI developers assess the capabilities of different models on real-world Chinese text understanding challenges.

4,237 stars.

Use this if you are developing or researching AI models for Chinese language processing and need a standardized way to evaluate their performance on tasks like text classification, natural language inference, and reading comprehension.

Not ideal if you are looking for an off-the-shelf application to directly solve a business problem or if your primary focus is on languages other than Chinese.

Chinese-NLP AI-model-evaluation text-understanding natural-language-processing AI-research

No License No Package No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 21 / 25

How are scores calculated?

Stars

4,237

Forks

546

Language

Python

License

—

Higher-rated alternatives

shibing624/MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline....

lyogavin/airllm

AirLLM 70B inference with single 4GB GPU

GradientHQ/parallax

Parallax is a distributed model serving framework that lets you build your own AI cluster anywhere

CrazyBoyM/llama3-Chinese-chat

Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。

MediaBrain-SJTU/MING

明医 (MING)：中文医疗问诊大模型

Explore Transformer Models

All categories Trending Transformer directory Insights