CLUEbenchmark/SuperCLUE

SuperCLUE: A Comprehensive Benchmark for Chinese General-Purpose Large Models | A Benchmark for Foundation Models in Chinese

Score: 44 / 100 (Emerging)

Need to understand how well large language models (LLMs) perform on tasks relevant to the Chinese language and culture? SuperCLUE provides comprehensive evaluations across key capabilities like language understanding, professional knowledge, and AI agent performance. It helps compare different Chinese LLMs and understand their strengths and weaknesses, offering a ranked list of models and detailed breakdowns of their abilities.


Use this if you need to compare or select Chinese large language models based on their performance across a wide range of practical applications and specialized skills.

Not ideal if you need evaluations of non-Chinese language models, or highly specialized, niche technical benchmarks unrelated to general LLM capabilities.

AI-model-evaluation Chinese-language-AI LLM-benchmarking natural-language-processing AI-agent-performance
No License | No Package | No Dependents
Maintenance 10 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 16 / 25


Stars: 3,277
Forks: 112
Language:
License: No License
Last pushed: Feb 06, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/CLUEbenchmark/SuperCLUE"

Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000 requests/day.
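The same endpoint can be called from Python instead of curl. A minimal sketch using only the standard library, assuming the URL pattern `.../quality/llm-tools/{owner}/{repo}` shown above; the shape of the JSON response is not documented here, so the fetch helper simply returns the parsed body as-is:

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base URL taken from the curl example above.
BASE = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def quality_url(owner: str, repo: str) -> str:
    """Build the quality endpoint URL for an owner/repo pair (assumed pattern)."""
    return f"{BASE}/{quote(owner)}/{quote(repo)}"


def fetch_quality(owner: str, repo: str) -> dict:
    """Fetch the quality record as parsed JSON; response schema is an assumption."""
    with urlopen(quality_url(owner, repo), timeout=10) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Prints the same URL the curl example targets.
    print(quality_url("CLUEbenchmark", "SuperCLUE"))
```

Within the free 100 requests/day tier this needs no authentication; a key, if obtained, would presumably be passed as a header or query parameter per the service's own docs.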