[画像:@CLUEbenchmark]

CLUE benchmark

Organization of Language Understanding Evaluation benchmark for Chinese: tasks & datasets, baselines, pre-trained Chinese models, corpus and leaderboard

Pinned Loading

CLUE CLUE Public

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 4.2k 547
SuperCLUE SuperCLUE Public

SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

3.3k 112
SuperCLUE-Safety SuperCLUE-Safety Public

SC-Safety: 中文大模型多轮对抗安全基准

144 12
SuperCLUE-Auto SuperCLUE-Auto Public

汽车行业中文大模型测评基准,基于多轮开放式问题的细粒度评测

37 4
SuperCLUE-Agent SuperCLUE-Agent Public

SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评基准

92 6
SuperCLUE-RAG SuperCLUE-RAG Public

中文原生检索增强生成测评基准

123 4

People

@vikotse @joytianya @brightmart @ydli-ai

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CLUE benchmark

Pinned Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!