Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

CLUE benchmark

Organization of Language Understanding Evaluation benchmark for Chinese: tasks & datasets, baselines, pre-trained Chinese models, corpus and leaderboard

Pinned Loading

  1. CLUE CLUE Public

    中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

    Python 4.2k 547

  2. SuperCLUE SuperCLUE Public

    SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

    3.3k 112

  3. SuperCLUE-Safety SuperCLUE-Safety Public

    SC-Safety: 中文大模型多轮对抗安全基准

    144 12

  4. SuperCLUE-Auto SuperCLUE-Auto Public

    汽车行业中文大模型测评基准,基于多轮开放式问题的细粒度评测

    37 4

  5. SuperCLUE-Agent SuperCLUE-Agent Public

    SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评基准

    92 6

  6. SuperCLUE-RAG SuperCLUE-RAG Public

    中文原生检索增强生成测评基准

    123 4

Repositories

Loading
Type
Select type
Language
Select language
Sort
Select order
Showing 10 of 52 repositories

Top languages

Loading...

Most used topics

Loading...

AltStyle によって変換されたページ (->オリジナル) /