CLUE benchmark

Organization for the Chinese Language Understanding Evaluation (CLUE) benchmark: tasks and datasets, baselines, pre-trained Chinese models, corpus, and leaderboard

Pinned

  1. CLUE Public

    Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

    Python · 4.2k stars · 547 forks

  2. SuperCLUE Public

    SuperCLUE: A Comprehensive Benchmark for General-Purpose Foundation Models in Chinese

    3.3k stars · 112 forks

  3. SuperCLUE-Safety Public

    SC-Safety: a multi-turn adversarial safety benchmark for Chinese large models

    149 stars · 12 forks

  4. SuperCLUE-Auto Public

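    Evaluation benchmark for Chinese large models in the automotive industry: fine-grained evaluation based on multi-turn open-ended questions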

    38 stars · 4 forks

  5. SuperCLUE-Agent Public

    SuperCLUE-Agent: an evaluation benchmark for core agent capabilities, based on native Chinese tasks

    94 stars · 6 forks

  6. SuperCLUE-RAG Public

    Native Chinese retrieval-augmented generation (RAG) evaluation benchmark

    124 stars · 4 forks

Repositories

Showing 10 of 52 repositories
  • CLUE Public

    Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

    Python · 4,227 stars · 547 forks · 78 open issues · 2 pull requests · Updated Sep 8, 2025
  • SuperCLUE Public

    SuperCLUE: A Comprehensive Benchmark for General-Purpose Foundation Models in Chinese

    3,270 stars · 112 forks · 38 open issues · 0 pull requests · Updated Sep 8, 2025
  • SuperCLUE-CPIFOpen Public

    Chinese precise instruction-following evaluation benchmark (open-source edition)

    Python · 6 stars · 1 fork · 0 open issues · 0 pull requests · Updated Aug 12, 2025
  • Math24o Public

    Math24o: High School Olympiad Mathematics Chinese Benchmark

    Python · 11 stars · 0 forks · 0 open issues · 0 pull requests · Updated Mar 27, 2025
  • 2024h1 Public

    Benchmark Evaluation Report of LLMs in Chinese, First Half of 2024

    1 star · 0 forks · 1 open issue · 0 pull requests · Updated Jul 9, 2024
  • SuperCLUE-Video Public

    Native Chinese multi-level text-to-video evaluation benchmark

    18 stars · 1 fork · 0 open issues · 0 pull requests · Updated Jul 8, 2024
  • SuperCLUE-V Public

    Native Chinese multimodal understanding evaluation benchmark (evaluation plan)

    3 stars · 0 forks · 0 open issues · 0 pull requests · Updated Jul 8, 2024
  • SuperCLUE-Long Public

    Native Chinese long-text evaluation benchmark

    5 stars · 0 forks · 0 open issues · 0 pull requests · Updated Jul 8, 2024
  • SuperCLUE-Image Public

    Native Chinese text-to-image evaluation benchmark

    9 stars · 0 forks · 0 open issues · 0 pull requests · Updated Jul 8, 2024
  • SuperCLUE-Coder Public

    Native Chinese code assistant evaluation benchmark, product-level

    0 stars · 0 forks · 0 open issues · 0 pull requests · Updated Jul 8, 2024