Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

THUKEG

ChatGLM, GLM-4, CogVLM, CodeGeeX, CogView, ImageReward, CogVideoX | CogDL, GraphMAE, AMiner | Zhipu.ai (Z.ai) & Knowledge Engineering Group (KEG)

Pinned Loading

  1. GLM GLM Public

    GLM (General Language Model)

    Python 3.4k 339

  2. slime slime Public

    slime is an LLM post-training framework for RL Scaling.

    Python 3.4k 424

  3. P-tuning-v2 P-tuning-v2 Public

    An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

    Python 2.1k 207

  4. ReST-MCTS ReST-MCTS Public

    ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

    Python 688 51

  5. T1 T1 Public

    RL Scaling and Test-Time Scaling (ICML'25)

    112 1

  6. AgentRL AgentRL Public

    Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

    Python 187 9

Repositories

Loading
Type
Select type
Language
Select language
Sort
Select order
Showing 10 of 126 repositories
  • slime Public

    slime is an LLM post-training framework for RL Scaling.

    THUDM/slime’s past year of commit activity
    Python 3,356 Apache-2.0 424 118 (3 issues need help) 43 Updated Jan 16, 2026
  • CaRR Public

    This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards".

    THUDM/CaRR’s past year of commit activity
    Python 43 MIT 3 1 0 Updated Jan 12, 2026
  • MobileRL Public
    THUDM/MobileRL’s past year of commit activity
    Python 51 MIT 6 1 0 Updated Dec 23, 2025
  • AgentRL Public

    Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

    THUDM/AgentRL’s past year of commit activity
    Python 187 MIT 9 7 0 Updated Dec 16, 2025
  • AgentBench Public

    A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

    THUDM/AgentBench’s past year of commit activity
    Python 3,087 Apache-2.0 222 57 (38 issues need help) 8 Updated Nov 17, 2025
  • ComputerRL Public
    THUDM/ComputerRL’s past year of commit activity
    Python 12 Apache-2.0 5 3 0 Updated Nov 7, 2025
  • PETra Public
    THUDM/PETra’s past year of commit activity
    Python 2 0 0 0 Updated Nov 5, 2025
  • AlignBench Public

    大模型多维度中文对齐评测基准 (ACL 2024)

    THUDM/AlignBench’s past year of commit activity
    Python 421 30 15 0 Updated Oct 25, 2025
  • LLM4CardGame Public
    THUDM/LLM4CardGame’s past year of commit activity
    Python 10 1 2 0 Updated Oct 15, 2025
  • DeepDive Public

    DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL

    THUDM/DeepDive’s past year of commit activity
    Python 229 19 2 0 Updated Oct 2, 2025

AltStyle によって変換されたページ (->オリジナル) /