THUKEG

ChatGLM, GLM-4, CogVLM, CodeGeeX, CogView, ImageReward, CogVideoX | CogDL, GraphMAE, AMiner | Zhipu.ai (Z.ai) & Knowledge Engineering Group (KEG)

Pinned

GLM (Public)

GLM (General Language Model)

Python, 3.4k stars, 339 forks

slime (Public)

slime is an LLM post-training framework for RL Scaling.

Python, 3.4k stars, 424 forks

P-tuning-v2 (Public)

An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

Python, 2.1k stars, 207 forks

ReST-MCTS (Public)

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python, 688 stars, 51 forks

T1 (Public)

RL Scaling and Test-Time Scaling (ICML'25)

112 stars, 1 fork

AgentRL (Public)

Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

Python, 187 stars, 9 forks