MCore-Bridge: Provides Megatron-Core model definitions for state-of-the-art large models and makes Megatron training as simple as Transformers. Supports 300+ large language models (Qwen3-Next, GLM-5.1, DeepSeek-V4, MiniMax-2.7, ...) and 200+ multimodal large models (Qwen3.5, Qwen3-Omni, Gemma4, ...).
transformers lora minimax peft megatron llm ms-swift deepseek-r1 llama4 gpt-oss qwen3-vl qwen3-omni deepseek-v4 glm-5 qwen3-5 gemma4
Updated Jun 11, 2026 - Python