Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Pull requests: modelscope/ms-swift

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[megatron] Support Internvl3/Internvl3.5
#5647 opened Sep 3, 2025 by Jintao-Huang Loading...
bug fix: RuntimeError when training GRPO with LoRA and PtEngine
#5645 opened Sep 3, 2025 by chenjianhuii Loading...
1 of 4 tasks
Fix: when use PPOTrainer for rlhf, custom callbacks will not work
#5637 opened Sep 2, 2025 by kiritoxkiriko Loading...
1 of 4 tasks
Bug fix: eval OOM due to deepcopy of torch model
#5607 opened Aug 29, 2025 by hellopahe Loading...
1 task done
[template] fix suffix of ChatML template bug Something isn't working
#5589 opened Aug 28, 2025 by popomen Loading...
1 of 4 tasks
[init]support gptq grpo in colocate mode
#5569 opened Aug 27, 2025 by ItGirls Loading...
1 of 4 tasks
[WIP]Merge ulysses and ring-attention
#5522 opened Aug 25, 2025 by tastelikefeet Loading...
1 of 4 tasks
[Feature] Add Swanlab Slack notification
#4887 opened Jul 9, 2025 by dykderrick Loading...
2 of 4 tasks
Aacedar patch 3
#4832 opened Jul 4, 2025 by aacedar Loading...
Update template_meta.prefix bug
#4813 opened Jul 3, 2025 by aacedar Loading...
support ernie_vl
#4763 opened Jun 30, 2025 by Jintao-Huang Loading...
fix: add SO_REUSEADDR to find_free_port to handle TIME_WAIT state
#4573 opened Jun 12, 2025 by qykong Loading...
1 of 4 tasks
swift-megatron qwen3-235b-a22b stale
#4401 opened May 29, 2025 by fudp Loading...
3 tasks
Neptune completion logging stale
#3904 opened Apr 16, 2025 by Reichenbachian Loading...
1 task done
Update dataset_info.json stale
#3723 opened Mar 31, 2025 by sandeep-sm Loading...
3 tasks
[WIP] support reasoning_content
#3159 opened Feb 18, 2025 by Jintao-Huang Loading...
loss_scale bug when meeting <image>
#3036 opened Feb 8, 2025 by mangoyuan Draft
1 of 4 tasks
add example OCRBench dataset
#2677 opened Dec 17, 2024 by ex-yanminmin001 Loading...
3 tasks
support pixtral large
#2481 opened Nov 20, 2024 by Jintao-Huang Draft
add push to hub tracker
#1214 opened Jun 24, 2024 by tastelikefeet Loading...
1 of 4 tasks
Fix bug for less data then grad acc
#779 opened Apr 23, 2024 by Firmament-cyou Loading...
1 of 4 tasks
ProTip! Adding no:label will show everything without a label.

AltStyle によって変換されたページ (->オリジナル) /