Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)

License

Notifications You must be signed in to change notification settings

bobo0810/LearnDeepSpeed

Folders and files

NameName
Last commit message
Last commit date

Latest commit

History

16 Commits

Repository files navigation

LearnDeepSpeed 🚀

目的:基于DeepSpeed,突破硬件限制,实现大模型高效训练。

最小示例

  • cifar示例
    • 分布式数据并行DDP的训练pipeline
    • MoE用法
    • 学习率调度器的配置
    • ZeRO零冗余优化器的配置
  • pipeline_parallelism示例
    • 流水并行的训练pipeline
    • 流水模型的保存、加载、指标评估
    • TensorBoard可视化

DeepSpeed训练Tricks

https://zhuanlan.zhihu.com/p/654923210

DeepSpeed训练配置

https://zhuanlan.zhihu.com/p/654925843

参考

About

DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

AltStyle によって変換されたページ (->オリジナル) /