DeepSpeed

From Wikipedia, the free encyclopedia
Microsoft open source library
DeepSpeed
Original author(s): Microsoft Research
Developer(s): Microsoft
Initial release: May 18, 2020
Stable release: v0.16.2 / December 18, 2024
Repository: github.com/microsoft/DeepSpeed
Written in: Python, CUDA, C++
Type: Software library
License: Apache License 2.0
Website: deepspeed.ai

DeepSpeed is an open source deep learning optimization library for PyTorch.[1]

Library


The library is designed to reduce computing power and memory use and to train large distributed models with better parallelism on existing computer hardware.[2][3] DeepSpeed is optimized for low-latency, high-throughput training. It includes the Zero Redundancy Optimizer (ZeRO), which partitions model states across devices to enable training models with one trillion or more parameters.[4] Features include mixed-precision training; single-GPU, multi-GPU, and multi-node training; and custom model parallelism, as illustrated in the sketch below. The DeepSpeed source code is licensed under the Apache License 2.0 and available on GitHub.[5]
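
A minimal sketch of the library's documented usage pattern follows; the toy model, batch size, and learning rate are illustrative assumptions rather than values from the cited sources. The configuration dictionary enables two of the features described above: fp16 mixed-precision training and ZeRO stage 2, which partitions optimizer states and gradients across data-parallel GPUs.

    import torch
    import deepspeed

    # Illustrative configuration (not from the article): fp16 mixed precision
    # plus ZeRO stage 2, which partitions optimizer states and gradients
    # across data-parallel ranks.
    ds_config = {
        "train_batch_size": 16,
        "fp16": {"enabled": True},
        "zero_optimization": {"stage": 2},
        "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
    }

    model = torch.nn.Linear(1024, 1024)  # placeholder model for illustration

    # deepspeed.initialize wraps the model in an engine that manages
    # distributed data parallelism, ZeRO partitioning, and loss scaling.
    model_engine, optimizer, _, _ = deepspeed.initialize(
        model=model,
        model_parameters=model.parameters(),
        config=ds_config,
    )

    for step in range(10):
        # With fp16 enabled the model runs in half precision, so the dummy
        # inputs are cast to match.
        inputs = torch.randn(16, 1024, device=model_engine.device,
                             dtype=torch.half)
        loss = model_engine(inputs).pow(2).mean()  # dummy loss
        model_engine.backward(loss)  # engine-managed backward (loss scaling)
        model_engine.step()          # optimizer step; engine zeroes gradients

In practice such a script is launched with the bundled launcher (for example, deepspeed train.py), which sets up the distributed process group for multi-GPU and multi-node runs.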

The developers claimed up to a 6.2x throughput improvement, 2.8x faster convergence, and 4.6x less communication.[6]

References


Further reading

  • Rajbhandari, Samyam; Rasley, Jeff; Ruwase, Olatunji; He, Yuxiong (2019). "ZeRO: Memory Optimization Towards Training A Trillion Parameter Models". arXiv:1910.02054 [cs.LG].