Releases: databricks/megablocks

v0.10.0

30 May 04:36
@bowenyang008

What's Changed

New Contributors

Full Changelog: v0.9.0...v0.10.0

Contributors

irenedea, bowenyang008, and dakinggg

v0.9.0

09 Apr 00:06
@irenedea

What's Changed

New Contributors

Full Changelog: v0.8.0...v0.9.0

Contributors

irenedea and rithwik-db

v0.8.0

14 Mar 17:57
@rithwik-db

Breaking Changes

As a consequence of the torch 2.6.0 upgrade, sparse support is disabled in megablocks, so only grouped support is available.

For additional context: torch 2.6.0 depends on triton 3.2.0, which changed how dtype promotion is handled when the two operands of a binary operation have different dtypes. As a result, an int16 overflow occurs in the stk dependency of megablocks, which causes an illegal memory access (IMA). We will release a new version of megablocks once this issue is resolved. See #168 for additional details.
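
To illustrate why an int16 overflow can end in an illegal memory access, here is a minimal pure-Python sketch of 16-bit wraparound. It does not reproduce the stk kernel or Triton's promotion rules; `wrap_int16`, `row_index`, and `row_stride` are hypothetical names and values chosen only so that the product exceeds the int16 maximum of 32767.

```python
def wrap_int16(value: int) -> int:
    """Reduce a Python int to the value a signed 16-bit integer would hold."""
    return (value + 2**15) % 2**16 - 2**15


# Hypothetical index arithmetic; the real expression in stk may differ.
row_index = 300
row_stride = 200

exact_offset = row_index * row_stride       # what a wider type such as int32 holds
wrapped_offset = wrap_int16(exact_offset)   # what the same product holds in int16

print(exact_offset)    # 60000
print(wrapped_offset)  # -5536
```

If a wrapped value like this is used as a memory offset, the access lands outside the intended buffer, which is one way an overflow can surface as an IMA.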

What's Changed

New Contributors

Full Changelog: v0.7.0...v0.8.0

Contributors

rithwik-db

v0.7.0

20 Nov 00:44
@j316chuck

What's Changed

New Contributors

Full Changelog: v0.6.1...v0.7.0

Contributors

janEbert, j316chuck, and 4 other contributors

v0.6.1

31 Aug 14:49
@snarayan21

What's New

Patch release that replaces dependencies specified via GitHub with released versions from PyPI (specifically, stanford-stk and grouped-gemm). This allows megablocks itself to be released on PyPI.
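
As a rough way to check which released versions are present in an environment, the sketch below queries installed package metadata with the standard library. The distribution names are taken from the note above and are assumed to match what is installed; `installed_version` is a hypothetical helper, not part of megablocks.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(distribution: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(distribution)
    except PackageNotFoundError:
        return None


# Names as written in the release note; adjust if your environment differs.
for name in ("megablocks", "stanford-stk", "grouped-gemm"):
    print(f"{name}: {installed_version(name) or 'not installed'}")
```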

What's Changed

  • Remove direct dependencies, allowing for megablocks pypi release by @snarayan21 in #149

Full Changelog: v0.6.0...v0.6.1

Contributors

snarayan21

v0.6.0

30 Aug 18:55
@eitanturok

What's New

1. Torch 2.4 Compatibility (#145)

MegaBlocks now supports Torch 2.4!

2. New CI/CD

MegaBlocks has new GitHub Actions workflows for better CI/CD! On every PR, MegaBlocks now automatically lints and formats the code (#131) and runs tests on a GPU (#127).

3. Remove Weight Parallelism (#137)

Weight parallelism was unused, so we removed it.

4. Shared Experts (#109)

Implemented shared experts, based on the DeepSeekMoE paper.
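
The idea behind shared experts can be sketched in a few lines of plain Python: a shared expert processes every token, and its output is added to the weighted outputs of the top-k routed experts. This is a toy illustration under stated assumptions, not the MegaBlocks implementation: the names `scale` and `moe_with_shared_expert` are hypothetical, experts are simple scaling functions, and router scores are used as weights without renormalization.

```python
from typing import Callable, List, Sequence

Vector = List[float]
Expert = Callable[[Vector], Vector]


def scale(factor: float) -> Expert:
    """Toy 'expert' that multiplies each input element by a constant."""
    return lambda x: [factor * v for v in x]


def moe_with_shared_expert(
    x: Vector,
    shared: Expert,
    routed: Sequence[Expert],
    router_scores: Sequence[float],
    top_k: int,
) -> Vector:
    # The shared expert runs for every token, regardless of routing.
    out = shared(x)
    # Select the top_k routed experts by router score.
    order = sorted(range(len(routed)), key=lambda i: router_scores[i], reverse=True)
    for i in order[:top_k]:
        y = routed[i](x)
        # Add the routed expert's output, weighted by its router score.
        out = [o + router_scores[i] * v for o, v in zip(out, y)]
    return out


token = [1.0, 2.0]
experts = [scale(10.0), scale(100.0), scale(1000.0)]
scores = [0.5, 0.1, 0.25]  # hypothetical router scores for this token

print(moe_with_shared_expert(token, scale(1.0), experts, scores, top_k=2))
# [256.0, 512.0]
```

Here experts 0 and 2 are selected, so each element is multiplied by 1 + 0.5 * 10 + 0.25 * 1000 = 256.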

Bug Fixes

  1. Better handle incompatible ffn sizes (#108)
  2. Fix AMP for memory optimized options (#111)
  3. Don't save moe lb-loss tensors (#119)

What's Changed

New Contributors

Full Changelog: v0.5.1...v0.6.0

v0.5.1

11 Jan 22:14
@tgale96
f05609c
This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
GPG key ID: 4AEE18F83AFDEB23

What's Changed

New Contributors

Full Changelog: v0.5.0...v0.5.1

v0.5.0

08 Dec 16:51
@mvpatel2000
0460181

What's New

Several improvements to avoid CPU <> GPU device synchronizations, GLU support, and support for some new models 👀

What's Changed

New Contributors

Full Changelog: v0.4.0...v0.5.0

Contributors

sashaDoubov, vchiley, and 3 other contributors

v0.4.0

24 Oct 22:44
@mvpatel2000
6a71b18

What's Changed

Full Changelog: v0.3.3...v0.4.0

Contributors

tgale96 and mvpatel2000

v0.3.3

17 Oct 21:58
@mvpatel2000
52aa1b2

What's Changed

  • Enable running MegaBlocks MoE without bias by @vchiley in #31

Full Changelog: v0.3.2...v0.3.3

Contributors

vchiley