Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Bump gram-newton-schulz to 0.1.5 (cutlass-dsl 4.5.2)#95

Merged
JohnLangford merged 1 commit into
main from
jcl/dion-gram-ns-0.1.5
Jun 26, 2026
Merged

Bump gram-newton-schulz to 0.1.5 (cutlass-dsl 4.5.2) #95
JohnLangford merged 1 commit into
main from
jcl/dion-gram-ns-0.1.5

Conversation

@JohnLangford

@JohnLangford JohnLangford commented Jun 26, 2026

Copy link
Copy Markdown
Contributor

What

Bump the optional [gns] / [gram-newton-schulz] extra's pin from
gram-newton-schulz==0.1.4 to 0.1.5 in requirements_gns.txt, and reconcile
the transitive pins it controls.

Transitive pins (reconciled, not just one line)

gram-newton-schulz==0.1.5 directly pins its dependencies differently from 0.1.4:

Package 0.1.4 (Requires-Dist) 0.1.5 (Requires-Dist) This PR
quack-kernels >=0.4.1 ==0.5.0 pin updated 0.4.10.5.0
nvidia-cutlass-dsl ==4.4.2 ==4.5.2 tracked in comment/docs 4.4.24.5.2

Both transitive versions changed, so this is not a no-op renumber. The explanatory
comment in requirements_gns.txt was updated to cite the 0.1.5-correct cutlass version.

Docs reconciled

The cutlass-dsl pin is the whole point of these explicit pins — drift has historically
broken FlashAttention-4 downstream. Notably, 4.5.2 is the same cutlass version the
README/CHANGELOG attribute to FA4 / Blackwell stacks, so the previously-documented
4.4.2-vs-4.5.2 conflict is now resolved, not merely renumbered. Updated:

  • README.md (Quick Start note + train extra note): 4.4.24.5.2, and the "install
    in a separate environment / conflicts with FA4" wording reworded to reflect that the
    extra's cutlass pin now matches FA4 / Blackwell.
  • CHANGELOG.md (Unreleased): dropped the now-false 4.4.2 conflict statements from the
    breaking-install entry and added a bullet documenting the bump and the resulting
    nvidia-cutlass-dsl 4.4.24.5.2 alignment.

Verification

  • Confirmed gram-newton-schulz==0.1.5 exists on PyPI.
  • Read Requires-Dist for 0.1.5 from the PyPI JSON API: quack-kernels==0.5.0,
    nvidia-cutlass-dsl==4.5.2. Cross-checked quack-kernels==0.5.0 requires
    nvidia-cutlass-dsl>=4.5.2 (consistent with the ==4.5.2 pin).
  • pip install --dry-run resolve of gram-newton-schulz==0.1.5 in a scratch venv
    resolved to exactly gram-newton-schulz 0.1.5 / quack-kernels 0.5.0 /
    nvidia-cutlass-dsl 4.5.2
    , matching the written pins.

Bump the optional [gns] extra's gram-newton-schulz pin 0.1.4 -> 0.1.5.
0.1.5 directly pins quack-kernels==0.5.0 and nvidia-cutlass-dsl==4.5.2
(was quack-kernels>=0.4.1 / cutlass-dsl 4.4.2), so update the
quack-kernels pin to 0.5.0 and the transitive-pin comment accordingly.
The cutlass-dsl pin now matches the 4.5.2 used by Flash-Attention-4 /
Blackwell stacks, resolving the conflict the README/CHANGELOG previously
documented; reconcile those docs to the new versions and framing.
Verified against the PyPI metadata for gram-newton-schulz 0.1.5 and a
pip --dry-run resolve, which both produce gram-newton-schulz 0.1.5 /
quack-kernels 0.5.0 / nvidia-cutlass-dsl 4.5.2.
@JohnLangford JohnLangford merged commit a2629fd into main Jun 26, 2026
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Reviewers

No reviews

Assignees

No one assigned

Labels

None yet

Projects

None yet

Milestone

No milestone

Development

Successfully merging this pull request may close these issues.

1 participant

AltStyle によって変換されたページ (->オリジナル) /