Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

[Test] support llm-compressor: w8a8_fp8_block, wNa16 #11701

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
Wangzheee wants to merge 3 commits into sgl-project:main
base: main
Choose a base branch
Loading
from Wangzheee:llm-compressor_w8a8_fp8_block-w8a8_int8-wNa16

Conversation

Copy link

@Wangzheee Wangzheee commented Oct 16, 2025

Motivation

support llm-compressor:

  • w8a8_fp8: BLOCK
  • w8a8_int8: CHANNEL, TENSOR
  • wNa16: CHANNEL, TENSOR

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

AniZpZ reacted with thumbs up emoji
Copy link
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@Wangzheee Wangzheee force-pushed the llm-compressor_w8a8_fp8_block-w8a8_int8-wNa16 branch from 10d0b63 to 76de4b2 Compare October 16, 2025 04:28
@Wangzheee Wangzheee changed the title (削除) [Test] support llm-compressor: w8a8_fp8_block w8a8_int8 wNa16 (削除ここまで) (追記) [Test] support llm-compressor: w8a8_fp8_block, wNa16 (追記ここまで) Oct 16, 2025
@AniZpZ AniZpZ self-assigned this Oct 17, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Reviewers

@merrymercy merrymercy Awaiting requested review from merrymercy merrymercy is a code owner

@Ying1123 Ying1123 Awaiting requested review from Ying1123 Ying1123 is a code owner

@zhyncs zhyncs Awaiting requested review from zhyncs zhyncs is a code owner

@ispobock ispobock Awaiting requested review from ispobock ispobock is a code owner

@HaiShaw HaiShaw Awaiting requested review from HaiShaw HaiShaw is a code owner

@ch-wan ch-wan Awaiting requested review from ch-wan ch-wan is a code owner

@BBuf BBuf Awaiting requested review from BBuf BBuf is a code owner

@kushanam kushanam Awaiting requested review from kushanam kushanam is a code owner

@Edwardf0t1 Edwardf0t1 Awaiting requested review from Edwardf0t1 Edwardf0t1 is a code owner

At least 1 approving review is required to merge this pull request.

Labels

None yet

Projects

None yet

Milestone

No milestone

Development

Successfully merging this pull request may close these issues.

2 participants

AltStyle によって変換されたページ (->オリジナル) /