-
Notifications
You must be signed in to change notification settings - Fork 393
Pull requests: vllm-project/llm-compressor
Pull requests list
[ddp] fixing data slice bug
quality-failed
ready
When a PR is ready for review
#2361
opened Feb 12, 2026 by
HDCharles
Loading...
Fix CI/CD failures
ready
When a PR is ready for review
#2359
opened Feb 12, 2026 by
dsikka
Loading...
add qwen3 vl autoround example
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2357
opened Feb 12, 2026 by
xin3he
Loading...
feat: early group-size divisibility check with layer FQNs
#2353
opened Feb 11, 2026 by
GOavi101
Loading...
DataLoader options, single-pass weight calibration, optional sequential prefetch
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2349
opened Feb 11, 2026 by
GOavi101
Loading...
[AWQ] Add activation_hook_target field for custom activation cache hooking
ready
When a PR is ready for review
#2346
opened Feb 10, 2026 by
ZewenShen-Cohere
Loading...
AWQ: orig_layer_weights should save all balance layer weights
ready
When a PR is ready for review
#2344
opened Feb 10, 2026 by
ZewenShen-Cohere
Loading...
Add model_free_ptq example for glm 4.6 block fp8
documentation
Improvements or additions to documentation
#2343
opened Feb 10, 2026 by
mgoin
Loading...
[Bugfix] Guard against MLA
ready
When a PR is ready for review
#2337
opened Feb 6, 2026 by
kylesayrs
Loading...
[MoE] MiniMax-M2/M2.1 calibration follow-up
documentation
Improvements or additions to documentation
#2335
opened Feb 6, 2026 by
LudovicoYIN
•
Draft
[GPTQ][ddp] PoC for GPTQ with DDP
enhancement
New feature or request
gptq
For any PR / issue related to GPTQ support
quality-failed
#2333
opened Feb 6, 2026 by
HDCharles
Loading...
Add GSM8K evaluation script and AWQ+FP8 results
documentation
Improvements or additions to documentation
#2330
opened Feb 4, 2026 by
rtj1
Loading...
[AWQ] Add option to consider smooth layer quantization in scale search
#2323
opened Jan 31, 2026 by
Ramshankar07
Loading...
Benchmark torch.compile optimization for quantization
ready
When a PR is ready for review
#2320
opened Jan 31, 2026 by
colldata79
Loading...
Add AFMOE mappings for awq and smoothquant
ready
When a PR is ready for review
#2316
opened Jan 30, 2026 by
bartowski1182
Loading...
move smoothquant to transforms
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2314
opened Jan 30, 2026 by
Etelis
Loading...
[Misc] Reword warning message to make log grepping easier
#2312
opened Jan 29, 2026 by
kylesayrs
Loading...
Support FP8 Block Quantization for Non-Divisible Shapes
#2290
opened Jan 26, 2026 by
Etelis
Loading...
3 of 4 tasks
Refactor Matching Logic to Use compressed-tensors Utilities
needs-rebase
ready
When a PR is ready for review
#2284
opened Jan 24, 2026 by
Etelis
Loading...
ProTip!
Add no:assignee to see everything that’s not assigned.