Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Pull requests: vllm-project/llm-compressor

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[ddp] fixing data slice bug quality-failed ready When a PR is ready for review
#2361 opened Feb 12, 2026 by HDCharles Loading...
Fix CI/CD failures ready When a PR is ready for review
#2359 opened Feb 12, 2026 by dsikka Loading...
add qwen3 vl autoround example documentation Improvements or additions to documentation ready When a PR is ready for review
#2357 opened Feb 12, 2026 by xin3he Loading...
DataLoader options, single-pass weight calibration, optional sequential prefetch documentation Improvements or additions to documentation ready When a PR is ready for review
#2349 opened Feb 11, 2026 by GOavi101 Loading...
[AWQ] Add activation_hook_target field for custom activation cache hooking ready When a PR is ready for review
#2346 opened Feb 10, 2026 by ZewenShen-Cohere Loading...
AWQ: orig_layer_weights should save all balance layer weights ready When a PR is ready for review
#2344 opened Feb 10, 2026 by ZewenShen-Cohere Loading...
Add model_free_ptq example for glm 4.6 block fp8 documentation Improvements or additions to documentation
#2343 opened Feb 10, 2026 by mgoin Loading...
[Bugfix] Guard against MLA ready When a PR is ready for review
#2337 opened Feb 6, 2026 by kylesayrs Loading...
Improve how we identify and run e2e smoke tests
#2336 opened Feb 6, 2026 by dhuangnm Loading...
[MoE] MiniMax-M2/M2.1 calibration follow-up documentation Improvements or additions to documentation
#2335 opened Feb 6, 2026 by LudovicoYIN Draft
[GPTQ][ddp] PoC for GPTQ with DDP enhancement New feature or request gptq For any PR / issue related to GPTQ support quality-failed
#2333 opened Feb 6, 2026 by HDCharles Loading...
[bug][awq] fix inf handling awq For any issue / PR related to AWQ support bug Something isn't working ready When a PR is ready for review
#2332 opened Feb 5, 2026 by HDCharles Loading...
[AutoRound] Add DP Support
#2331 opened Feb 5, 2026 by yiliu30 Loading...
Add GSM8K evaluation script and AWQ+FP8 results documentation Improvements or additions to documentation
#2330 opened Feb 4, 2026 by rtj1 Loading...
Benchmark torch.compile optimization for quantization ready When a PR is ready for review
#2320 opened Jan 31, 2026 by colldata79 Loading...
Update vLLM GPU Utilization
#2319 opened Jan 30, 2026 by dsikka Draft
Add AFMOE mappings for awq and smoothquant ready When a PR is ready for review
#2316 opened Jan 30, 2026 by bartowski1182 Loading...
move smoothquant to transforms documentation Improvements or additions to documentation ready When a PR is ready for review
#2314 opened Jan 30, 2026 by Etelis Loading...
Support FP8 Block Quantization for Non-Divisible Shapes
#2290 opened Jan 26, 2026 by Etelis Loading...
3 of 4 tasks
Refactor Matching Logic to Use compressed-tensors Utilities needs-rebase ready When a PR is ready for review
#2284 opened Jan 24, 2026 by Etelis Loading...
Previous 1
Previous
ProTip! Add no:assignee to see everything that’s not assigned.

AltStyle によって変換されたページ (->オリジナル) /