Pull requests: intel/auto-round

#1414: [fix] AttributeError: 'Autotuner' object has no attribute '_cache_lock' (opened Feb 5, 2026 by xin3he)
#1413: refine qwen3_vl_moe experts forward (opened Feb 5, 2026 by WeiweiZhang1)
#1411: better warning for unsupported MLLMs when quant_non_text_module is set (opened Feb 5, 2026 by n1ck-guo)
#1410: update auto-round-kernel package name to auto-round-lib (opened Feb 5, 2026 by chensuyue)
#1409: adapt vllm_ext to new extra config (opened Feb 5, 2026 by mengniwang95)
#1404: Support Qwen3 Omni model quantization (Draft; milestone 0.10.0; opened Feb 4, 2026 by lvliang-intel)
#1394: support multi-device evaluation for activation-quantized models (opened Feb 4, 2026 by wenhuach21)
#1386: Optimize CPU RAM peak memory during quantization (opened Feb 3, 2026 by lvliang-intel)
#1384: fix Qwen3-VL model auto_awq export, add auto_awq vllm ut (opened Feb 2, 2026 by WeiweiZhang1)
#1365: Refactor module access to use PyTorch get/set_submodule API (opened Jan 29, 2026 by scopophobic)
#1349: support Hadamard transform for mxfp4 with rtn or autoround method (opened Jan 27, 2026 by lkk12014402)
#1339: refactor init of compressor (labels: engineering, ready; opened Jan 26, 2026 by n1ck-guo)
#1289: Robust FP8 layer detection for ignore_layers (#1283) (opened Jan 15, 2026 by scopophobic)
#1286: Fix ignore_layers not working for FP8 models (opened Jan 15, 2026 by Copilot (AI))
#1278: [WIP][refactor quantizers][step 1] refactor rtn and tuning (opened Jan 14, 2026 by n1ck-guo)
#1017: add per-task lm_eval args for experimental usage (opened Nov 11, 2025 by WeiweiZhang1)

Tip: filter pull requests by the default branch with base:main.