Pull requests: vllm-project/vllm

New pull request New

1,178 Open 14,151 Closed

Pull requests list

[PERF] Decouple projections from GDN custom op qwen

#27512 opened Oct 25, 2025 by vadiklyutiy

Add standalone multimodal encoder benchmark frontend performance

#27511 opened Oct 25, 2025 by alhridoy

add cpu device support for nixl_connector kv-connector

#27510 opened Oct 25, 2025 by ZhengHongming888

5 tasks

qwen3moe on gh200 qwen

#27507 opened Oct 25, 2025 by bhaktatejas922

[Multimodal] Move profiling info out of processing info deepseek llama multi-modality qwen

#27506 opened Oct 25, 2025 by DarkLight1337 • Draft

7 tasks

Clarify V0→V1 error; keep SamplingParams importable when VLLM_USE_V1=0 frontend v1

#27503 opened Oct 25, 2025 by nick-allison

3 of 5 tasks

Prefill / Decode Split into Compiled Region

#27501 opened Oct 25, 2025 by therealnaveenkamal

1 of 5 tasks

update trtllm gen moe api

#27498 opened Oct 25, 2025 by jiahanc • Draft

5 tasks

feat: make extraInit containers fully configurable in helm chart documentation

#27497 opened Oct 25, 2025 by HanFa

3 of 5 tasks

[WIP] [GPT-OSS] customized symm_mem based EP comm kernel integration frontend gpt-oss v1

#27495 opened Oct 25, 2025 by Luosuu

[Bugfix] fix empty prompts for async-engine mode in benchmark throughput performance ready

#27494 opened Oct 25, 2025 by luccafong

[Performance] Support FP8 flashinfer TRTLLM MOE on Qwen3 and Qwen-3next qwen

#27492 opened Oct 24, 2025 by jiahanc • Draft

5 tasks

Add more dims for batch invariant shims

#27489 opened Oct 24, 2025 by bwasti

3 of 5 tasks

[Chore] Optimize P2PNCCLEngine http_address kv-connector ready

#27488 opened Oct 24, 2025 by yewentao256

[Bugfix][LoRA][FusedMoE] Select MxFP4 Backend based on LoRA Enablement ready

#27487 opened Oct 24, 2025 by varun-sundar-rabindranath

[Refactor] Add Shared Block Max Reduction Helper

#27483 opened Oct 24, 2025 by harishappana-git

5 tasks

[Test] Draft: Nixl fault tests ci/build kv-connector ready v1

#27481 opened Oct 24, 2025 by wseaton

[Test] Batch Invariant: Unit test using parameterized backend ready v1

#27478 opened Oct 24, 2025 by yewentao256

@yewentao256

[Rocm][fused_moe][fp4] view weight to torch.float4_e2m1fn_x2 when running aiter fused moe for fp4 model rocm

#27474 opened Oct 24, 2025 by zejunchen-zejun

[Kernel] Enable moe LoRA kernel support FP16 ready

#27468 opened Oct 24, 2025 by jeejeelee

5 tasks

[Bugfix] Fix processor initialization for model from modelscope instead of HF ready

#27461 opened Oct 24, 2025 by lengrongfu

5 tasks

[Typo] fix clangd log to marker

#27459 opened Oct 24, 2025 by Echo-Nie

[Performance][MLA][ROCm] Remove redundant D2D copy in deepseek deepseek rocm v1

#27457 opened Oct 24, 2025 by ganyi1996ppo

5 tasks

[Model] [Bugfix] Fix inconsistencies in the handling of layer names

#27453 opened Oct 24, 2025 by Alnusjaponica • Draft

2 tasks

Fix decoding server's logprobs handling in Prefill/Decode disaggregation mode frontend kv-connector v1

#27449 opened Oct 24, 2025 by Prowindy

5 tasks

ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Pull requests: vllm-project/vllm

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Pull requests list