Pull requests: vllm-project/vllm

Pull requests list

[PERF] Decouple projections from GDN custom op (labels: qwen)
#27512 opened Oct 25, 2025 by vadiklyutiy
Add standalone multimodal encoder benchmark (labels: frontend, performance)
#27511 opened Oct 25, 2025 by alhridoy
add cpu device support for nixl_connector (labels: kv-connector)
#27510 opened Oct 25, 2025 by ZhengHongming888
5 tasks
qwen3moe on gh200 (labels: qwen)
#27507 opened Oct 25, 2025 by bhaktatejas922
[Multimodal] Move profiling info out of processing info (labels: deepseek, llama, multi-modality, qwen)
#27506 opened Oct 25, 2025 by DarkLight1337 Draft
7 tasks
Prefill / Decode Split into Compiled Region
#27501 opened Oct 25, 2025 by therealnaveenkamal
1 of 5 tasks
update trtllm gen moe api
#27498 opened Oct 25, 2025 by jiahanc Draft
5 tasks
feat: make extraInit containers fully configurable in helm chart (labels: documentation)
#27497 opened Oct 25, 2025 by HanFa
3 of 5 tasks
[Bugfix] fix empty prompts for async-engine mode in benchmark throughput (labels: performance, ready)
#27494 opened Oct 25, 2025 by luccafong
[Performance] Support FP8 flashinfer TRTLLM MOE on Qwen3 and Qwen-3next (labels: qwen)
#27492 opened Oct 24, 2025 by jiahanc Draft
5 tasks
Add more dims for batch invariant shims
#27489 opened Oct 24, 2025 by bwasti
3 of 5 tasks
[Chore] Optimize P2PNCCLEngine http_address (labels: kv-connector, ready)
#27488 opened Oct 24, 2025 by yewentao256
[Bugfix][LoRA][FusedMoE] Select MxFP4 Backend based on LoRA Enablement (labels: ready)
#27487 opened Oct 24, 2025 by varun-sundar-rabindranath
[Refactor] Add Shared Block Max Reduction Helper
#27483 opened Oct 24, 2025 by harishappana-git
5 tasks
[Test] Draft: Nixl fault tests (labels: ci/build, kv-connector, ready, v1)
#27481 opened Oct 24, 2025 by wseaton
[Test] Batch Invariant: Unit test using parameterized backend (labels: ready, v1)
#27478 opened Oct 24, 2025 by yewentao256
[Kernel] Enable moe LoRA kernel support FP16 (labels: ready)
#27468 opened Oct 24, 2025 by jeejeelee
5 tasks
[Bugfix] Fix processor initialization for model from modelscope instead of HF (labels: ready)
#27461 opened Oct 24, 2025 by lengrongfu
5 tasks
[Typo] fix clangd log to marker
#27459 opened Oct 24, 2025 by Echo-Nie