Pull requests: vllm-project/vllm

New pull request New

1,231 Open 14,415 Closed

Pull requests list

[Frontend] Make RequestIdMiddleware return the internal request_id frontend ready

#27983 opened Nov 3, 2025 by markmc

[Quantization] support gpt-oss for quantized kv cache weight loading gpt-oss

#27980 opened Nov 3, 2025 by xuebwang-amd

5 tasks

[KVConnector] Enable get_block_ids_with_load_errors() in LMCache connector kv-connector

#27978 opened Nov 3, 2025 by ziruiliu

5 tasks

fix(benchmarks): Remove hardcoded dtype in hf backend performance

#27976 opened Nov 3, 2025 by git-jxj

3 of 5 tasks

[Refactor] Lazy import tool_parser deepseek documentation frontend llama tool-calling

#27974 opened Nov 3, 2025 by chaunceyjiang

5 tasks

[Model] fix ernie45 reasoning_parser ready

#27973 opened Nov 3, 2025 by CSWYF3634076

[Bugfix] Handle escaped characters in GLM tool parser to prevent double serialization ci/build frontend gpt-oss tool-calling v1

#27970 opened Nov 3, 2025 by soaringk

3 of 5 tasks

[Model][Bugfix] fix pipeline parallelism support for NemotronH

#27968 opened Nov 3, 2025 by tomeras91

[Model] app optimal triton fused moe configs for NemotronH MoE performance

#27967 opened Nov 3, 2025 by tomeras91

[Bugfix][ROCm] Fix AITER attention backend for deepseek-ocr model deepseek rocm v1

#27965 opened Nov 3, 2025 by vllmellm

5 tasks

[Doc][Last/N] Improve all pooling task | Refactor pooling-related documentation documentation

#27963 opened Nov 3, 2025 by noooop • Draft

5 tasks

[Refactor] to simplify and extract the shared logic between chat completion and responses frontend ready tool-calling

#27961 opened Nov 3, 2025 by chaunceyjiang

5 tasks

[LoRA][FusedMoE] Introduce FusedMoEPermuteExpertsUnpermuteWithLoRA needs-rebase

#27959 opened Nov 3, 2025 by varun-sundar-rabindranath

Make pre-commit work on fedora

#27958 opened Nov 3, 2025 by rabi

[V0 deprecation] Remove VLLM_USE_V1 usage in most modules documentation frontend kv-connector multi-modality structured-output v1

#27955 opened Nov 3, 2025 by wangxiyuan

5 tasks

@sangstar

[CPU] Refactor CPU attention backend ci/build v1

#27954 opened Nov 3, 2025 by bigPYJ1151

2 of 5 tasks

[HARDWARE][CPU] Add Option for Disabling Binding to Specific CPU Cores documentation v1

#27953 opened Nov 3, 2025 by StanHatko

4 of 5 tasks

Update Flashinfer from v0.4.1 to v0.5.0 ci/build ready

#27952 opened Nov 3, 2025 by hmellor

v0.11.1

[CI/Build] Update checking logic in cutlass_group_gemm_supported moe rocm

#27948 opened Nov 2, 2025 by zhewenl

[CI/Build] amd-ci-fix-kernels-attn ci/build rocm

#27947 opened Nov 2, 2025 by zhewenl • Draft

5 tasks

Fix hard-coded parameter name in gemma3n.py

#27946 opened Nov 2, 2025 by seungduk-yanolja

5 tasks

[CI/Build] Update LM Eval Version in AMD CI ci/build rocm

#27944 opened Nov 2, 2025 by zhewenl

[V1][Perf] Optimize Medusa proposer: reduce sync overhead speculative-decoding v1

#27943 opened Nov 2, 2025 by skyloevil

[Metrics] [KVConnector] Add Offloading Connector metrics kv-connector v1

#27942 opened Nov 2, 2025 by omerpaz95

[Bugfix][Core] Load plugins in new processes created by fork

#27940 opened Nov 2, 2025 by matan-dup

3 of 5 tasks

ProTip! Mix and match filters to narrow down what you’re looking for.

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Pull requests: vllm-project/vllm

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Pull requests list