-
Notifications
You must be signed in to change notification settings - Fork 3.1k
Pull requests: sgl-project/sglang
Pull requests list
[sgl-kernel] enhance sgl-kernel import logic for sm8x
run-ci
#11707
opened Oct 16, 2025 by
FlamingoPg
Loading...
1 of 4 tasks
[quantization][MoE] fix the check for
tp_size
/ moe_ep_size
/ moe_intermediate_size
/ weight_block_size_n
run-ci
#11702
opened Oct 16, 2025 by
kevin85421
Loading...
1 of 4 tasks
[Test] support llm-compressor: w8a8_fp8_block, wNa16
#11701
opened Oct 16, 2025 by
Wangzheee
Loading...
4 tasks
chore: bump SGLang version to 0.5.3.post3
run-ci
#11693
opened Oct 16, 2025 by
sglang-bot
Loading...
[router] Add Configurable L0 and L1 Tokenizer Caching
enhancement
New feature or request
router
router-benchmark
run-ci
#11688
opened Oct 16, 2025 by
slin1237
Loading...
2 of 4 tasks
[router] fix get_models endpoint for openai router
run-ci
#11687
opened Oct 16, 2025 by
key4ng
Loading...
4 tasks
[Lint] Add
python/sglang
to ruff F401 checks and remove unused imports in files
run-ci
#11685
opened Oct 15, 2025 by
CatherineSue
Loading...
1 of 4 tasks
wip: Remove redundant fill_(0) in dp_scatter
run-ci
#11683
opened Oct 15, 2025 by
ch-wan
Loading...
4 tasks
Cleaning indexer for DeepSeek V3.2
run-ci
#11682
opened Oct 15, 2025 by
Fridge003
Loading...
4 tasks
[Router] Refactor protocol definitions: split spec.rs into modular files
run-ci
#11677
opened Oct 15, 2025 by
key4ng
Loading...
4 tasks
[Bug fix] fix Qwen3-VL dense model launch failure caused by rotary-embedding
#11675
opened Oct 15, 2025 by
coco-alen
Loading...
4 tasks
feat: return partial generation results when aborting requests in waiting queue
run-ci
#11673
opened Oct 15, 2025 by
guoyuhong
Loading...
1 of 4 tasks
[2/2] [feature] support openai like classification api in router
run-ci
#11670
opened Oct 15, 2025 by
whybeyoung
Loading...
[RL] support weight updation with dp attention
run-ci
#11669
opened Oct 15, 2025 by
zhuzilin
Loading...
1 of 4 tasks
Manually flip deepep_mode for cuda_graph
run-ci
#11666
opened Oct 15, 2025 by
zhuzilin
Loading...
1 of 4 tasks
WIP: Use trtllm_mla decode kernel for draft extend in speculative decoding
run-ci
#11664
opened Oct 15, 2025 by
Qiaolin-Yu
Loading...
4 tasks
fix: bench_serving error with PD disaggregation
run-ci
#11662
opened Oct 15, 2025 by
Yi-sir
Loading...
4 tasks
fix: replace buggy flashinfer fp4 gemm with sgl-kernel fp4 gemm
run-ci
#11661
opened Oct 15, 2025 by
Qiaolin-Yu
Loading...
4 tasks
check_offload_progress more frequently
run-ci
#11656
opened Oct 15, 2025 by
pansicheng
Loading...
4 tasks
Fuse writing KV buffer into rope kernel (amd gpu sgl-kernel)
#11654
opened Oct 15, 2025 by
wejoncy
Loading...
4 tasks
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.
You can’t perform that action at this time.