-
Notifications
You must be signed in to change notification settings - Fork 3.1k
Pull requests: sgl-project/sglang
Pull requests list
[sgl-kernel] enhance sgl-kernel import logic for sm8x
run-ci
#11707
opened Oct 16, 2025 by
FlamingoPg
Loading...
1 of 4 tasks
[NVIDIA] Update to leverage flashinfer trtllm FP4 MOE throughput kernel
high priority
run-ci
#11563
opened Oct 13, 2025 by
jiahanc
Loading...
4 tasks done
[Do not merge] Adding flashinfer_cubin
high priority
run-ci
#11459
opened Oct 11, 2025 by
Fridge003
Loading...
4 tasks
[Docs] update sgl-kernel readme
run-ci
#11379
opened Oct 9, 2025 by
FlamingoPg
Loading...
1 of 4 tasks
[Feature] Qwen3-Next & FLA: Support MTP topk>1; Up to 1.7% faster
high priority
run-ci
#11133
opened Oct 1, 2025 by
byjiang1996
Loading...
4 tasks done
model: qwen3-omni (thinker-only)
high priority
run-ci
#10911
opened Sep 25, 2025 by
mickqian
Loading...
4 tasks
[Feature] Accelerate Simple-EAGLE with a Fused Verify-Draft CUDA Graph
high priority
run-ci
#10866
opened Sep 24, 2025 by
zhanxxxxxxx
Loading...
4 tasks
[6/n]decouple quantization implementation from vLLM dependency
ready-to-merge
The PR is ready to merge after the CI is green.
run-ci
#10750
opened Sep 22, 2025 by
Hongbosherlock
Loading...
4 tasks
[benchmark] refactor bench (part 1)
high priority
run-ci
#10409
opened Sep 13, 2025 by
XucSh
Loading...
4 tasks
Add support for bf16 x bf16 cutlass fused MoE
high priority
run-ci
#10275
opened Sep 10, 2025 by
nvcastet
Loading...
4 tasks
feat: Add FP4 (E2M1) KV Cache Support with Quantization Utilities for MLA
high priority
quant
LLM Quantization
run-ci
#10078
opened Sep 5, 2025 by
JackChuang
Loading...
4 tasks done
[Fix] Enhance flush_cache in PD disaggregation
high priority
run-ci
#9865
opened Sep 1, 2025 by
duzeyan
Loading...
1 of 4 tasks
[Interface] Add an alias for
sglang launch server
high priority
#9800
opened Aug 29, 2025 by
vincentzed
Loading...
4 tasks
update sgl-kernel for w4afp8_machete
high priority
run-ci
#9736
opened Aug 28, 2025 by
mianpeng
Loading...
Support modelopt llama4 nvfp4 workflow and fix issues
high priority
#9526
opened Aug 23, 2025 by
Edwardf0t1
Loading...
1 of 4 tasks
[NVIDIA] adding DSR1 deployment guide on B200
#9408
opened Aug 20, 2025 by
kushanam
Loading...
4 tasks
[POC] Overlap scheduler refactor with SD
enhancement
New feature or request
high priority
run-ci
speculative-decoding
#9334
opened Aug 19, 2025 by
hnyls2002
Loading...
Optimize alloc_decode performance
high priority
#8967
opened Aug 8, 2025 by
fzyzcjy
Loading...
6 tasks
[CI] Defer Resource-Intensive Tests to Scheduled Runs(Nightly and Weekly)
ci
continue integration related
enhancement
New feature or request
#8873
opened Aug 6, 2025 by
key4ng
Loading...
6 tasks
Fix positional argument
high priority
#8792
opened Aug 5, 2025 by
liquanfeng
Loading...
6 tasks done
[2/N]Support DeepSeek-R1 w4a8 low latency deepep
high priority
#8464
opened Jul 28, 2025 by
ayrnb
Loading...
6 tasks
fix: fix mtp use flashmla backend bugs
#7710
opened Jul 2, 2025 by
zhangxiaolei123456
Loading...
6 tasks
ProTip!
Mix and match filters to narrow down what you’re looking for.
You can’t perform that action at this time.