-
Notifications
You must be signed in to change notification settings - Fork 204
Pull requests: SemiAnalysisAI/InferenceX
Pull requests list
[Klaud Cold] MI300X MiniMax-M3 nightly image and FP8 KV cache
full-sweep-fail-fast
#1858
opened Jun 19, 2026 by
cquil11
Collaborator
Loading...
[AMD] Add MiniMax-M3-FP4 MI355X ATOMMESH
AMD
full-sweep-enabled
#1856
opened Jun 19, 2026 by
seungrokj
Collaborator
Loading...
4 tasks
[AMD] Add DSv4-FP4-MI355X ATOMMESH MTP
AMD
full-sweep-enabled
#1855
opened Jun 19, 2026 by
seungrokj
Collaborator
Loading...
2 tasks
[codex] Add all-evals matrix expansion mode
#1854
opened Jun 19, 2026 by
Oseltamivir
Collaborator
Loading...
6 of 8 tasks
[codex] Cover every multinode parallelism in evals
#1850
opened Jun 19, 2026 by
Oseltamivir
Collaborator
•
Draft
[AMD] Optimize MiniMax M3 sparse index scoring on MI300X
sweep-enabled
#1840
opened Jun 18, 2026 by
Oseltamivir
Collaborator
Loading...
[Klaud Cold] MI325X MiniMax-M3 EAGLE3 nightly image and FP8 KV cache
full-sweep-fail-fast
#1838
opened Jun 18, 2026 by
cquil11
Collaborator
Loading...
[codex] Update MiniMax M3 B300 FlashInfer image
full-sweep-fail-fast
#1834
opened Jun 18, 2026 by
cquil11
Collaborator
Loading...
[codex] Update MiniMax M3 B300 EAGLE3 FlashInfer image
full-sweep-fail-fast
#1835
opened Jun 18, 2026 by
cquil11
Collaborator
Loading...
[codex] Update MiniMax M3 B200 FlashInfer image
full-sweep-fail-fast
#1833
opened Jun 18, 2026 by
cquil11
Collaborator
Loading...
[codex] Update MiniMax M3 B200 EAGLE3 FlashInfer image
full-sweep-fail-fast
#1832
opened Jun 18, 2026 by
cquil11
Collaborator
Loading...
fix(ci): bound multinode pre-run Slurm cleanup drain loop (unblocks NVIDIA sweeps)
#1820
opened Jun 18, 2026 by
arygupt
Collaborator
Loading...
[AMD] add dsv4 sglang disagg
AMD
full-sweep-enabled
#1818
opened Jun 18, 2026 by
billishyahao
Collaborator
Loading...
Add Qwen3.5-FP8 GB200 SGLang disaggregated benchmark
full-sweep-enabled
#1810
opened Jun 16, 2026 by
RohitNagraj
Collaborator
Loading...
[AMD] [MI300X] minimaxm3-fp8-mi300x-vllm: enable AITER kernels for MXFP8 on MI300X
full-sweep-enabled
#1808
opened Jun 16, 2026 by
JohnQinAMD
Collaborator
Loading...
Fix for https://github.com/sgl-project/sglang/issues/22072
#1806
opened Jun 16, 2026 by
davzhuAMD
Loading...
[NV]Add GLM-5 NVFP4 GB200 disagg non-mtp TRT-LLM benchmarks via Dynamo
full-sweep-enabled
#1803
opened Jun 16, 2026 by
xinli-sw
Collaborator
Loading...
[NV]Add GLM-5 NVFP4 GB200 disagg-mtp TRT-LLM benchmarks via Dynamo
full-sweep-enabled
#1800
opened Jun 16, 2026 by
xinli-sw
Collaborator
Loading...
[NV]Add GLM-5 NVFP4 GB300 disagg-mtp TRT-LLM benchmarks via Dynamo
full-sweep-enabled
#1799
opened Jun 16, 2026 by
xinli-sw
Collaborator
Loading...
[NV]Update Kimi K2.5 NVFP4 GB200 disaggregated TRT-LLM benchmarks via Dynamo
full-sweep-enabled
#1797
opened Jun 16, 2026 by
xinli-sw
Collaborator
Loading...
[NV]Add Kimi K2.5 NVFP4 GB300 disaggregated TRT-LLM benchmarks via Dynamo
full-sweep-enabled
#1796
opened Jun 16, 2026 by
xinli-sw
Collaborator
Loading...
chore(runners): add TensorWave MI300X docker runners (mi300x-tw)
#1793
opened Jun 16, 2026 by
cquil11
Collaborator
Loading...
[NV]dsr1-fp4-b200-sglang: add DPA PDL lane
full-sweep-enabled
#1792
opened Jun 15, 2026 by
hshrivastava-droid
Collaborator
Loading...
ProTip!
Adding no:label will show everything without a label.
You can’t perform that action at this time.