-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Pull requests: antirez/ds4
Pull requests list
rocm: fix distributed inference on unified-memory APUs (strix halo / gfx1151)
#407
opened Jun 13, 2026 by
kyuz0
Loading...
[3/N] add prefetch support for CUDA backend : running ds4 for any GPU with cache (2.75 x faster!)
#402
opened Jun 12, 2026 by
yiakwy-xpu-ml-framework-team
Loading...
Add multi-column attn-out low projection kernel for small batches
#399
opened Jun 11, 2026 by
rwl4
Loading...
CUDA backend (DGX-Spark) — refactored into modular .cuh files mirroring ROCm structure
#398
opened Jun 11, 2026 by
gundemirbas
Loading...
ROCm runtime: configurable weight cache limit and arena chunk size via environment variables
#397
opened Jun 11, 2026 by
gundemirbas
Loading...
Add env-gated prompt-lookup speculative decoding for greedy generation
#396
opened Jun 11, 2026 by
rwl4
Loading...
fix(kv-cache): refresh cold anchor after partial prefix hits
#394
opened Jun 11, 2026 by
TerryChengTW
Loading...
3 tasks done
Add teaching mode to ds4-agent, with teach-bench benchmark
#391
opened Jun 11, 2026 by
rowantrollope
Loading...
Clamp MTP draft depth to the prefill capacity
#381
opened Jun 10, 2026 by
pandysp
Contributor
Loading...
feat: add native Agent Skills support to ds4-agent
#380
opened Jun 10, 2026 by
fry69
Contributor
Loading...
Keep live KV reusable when clients strip transient metadata blocks
#378
opened Jun 10, 2026 by
adv0r
Loading...
[2/N] add cuda imatrix support for custom RL model
#377
opened Jun 10, 2026 by
yiakwy-xpu-ml-framework-team
Loading...
ds4_server: Add /health endpoint that returns HTTP 200 once model is fully loaded
#374
opened Jun 9, 2026 by
mcmalayalam
Loading...
Fix agent edit: accept [upto] markers indented or padded with blanks (+ golden cases)
#373
opened Jun 9, 2026 by
rinaldofesta
Loading...
Add continuous depth-1 MTP speculation (DS4_MTP_CONTINUOUS)
#371
opened Jun 9, 2026 by
pandysp
Contributor
Loading...
[1/N] add fp8 fp32 scale support for custom RL model
#368
opened Jun 9, 2026 by
yiakwy-xpu-ml-framework-team
Loading...
make: consistent ROCm targets (rocm-strix-halo / rocm-generic) + portable lib paths (#357, #179)
#365
opened Jun 8, 2026 by
jamesburton
Loading...
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.