Carlos Fundora carlosfundora

🎯

Focusing

Achievements

gfxGRAPH gfxGRAPH Public

Drop-in CUDA Graph → HIP Graph translation layer for AMD gfx1030/1031 (RDNA2), featuring DeepSpeed-HIP inference kernels, safe eager fallback, dynamic-shape bucketing, and pure-Rust architectural c...

Python 1 1
llama.cpp-1-bit-turbo llama.cpp-1-bit-turbo Public

Forked from ggml-org/llama.cpp

HIP/ROCm fork optimized for AMD RDNA2 (gfx1030) with PrismML Q1_0_G128 1-bit quant support, RotorQuant, TurboQuant, EAGLE3 and P-EAGLE speculative decoding, and full Wave32 kernel optimizations.

C++ 16
sglang-1-bit-turbo sglang-1-bit-turbo Public

Forked from sgl-project/sglang

AMD ROCm (gfx1030) inference fork with RotorQuant/TurboQuant KV compression, PHANTOM-X zero-copy draft speculation, EAGLE3 speculative decoding, 12 RDNA2 crash fixes, and PrismML Bonsai Q1_0_G128 1...

Python 5 1