I build local LLM inference stacks from source on consumer hardware, benchmark models systematically, and publish datasets on HuggingFace. I also build analytics dashboards and have scaled a tech community to 20,000+ members.

Current Focus:

local inference optimisation (llama.cpp, CUDA..)
systematic benchmarks across dense, MoE, and hybrid architectures
quantisation testing (GGUF Q4_K_M, IQ4_XS, turboquant turbo2/turbo3)
context window scaling analysis and VRAM profiling
publishing benchmark datasets on HuggingFace

🛠️ Tech Stack

📊 Background

AI / ML Practitioner - local LLM inference, model evaluation, HuggingFace contributor
Growth Lead @ Yari Finance - DeFi protocol growth, partnerships, on-chain analytics
Founder @ BeraLand - built a 20K+ member blockchain community from zero
15+ Dune dashboards tracking 1ドルB+ in trading volume
Master's in Corporate & Market Finance - KPMG background

🎓 Learning Journey

Boot.dev Profile

I write about AI infrastructure, local inference, and model evaluation on X

Pinned Loading

llm-bench-rig llm-bench-rig Public

Dual-engine (llama.cpp + vLLM) LLM benchmarking pipeline for GGUF & safetensors on NVIDIA GPUs — speed, quality, live dashboard, publishable cards.

Python 11 2
hermes-recipes hermes-recipes Public

tested hermes agent recipes: configs, deploys, mcp, automations. copy, run, build.

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

witcheer notwitcheer

Achievements

Achievements

Block or report notwitcheer

AI Practitioner & Data-Driven Growth Specialist

What I Do

🛠️ Tech Stack

AI / ML

data & analytics

frontend

infra & tools

📊 Background

🎓 Learning Journey

Pinned Loading

Uh oh!