I build AI systems and lead teams.
Currently working on:
Applied AI systems • Autonomous agents and orchestration • Evals and reliability • AI coding systems & DX
I build AI systems and lead teams.
Currently working on:
Applied AI systems • Autonomous agents and orchestration • Evals and reliability • AI coding systems & DX
Agentic personal OS to automate high-leverage workflows with Claude Code, Codex, Pi, OpenClaw and other coding agents/ runtime platforms.
AI agent evaluation framework for full trajectories: tasks, actions, observations, verified state and behavior, rewards, baselines, and RL-ready exports.
Python 1
End-to-end LLM eval framework for AI products. Provider-agnostic and agent-native, with skills, traces, judge validation, and a human review interface.
JavaScript 3
Forked from huggingface/nanotron
Minimalistic large language model 3D-parallelism training
Python
#!/usr/bin/env python3
# nanoclaw.py - A minimal OpenClaw
# Multi-agent CLI with role-based agents
# Run: uv run --with anthropic --with schedule python nanoclaw.py