Name	Name	Last commit message	Last commit date
Latest commit History 106 Commits
.claude-plugin	.claude-plugin
.codex	.codex
.github/workflows	.github/workflows
__tests__	__tests__
apps/web-ui	apps/web-ui
crates	crates
docs	docs
examples	examples
packages	packages
scripts	scripts
.editorconfig	.editorconfig
.gitignore	.gitignore
CHANGELOG.md	CHANGELOG.md
CODE_OF_CONDUCT.md	CODE_OF_CONDUCT.md
CONTRIBUTING.md	CONTRIBUTING.md
Cargo.toml	Cargo.toml
LICENSE	LICENSE
README.md	README.md
SECURITY.md	SECURITY.md
deny.toml	deny.toml
package-lock.json	package-lock.json
package.json	package.json
renovate.json	renovate.json
rust-toolchain.toml	rust-toolchain.toml
vitest.config.ts	vitest.config.ts

agent-harness-generator

The meta-harness for AI agents — a harness that builds other harnesses.

Like ruflo is the meta-harness for Claude, this is the meta-harness for AI agents themselves: a system whose job is to produce focused, vertical, branded agent harnesses that run on any host. Pick primitives, pick content, supply identity → ship a npm-publishable harness with your own npx <name> CLI, MCP server, memory, learning loop, and witness-signed releases.

Open the Studio → npm — coming soon Tests — 563 passing CI — 16 jobs License MIT

Claude Code Codex pi.dev Hermes OpenClaw RVM

Rust + WASM NAPI-RS GCP-gated publish Witness signed

⚡ Try it in 30 seconds

# Browser — zero install. Live now.
open https://ruvnet.github.io/agent-harness-generator/
# OR — terminal scaffold
npx create-agent-harness my-bot --template vertical:coding --host claude-code
cd my-bot && npx . --help

That's it. You now own an npm-publishable AI agent harness with your name on it, your branding, your agents, your MCP server, your memory, and an npx <your-name> command. It runs on Claude Code, Codex, pi.dev, Hermes, OpenClaw, or RVM — pick one or all. Nothing leaves your browser or your machine.

Open the Studio

One line: A meta-harness — a marketplace plugin + CLI that scaffolds your own focused, vertical AI agent harnesses with their own npx <name> command, MCP server, memory, learning loop, and brand — that run unchanged on Claude Code, Codex, pi.dev, Hermes, OpenClaw, and RVM.

What's a meta-harness? A harness is a runtime that orchestrates AI agents (memory + routing + hooks + MCP + claims). A meta-harness is the level above: a harness whose product is OTHER harnesses. agent-harness-generator emits self-contained, npm-publishable harnesses you OWN — same kernel, your branding, your agents, your marketplace presence. The kernel updates flow to your harness via @ruflo/kernel npm peer; the content stays yours.

One paragraph: Ruflo bundles primitives (MCP server, hooks, memory bridge, swarm coordinator, intelligence pipeline, claims, routing) WITH opinionated content (60+ agents, 30+ skills, 33 plugins). agent-harness-generator factors those apart. You pick the primitives, pick the content, supply a name + brand, and out comes a brand-new npm-publishable harness with its own CLI, MCP registration, memory namespace, and marketplace identity — running on the host of your choice.

Agent Harness Studio — the browser product

A 100% client-side Studio (in the spirit of ruflo's goal UI) that turns any GitHub repo — or a blank slate — into a governed, branded, multi-host AI agent harness. Recommend agents, skills, commands, MCP tools, and policy; preview the live file tree; download a signed-ready, npm-publishable .zip. Nothing leaves your browser. Desktop- and mobile-friendly, deployable to GitHub Pages.

Open the Studio design ADR-020→024

Agent Harness Studio

Embeddings recommend · rules generate · tests prove parity.

Four tabs — the agent-harness supply chain

Tab	What it does
Repo → Harness	Paste a GitHub URL → deterministic repo analysis → archetype scoring → an editable harness plan (agents, skills, commands, MCP mode, risk policy, confidence). No repo code is ever executed. Semantic engine: Lexical (default, deterministic) or optional in-browser MiniLM embeddings (Transformers.js, WebGPU/WASM).
Create harness	Branded-runtime builder: 16 quick-start verticals, composable agents/skills/commands, kernel options, and the Primitives panel (CLI · MCP · memory · learning · witness · release gates). Live file tree + `<name>.zip`, byte-compatible with `create-agent-harness`.
Skill / Agent / Command	Author or pick a single artifact → a Claude-ready `SKILL.md` folder (YAML frontmatter) you drop straight into Claude desktop or claude.ai.
Verify	Drop a generated `.zip` → unzipped and checked in-browser (structure · kernel dep · host wiring · unresolved vars · MCP policy · secrets). Nothing uploaded.

Repo → Harness Author a Claude skill/agent/command In-browser Verify

Mobile-friendly

MCP — one selectable, security-first primitive

MCP is included as a first-class adapter surface, not the core identity. It is modular, gated, and default-deny (ADR-022):

Modes: off · local (stdio) · remote (Streamable HTTP + auth).
Emits src/mcp/{server,tools,resources,prompts,policy,audit}.ts (+ auth.ts remote) and a scannable .harness/mcp-policy.json.
Safe defaults: default-deny, no network/shell/file-write, approve-dangerous, 30 s timeout, 8 calls/turn, audit on.
harness mcp-scan <path> — "npm audit for agent tools": static-only scan (never executes) flagging shell/network grants, missing audit/timeouts, wildcard permissions, unguarded secrets, and unpinned deps. Exit 1 on any HIGH.

CLI Repo → Harness (ADR-026) — the deeper, local counterpart to the Studio's importer:

harness analyze-repo . # local, analysis-only → repo-profile.json + harness-plan.json
harness analyze-repo . --embed # opt-in deterministic embeddings via @ruvector/ruvllm (offline; lexical fallback)
harness analyze-repo . --scaffold my-harness # materialise the recommended harness

No repository code is executed; inferred build/test commands are emitted as trust: inferred · execution: disabled.

Quick start

cd apps/web-ui
npm install
npm run dev # local Studio
npm test # 48 generator unit tests
npm run e2e # Playwright desktop + mobile (zero console errors)
npm run bench # generator hot-path micro-bench (sub-100μs/op)

Source + rationale: apps/web-ui/ · ADRs 020 · 021 · 022 · 023 · 024 · 025.

Quick-start templates — 18 verticals

One canonical catalog (templates/catalog.def.mjs) feeds the CLI templates, the catalog.json the template-catalog Rust crate validates, and the Studio gallery — so UI, CLI, and core never drift.

npx create-agent-harness --list # browse the catalog
npx create-agent-harness my-bot --template vertical:coding

Category	Templates
Starter / Operations	`minimal`, `vertical:devops`
Engineering	`vertical:coding`, `vertical:ai`
Knowledge	`vertical:research`, `vertical:ruview` (ruvector), `vertical:education` (mastery-based tutoring, iter 80)
Finance / Professional	`vertical:trading`, `vertical:legal`, `vertical:health`
Customer / Growth	`vertical:support`, `vertical:crm`, `vertical:marketing`, `vertical:advertising`, `vertical:sales` (B2B pipeline, iter 87)
Business / Frontier	`vertical:business`, `vertical:agentics`, `vertical:exotic`

Each generated vertical ships bespoke domain agents (with system prompts), skills, commands, and guarded per-host settings. The 11 generated dirs come from the definition; minimal, devops, trading, support, legal, and research are curated by hand and never clobbered by the generator.

Latest vertical — vertical:education (iter 80): a tutoring pod (tutor → explainer → quiz-master → grader) over per-learner mastery memory. Pedagogy invariants are policy: abstain-not-hallucinate floor, no teaching on unmastered prereqs, hidden rubrics. Two commands: teach-next (run a teaching cycle) and mastery-report (summarise the learner's map). MCP namespaces mastery_log + curriculum; everything else default-deny.

Quick try

# 50ms scaffold → validate → cleanup, no network, exits 0 if healthy
node examples/quickstart/quickstart.mjs
# Try every supported host (claude-code | codex | pi-dev | hermes | openclaw | rvm)
node examples/quickstart/quickstart.mjs --host=codex
# 20ms two-instance federation handshake demo
node examples/federation/federation.mjs

See examples/ and docs/ARCHITECTURE.md for the layered map.

Status

Production-ready release pipeline. Full CI matrix green: 16 jobs across Rust ×ばつ 3 OS + WASM ×ばつ 3 OS + Node 20+22 ×ばつ 3 OS + Bench + pack+install ×ばつ 3 OS + CI-passed aggregator. The release flow is a single command (node scripts/release.mjs <bump> --push) that bumps 15 sources atomically, runs all gates, and tags. See ADR-019 for the architectural lockdown.

Layer	Status	Where
Kernel (Rust + WASM + NAPI-RS)	Shipped	`crates/kernel/` — 7 subsystems
`@ruflo/kernel` runtime resolver	Shipped	`packages/kernel-js/`
6 host adapters	Shipped	claude-code / codex / pi-dev / hermes / openclaw / rvm
`create-agent-harness` CLI	Shipped	scaffold + 15 `harness` subcommands (see row below)
6 Codex skills	Shipped	create / publish / validate / secrets / verify-witness / upgrade-harness
Claude marketplace plugin	Shipped + schema-validated	`.claude-plugin/plugin.json`
Witness signing (Ed25519)	Shipped + tamper-tested	ADR-011
MCP tool dispatch	Shipped + integration-tested	11 end-to-end cases
Federation transport	Shipped + runnable demo	`examples/federation/`
Release pipeline (6 primitives + 1 orchestrator)	Shipped	ADR-019
CI matrix (16 jobs across Rust + WASM + Node + Bench + pack+install)	Green	`.github/workflows/ci.yml`
Security (cargo-audit + cargo-deny + npm-audit + CodeQL + audit-deps aggregate)	Green	`.github/workflows/security.yml`
Publish pipeline (GCP WIF + 2 gates + 11 packages + IPFS pin)	Wired + tested	`.github/workflows/publish.yml`
Test suite	563/563	66 test files
15 `harness` subcommands	Shipped	sign · verify · doctor · federate · secrets · validate · mcp · publish · upgrade · completions · sbom · audit · mcp-scan · analyze-repo · diag (ADR-027)
Agent Harness Studio (live Pages)	Shipped	https://ruvnet.github.io/agent-harness-generator/ — 100% client-side
Perf-regression gate (host-bench-baseline)	Wired + baselined	`packages/bench/host-baseline.json` — 50% threshold
SBOM (SPDX-2.3)	Wired	`scripts/sbom.mjs` — CI artifact

Day-to-day	Wall time	Command
Did I break anything?	<1s	`node scripts/healthcheck.mjs`
Is the deployed Studio alive?	~1s	`node scripts/healthcheck.mjs --probe-pages`
Is THIS local kernel compatible with this harness?	<100ms	`harness diag <path>`
Same, for a CI script	<100ms	`harness diag <path> --json`
File a support ticket — bundle everything the maintainer needs	<100ms	`harness diag <path> --bundle > bundle.json`
Is this scaffolded harness release-ready?	<1s	`harness validate <path>`
Is this branch release-ready?	~30s	`node scripts/preflight.mjs`
Preview the v0.1.0 GH release body	~1s	`node scripts/release-notes.mjs --version=0.1.0`
Same, tweet-length summary	~1s	`node scripts/release-notes.mjs --version=0.1.0 --summary`
Cut a release	~60s	`node scripts/release.mjs patch --push`

Architecture in 60 seconds — the meta-harness pattern

 You <- harness AUTHOR (uses the meta-harness)
 |
 v
 agent-harness-generator <- THE META-HARNESS
 (this repo: scaffolds, signs, publishes harnesses)
 |
 v
 Your harness (npm package) <- THE HARNESS YOU SHIP
 |
 v
 Your users
 |
 v
 npx <your-name> <- Identity (rename + brand)
 |
 v
 <your-harness> <- Content (your agents/skills/plugins/prompts)
 |
 v
 @ruflo/kernel <- Kernel (shared primitives, Rust + WASM + NAPI-RS)
 |
 v
 Host adapter <- Per-host abstraction
 (Claude Code / Codex / pi.dev / Hermes / OpenClaw / RVM)
 |
 v
 LLM providers

Read top-down: you (the harness author) operate the meta-harness. The meta-harness produces your harness. Your harness is what users install. They never see the meta-harness layer — only the brand and CLI you ship.

The kernel is Rust source code compiled to two targets: WebAssembly (primary, cross-platform) and per-platform native binaries via NAPI-RS (escape hatch for hot Node paths). At load time, @ruflo/kernel prefers the native package for the current platform and falls back to wasm.

Working precedent: @ruvector/emergent-time@0.1.0 — 55 KB wasm-opt'd module shipping today through exactly this Rust → wasm-pack → npm pipeline.

Host support

Host	Integration shape	Notes
Claude Code	MCP server + 5-handler-type hooks + 3-scope settings	Ruflo-native target; richest hook surface
OpenAI Codex	MCP via `~/.codex/config.toml` `[mcp_servers.*]` tables	TOML not JSON; no first-class hooks
pi.dev	Pi extension (TypeScript via `pi install npm:...`)	No MCP by design — adapter uses `pi.registerTool()`
Hermes Agent	MCP-supported runtime (`optional-mcps/`)	Adapter scrubs `<think>` + stray `<tool_call>` per issue #741
OpenClaw	MCP via `~/.openclaw/openclaw.json` + workspace skills	Personal AI assistant gateway with built-in multi-platform messaging (WhatsApp/Telegram/Slack/Discord)
RVM	Bare-metal microhypervisor (AArch64) with capability tokens + hash-chained witness	Hardware-isolated deployment for federated / multi-tenant / untrusted-peer scenarios (ADR-018)

See ADR-004 — Host integration model.

Quality gates

Concern	Where	What it does
CI	`ci.yml`	Rust 3-platform matrix (fmt + clippy `-D warnings` + test + doc), wasm build + `wasm-tools validate` + 500 KB size budget, Node 20/22 ×ばつ 3-platform tests
Publish gate	`publish.yml`	GCP Workload Identity Federation auth → Secret Manager fetches `NPM_TOKEN` → smoke test → `npm publish --provenance` (SLSA L2)
Security	`security.yml`	cargo-audit, cargo-deny, npm-audit, CodeQL, weekly cron
Smoke	`smoke.mjs`	Kernel loads, `kernelInfo().version` matches `package.json`, `mcpValidate` accepts/rejects correctly
Provenance	ADR-011	Ed25519-signed witness manifest, byte-deterministic across CI runners (wasm enables this)

Quick start (developers)

git clone https://github.com/ruvnet/agent-harness-generator
cd agent-harness-generator
# Rust workspace
cargo test --workspace
cargo clippy --workspace --all-targets -- -D warnings
# WASM build
npm run build:wasm
# TypeScript + smoke
npm install
npm run build
npm run smoke

See CONTRIBUTING.md for the full developer workflow.

Read the design (17 ADRs + INDEX)

Start here: docs/adrs/INDEX.md

Highlights:

ADR-001 Goals & non-goals — what this is and isn't
ADR-002 Kernel boundary — Rust + WASM + NAPI-RS
ADR-002a Publishing pipeline — Cargo workspace + wasm-pack + napi build
ADR-003 Generator architecture — create-vite-style templates + AST-aware rename
ADR-004 Host integration — adapter contract per host
ADR-006 Memory + learning — @ruvector/emergent-time@0.1.0 integration
ADR-009 Anti-slop — derived trust tiers
ADR-011 Witness + provenance — signed manifests

Related projects

ruflo — the meta-harness this generator factors apart
ruvector — vector + agentic database (memory backend)
@ruvector/emergent-time — memory-decay clock the kernel uses
NAPI-RS — Rust → Node bindings used for the native target

License

MIT — see LICENSE.

Keywords: agent harness, agent harness generator, AI agent scaffolding, MCP server, Claude Code plugin, Codex plugin, pi.dev extension, hermes agent, multi-agent framework, agentic AI, agentic workflow, autonomous agents, agent orchestration, vertical AI harness, agent CLI generator, npm create agent, npx scaffold, Rust WASM kernel, NAPI-RS, wasm-bindgen, wasm-pack, agent memory, ReasoningBank, HNSW, emergent time, agent provenance, witness manifest, plugin marketplace, IPFS registry, drift detection, anti-slop, TDD, self-evolving agent, federated agents, swarm intelligence, GCP Workload Identity Federation, Secret Manager, SLSA provenance, npm provenance

Folders and files

Latest commit

History

Repository files navigation

agent-harness-generator

⚡ Try it in 30 seconds

Agent Harness Studio — the browser product

Four tabs — the agent-harness supply chain

MCP — one selectable, security-first primitive

Quick start

Quick-start templates — 18 verticals

Quick try

Status

Architecture in 60 seconds — the meta-harness pattern

Host support

Quality gates

Quick start (developers)

Read the design (17 ADRs + INDEX)

Related projects

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages