Verifiable Labs

Verifiable Labs builds clean feedback and promotion gates for increasingly general AI agents.

AI agents are getting better at passing the tests they were tuned on. Verifiable Labs helps agents improve through generated clean feedback loops, then verifies whether those improvements truly generalize before promotion — on hidden, out-of-distribution, and adversarial scenarios the agent has never seen.

Website PyPI Zenodo DOI License

What we build

Evaluate — compile an evaluation contract from your agent's goal, generate public / hidden / OOD / adversarial scenarios, and score clean performance with contamination and hack-risk analysis.
Gate — a contamination-resistant promotion gate: clean verified-generalization score (CleanVGS), generalization gap, and an ACCEPT / REJECT / LIMITED_ROLLOUT decision with an assurance card.
Improve — human-reviewed improvement suggestions and candidate agent configs, re-verified by the gate. Improvements are never auto-applied.
Substrate — clean feedback records, transfer metrics, failure memory, and generated curriculum for teams building increasingly general agents.

The privacy-preserving default is evaluate-only: nothing is exported, nothing is reused for training, and human review is required.

Formal foundation

Selected mathematical properties behind the contamination-resistant promotion gate are machine-verified in Lean 4. The implementation is property-tested against the formal specification.

The Lean 4 development and its Python property-test mirror are open source in verifiable-labs-envs (formal/ and src/verifiable_labs_envs/formal_spec/).

Open core

Repository	What it holds
verifiable-labs-envs	SDK, 25 procedurally generated environments, formal track, CLI (Apache-2.0)
vlabs-sdk	SDK contracts: run modes, provider interface, schemas (pointer)
vlabs-formal	Lean 4 formal track + property-test mirror (pointer)
vlabs-examples	Public-safe examples and quickstarts
vlabs-evidence	Redacted sample assurance cards and aggregate metrics
vlabs-docs	Product and positioning documentation

The evaluation platform (scenario generation, contamination firewall, anti-hack engine, billing, API) is private. Hidden evaluation content, gold answers, detection details, customer data, and raw traces are never published — that separation is what keeps the feedback clean.

Published evidence

Public, synthetic / redacted demo evidence:

Hugging Face dataset — https://huggingface.co/datasets/verifiablelabs/vlabs-clean-gate-evidence
Weights & Biases (entity verifiable-labs): clean-generalization-gate · contamination-firewall · anti-hack-engine · scenario-compiler · runpod-costs

All published evidence is synthetic / redacted and is not a training dataset. It contains no customer data, hidden evaluations, gold answers, raw traces, private anti-hack traps, or private engine internals.

Install the SDK: pip install "vlabs-sdk==0.0.2"

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Verifiable Labs

Verifiable Labs

What we build

Formal foundation

Open core

Published evidence

Links

Popular repositories Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!