Name	Name	Last commit message	Last commit date
Latest commit History 14 Commits
.github	.github
examples	examples
src	src
tests	tests
.gitignore	.gitignore
CONTRIBUTING.md	CONTRIBUTING.md
LICENSE	LICENSE
README.md	README.md
package.json	package.json
pnpm-lock.yaml	pnpm-lock.yaml
tsconfig.json	tsconfig.json
tsup.config.ts	tsup.config.ts
vitest.config.ts	vitest.config.ts

VIGIL

Zero-dependency, <2ms safety guardrails for AI agents.

Vigil validates what AI agents do, not what they say. Drop it in front of any tool-calling agent to catch destructive commands, data exfiltration, SSRF, injection attacks, and more — before they execute.

Install

npm install vigil-agent-safety

Or via ClawdHub (for Clawdbot users):

npx clawhub install vigil

Quick Start

import { checkAction } from 'vigil-agent-safety';
const result = checkAction({
 agent: 'my-agent',
 tool: 'exec',
 params: { command: 'rm -rf /' },
});
console.log(result.decision); // "BLOCK"
console.log(result.rule); // "destructive"
console.log(result.reason); // "Destructive command: matched pattern..."

What It Catches

Category	Examples	Decision
Destructive	`rm -rf /`, `mkfs`, reverse shells	BLOCK
SSRF	`169.254.169.254`, `localhost:6379`, `gopher://`	BLOCK
Exfiltration	`curl evil.com`, `.ssh/id_rsa`, `.aws/credentials`	BLOCK
SQL Injection	`DROP TABLE`, `UNION SELECT`, `OR 1=1`	BLOCK
Path Traversal	`../../../etc/shadow`, `/proc/self`	BLOCK
Prompt Injection	"Ignore previous instructions", `[INST]` tags	BLOCK
Encoding Attacks	`base64 -d`, `eval(atob(...))`, hex escapes	BLOCK
Credential Leaks	API keys, AWS keys, private keys, tokens	ESCALATE

22 battle-tested rules. All pattern-based. All under 2ms.

Why Vigil?

Existing safety tools (Llama Guard, ShieldGemma) filter content — what agents say. Vigil validates actions — what agents do. Content safety ≠ action safety.

Vigil	Llama Guard	Regex	GPT-4 Review
Latency	<2ms	200-500ms	<1ms	2-5s
Dependencies	0	PyTorch	0	API key
Validates	Actions	Content	Strings	Content
Offline	✅	✅	✅	❌

CLI

# Check a tool call
npx vigil-agent-safety check --tool exec --params '{"command":"rm -rf /"}'
# JSON output for scripting
npx vigil-agent-safety check --tool exec --params '{"command":"ls"}' --json
# List policy templates
npx vigil-agent-safety policies

Exit codes: 0=ALLOW, 1=BLOCK, 2=ESCALATE

API

`checkAction(input): VigilResult`

import { checkAction } from 'vigil-agent-safety';
const result = checkAction({
 agent: 'my-agent', // optional
 tool: 'exec', // tool being called
 params: { command: '...' }, // tool parameters
 role: 'developer', // optional
 context: ['...'], // optional
});
// result: { decision, rule, confidence, risk_level, reason, latencyMs }

`configure(config)`

import { configure } from 'vigil-agent-safety';
configure({
 mode: 'warn', // 'enforce' | 'warn' | 'log'
 onViolation: (result, input) => {
 console.log(`[vigil] ${result.decision}: ${result.reason}`);
 },
});

`loadPolicy(name)`

import { loadPolicy } from 'vigil-agent-safety';
const policy = loadPolicy('moderate'); // 'restrictive' | 'moderate' | 'permissive'
// Or load custom: loadPolicy('./my-policy.json')

Integration Examples

See examples/ for complete integration patterns:

basic.ts — Minimal usage
express-middleware.ts — HTTP middleware
mcp-wrapper.ts — MCP server wrapper
circleci-mcp.ts — CircleCI CI/CD safety (protected branches, secret access, rate limiting)
langchain-callback.ts — LangChain integration
openclaw-extension.ts — OpenClaw/Clawdbot agent extension
generic-hook.ts — Generic before-tool-call hook

Roadmap

Vigil v0.1.0 ships with pattern-based rules — fast, predictable, zero dependencies. Here's what's coming:

🔜 v0.2 — Policy Engine + MCP Proxy

Custom YAML policy files for org-specific rules
Per-agent permission scoping (agent X can only call tools Y, Z)
Allowlist/blocklist for paths, domains, commands
MCP Proxy — drop-in safety layer for any MCP server. Zero code changes, just a config update. Works with Claude Desktop, Cursor, Windsurf, and any MCP client.

🔜 v0.3 — Vigil Cloud + Audit Logging

Hosted API with dashboard and warn-mode analytics
Structured JSON audit logs for compliance
Team policies with role-based access
See what your agents are actually doing: "47 risky actions blocked this week across 3 agents"

🧪 v0.4 — Benchmarks + Hybrid ML

Published false positive/negative rates across standard threat datasets
Optional cloud ML classification for ambiguous cases (rules first, ML as fallback)
Plugin architecture for custom rule functions
vigil report CLI for security posture snapshots

🧠 v0.5+ — Local ML Model

Fine-tuned safety model on HuggingFace for GPU users
Catches attacks that bypass pattern matching (obfuscation, indirect injection)
Same API — checkAction() automatically upgrades, no code changes

🏁 v1.0 — When It's Earned

v1.0 ships when Vigil has 100+ production users, external benchmarks, and proven accuracy. Not before.

Want to influence the roadmap? Open an issue or star the repo to show interest.

Support Vigil

Vigil is free and open source. If it saves you time or keeps your agents safe, consider supporting the project:

⭐ Star this repo — helps others discover Vigil
💖 Sponsor on GitHub — recurring support
🪙 Crypto donations:
- EVM: 0x3AA32976b514F4caaad1e8C69fD55d0E89B50a0e
- BTC: bc1qzqz9pnrngtq9y4tt9e7vznknxn4dtmphe2pppn
- SOL: 8ag7B9DvnUdrgmbYnYxhAv25jwcLjzyoWt8uzYGd5XSC

Every bit helps us keep building open source security tools.

License

Apache 2.0 — Built by HexIT Labs

Uh oh!

License

hexitlabs/vigil

Folders and files

Latest commit

History

Repository files navigation

VIGIL

Install

Quick Start

What It Catches

Why Vigil?

CLI

API

checkAction(input): VigilResult

configure(config)

loadPolicy(name)

Integration Examples

Roadmap

🔜 v0.2 — Policy Engine + MCP Proxy

🔜 v0.3 — Vigil Cloud + Audit Logging

🧪 v0.4 — Benchmarks + Hybrid ML

🧠 v0.5+ — Local ML Model

🏁 v1.0 — When It's Earned

Support Vigil

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Languages

`checkAction(input): VigilResult`

`configure(config)`

`loadPolicy(name)`

Packages