Question 1

What is agent-guardrails?

Accepted Answer

agent-guardrails is a zero-dependency JavaScript library that wraps an AI agent's actions — tool calls, shell commands, model outputs — with invariants, cost caps, and a bounded tool scope. A blocking violation halts the action before it runs, so the unsafe call never executes. About 3 KB, Node 18+, MIT-licensed.

Question 2

How do you stop an AI agent from running a destructive command?

Accepted Answer

Wrap every action in a guard that checks it before execution. Built-in guardrails cover the common failures: allowTools bounds scope to an allowlist, denyDestructive blocks rm -rf /, DROP DATABASE, force-push to main and sudo, noSecrets blocks leaked credentials, and maxCost / maxCalls cap spend and runaway loops. A blocking check throws before the tool ever runs; warnOn flags an action without stopping it.

Question 3

Why do AI agents fail in production?

Accepted Answer

Rarely on model quality — almost always on the operating layer around the model: unbounded scope, runaway cost, destructive tool calls, leaked secrets, and no hard stop when the agent goes off the rails. The agents that actually ship share three properties: bounded scope, guardrails, and a hard stop on violation. These patterns came from running an autonomous multi-agent system unattended in production.

allowTools(list)	tool calls outside an allowlist (bounded scope)
denyDestructive()	rm -rf /, mkfs, fork bombs, DROP DATABASE, force-push main, sudo…
noSecrets()	actions containing API keys / tokens / private keys
maxCost(budget)	cumulative spend over a budget (the cost-spiral problem)
maxCalls(limit)	runaway loops — caps total executed actions
validate(fn)	actions/outputs that fail a schema or predicate
warnOn(fn)	non-blocking — flags an action without stopping it

agent-guardrails

Why

Quickstart

Built-in guardrails

Questions