Capability containment
Every action is checked against a signed behavioral contract before execution. An agent cannot exceed its declared scope — regardless of how its reasoning is manipulated. Blocked before execution, not detected after.
AEGIS is the enterprise platform for autonomous AI agents — signed behavioral contracts declared up front, enforced at every action, and proved in a tamper-evident record. Agentic agent drift is blocked, not just logged.
Prompt injection, misconfiguration, or compromised reasoning routes lead an otherwise legitimate agent to take actions it was never meant to take — Existing controls see the connection. They don't see what the agent does with it.
Logs written by the application itself can be altered, replayed, or quietly dropped by the same process that produced them. An audit that the actor can rewrite is not an audit.
Applications built outside sanctioned infrastructure connect directly to production databases with no governance layer in path — discovered only after a breach, by detection rather than design.
"Prove your AI agents did exactly what they were authorized to do."
Tamper-evident, cryptographically verifiable audit record of every agent action. Built on the CCR ledger — KMS-signed, SHA-256 hash-chained, INSERT-only. Any deletion or modification cryptographically breaks the chain. Satisfies the tamper-evident record-keeping requirement across SOC 2, NIST AI RMF, EU AI Act, and OCC SR 11-7.
"Did the agent stay inside the contract you signed for it?"
Signed behavioral contracts enforced at runtime. Every agent declares what it will do before deployment — permitted tools, data sources, CLI commands, output classes, rate limits — and every action is checked against that declaration before execution. Purpose drift is blocked, not just logged.
"Stop the bad action before it reaches the network."
Real-time enforcement at the boundary between AI agents and the systems they access. Proxy-layer data access, injection detection, rate limiting, CLI interception, MCP security. Actions outside the declared scope do not execute — before they reach the network, not after.
One platform vs. a three-product stack
| Control layer | What it governs | What it cannot govern |
|---|---|---|
| Network firewall / SASE | Who can reach the network; outbound LLM API traffic inspection. | What an authorized agent does once inside; whether actions are consistent with the agent's declared purpose. |
| AI identity platform | Who the agent is; just-in-time privilege grant and automatic revocation (zero standing privileges); non-human identity credential abstraction | Whether actions match declared purpose; behavioral baseline anomaly detection; query-level content inspection; tamper-evident record. Requires a second platform for runtime enforcement. |
| SOC correlation | Unified incident view across AI security, network, and endpoint signals — required to correlate AI agent events with broader breach indicators. | Purpose alignment enforcement, behavioral baselines, proxy-layer data access control. A third product and license; without it, the other two produce isolated, uncorrelated signals. |
| Secrets manager / PAM | Who holds credentials; rotation and expiry. | Whether the agent uses those credentials appropriately for its declared purpose. |
| Database proxy | Who can connect to the database. | Whether the caller is an AI agent; what its declared purpose is; whether this query fits that purpose. |
| AEGISLayer 4 · runtime governance | What AI agents are authorized to do, are doing, and have done — verified against a signed purpose declaration, enforced enforced before every action, proved in a tamper-evident record. | What AI agents are authorized to do, are doing, and have done — verified against a signed purpose declaration, enforced before every action, proved in a tamper-evident record. |
The most common state AEGIS replaces is not a competing product — it is the absence of AI agent governance entirely. Most enterprises deploying AI agents in 2026 have:
If your AI agents are operating today without signed behavioral contracts, enforced scope limits, and a tamper-evident audit record, you already have a gap — one your next examiner, auditor, or incident investigator will find before you do. 'Doing nothing' is not a defensible posture under SR 11-7, SOC 2, or any emerging AI governance framework. The only question is whether you close that gap on your terms or theirs. AEGIS closes it on yours.
AI agents are a new attack surface your existing tools weren't built to cover. Credential theft at the AI layer, prompt injection, behavioral drift, and ungoverned MCP server connections are all CISO-owned problems — and none of them appear in your EDR, SIEM, or DLP today.
Banks under OCC SR 11-7 already govern models — inventory, documentation, monitoring, audit trail. AI agents are being examined under the same framework. AEGIS is SR 11-7 governance extended to the agent runtime layer: the AIM is model documentation, the CCR is the audit trail, the Fusion Engine is ongoing behavioral monitoring.
AppSec finds the gap in AI application security reviews: no signed behavioral contract, no enforced scope limit, no audit trail that holds up. AEGIS turns that finding into a production gate — enforcement that runs before every agent action, with tamper-evident proof it ran. Security findings become shipped requirements instead of deprioritized backlog items.
EU AI Act Article 14 requires human oversight for consequential AI decisions. GDPR Article 22 requires documented safeguards for automated processing. When your regulator asks for evidence that governance ran at the moment the agent acted — not a policy that says it should have — AEGIS is the answer.
Every action is checked against a signed behavioral contract before execution. An agent cannot exceed its declared scope — regardless of how its reasoning is manipulated. Blocked before execution, not detected after.
Every action writes to a hash-chained, append-only ledger signed by cryptographic keys outside the application layer. The audit record cannot be altered by the application itself. Structurally immutable, not policy-protected.
AI applications are never issued direct database credentials. AEGIS is the only path to governed data sources. Rogue AI apps cannot obtain working credentials. By design, not by detection.
Every enforcement decision automatically produces a tamper-evident record that satisfies OCC SR 11-7, GDPR Article 30, SOC 2, NSA MCP security requirements, and EU AI Act Article 14. Proof that governance ran — not a policy that says it should have.
Every component has a defined failure mode — no silent fail-opens. Under any failure, AEGIS either enforces from cache or fails closed. Degradation is always observable, always logged. Failure is predictable, never silent.
The AEGIS SDK lives inside the agent process. It sees full reasoning context, internal session state, and complete tool-call history before any request leaves the process. One import. Unconditional coverage.
SDK · Python primary · TypeScript secondaryThe agent's signed AIM declares its tier, permitted tools, data sources, and CLI commands. The capability check runs before execution and is deterministic — no probabilistic logic, no ML, no false negatives.
Zone 1 · AIM capability checkerEight enforcement components run in sequence — injection classifier, parameter validator, output inspector, CLI interceptor, MCP gateway, rate limiter, behavioral scorer, data-source proxy. Each can stop the action before the next runs.
p99 < 15 ms · sub-4 ms classifierDecision and outcome hashed, linked to the prior block, and signed by a KMS-managed key. INSERT-only DB grants make immutability structural — no UPDATE, no DELETE.
Zone 2 · hash-chained CCRDaily global anchor hashes published to a customer-controlled external location — WORM store, transparency log, notary. Retroactive tampering becomes detectable even by a fully-compromised AEGIS.
External trust root · enterprise tierFour-tier trust model
Every capability must be explicitly declared — tool access, data sources, CLI commands, MCP servers, API endpoints, output classes. No wildcard grants. No default-allow. What isn't declared is blocked, not rate-limited.
When the AIM schema changes, missing or unrecognized fields fail closed — never open. Schema upgrades ship with explicit migrations. The contract never silently degrades.
High-volume data operations must be explicitly declared in the AIM. Undeclared bulk queries are blocked at the proxy layer, not rate-limited.
AIM signatures are tied to a specific key ID so keys can rotate without breaking historical records. An AIM that fails signature verification is denied unconditionally — regardless of tier, regardless of the agent's history.
A stateful service that tracks behavior across agents and sessions — separate from the stateless enforcement plane. Detects coordinated attack patterns and anomalies that only become visible across multiple agents or over time. Horizontally scalable without touching the enforcement plane.
Three deployment modes: single-region SaaS, multi-region with Zone 1 replicas per region, or hub-and-spoke with Zone 1 inside each customer VPC and Zones 0/2/3/4 at a customer-designated hub. MCP Tunneling lets agents reach private-network MCP servers without inbound ports while AEGIS governs every connection.
To both act and erase evidence of acting, an attacker must compromise three structurally separated zones simultaneously. Zone 1 cannot write the ledger. Zone 2 cannot sign. Zone 0 keys never leave custody.
MCP Tunneling enables Claude agents to reach private-network MCP servers over an outbound-only encrypted connection — no inbound ports, no VPN, no firewall rule changes. AEGIS Zone 1 operates as the enforcement layer for every tunneled connection, identically to on-prem MCP traffic.
Deterministic binary authorization against the signed manifest before every action. No probabilistic logic, no ML, no false negatives. The most critical component.
distilBERT + ONNX on CPU. Five-layer defense: structural delimiters, source tagging, ML classification, behavioral divergence, response analysis. Parameter validator blocks all MCP calls if unavailable.
Three-pass post-action: PII redaction, indirect-injection scan of MCP responses, AIM output-class compliance. Uncertainty resolves to redaction — never disclosure.
AST-level command interception for shell access. The only component that never fails open for any tier under any condition. An unparseable command does not run.
AIM-gated connections. Manifest hash verification with automatic traffic suspension on any change. Per-replica nonce cache for replay defense. All server responses pass through the output inspector before re-entering the agent context.
Token bucket per (agent_id, tool_name). Default 120 req/min, burst 20. Sustained breach signals the Fusion Engine. Per-replica — no hub coordination in the hot path.
Continuous scoring against each agent's 14-day baseline. Does not block the hot path — verdict attaches asynchronously. Can suspend or kill a session on sustained anomaly.
AI-application-only proxy to databases and APIs. Legacy apps untouched. AI apps receive short-lived proxy tokens — never direct credentials. Three-layer rogue-app defense: no credentials issued to AI apps; stolen tokens rejected on AIM mismatch; network policy restricts DB port to the proxy IP only.
Each record SHA256-linked to its predecessor. Deleting, reordering, or altering any record breaks the chain — detected on the next verification pass.
RS256 signatures with keys held in HSM/KMS. Key ID (kid) in every record enables rotation without invalidating historical records.
Background verifier runs continuously in a separate trust context. Any chain break generates a non-suppressible CRITICAL incident.
Enterprise tier: daily global anchor hashes published to a customer-controlled WORM store, transparency log, or notary. Retroactive tampering detectable even by a fully-compromised AEGIS.
CCR writes complete before the result returns to the agent. WAL-buffering ensures durability — if the audit store is temporarily unavailable, enforcement continues, a CRITICAL alert fires, and records replay in order on recovery, preserving chain integrity.
Every consequential enforcement decision includes a human-legible rationale: which signals triggered, which policy applied, what threshold was crossed. Suitable for regulator, auditor, and disputed-action review.
Per-agent detail · application binding fingerprints · credential lifecycle · shadow detection · incident workflow — all in the control plane
| Adversary | Primary control | Residual risk |
|---|---|---|
| A — Compromised agent | Deterministic AIM check (Zone 1); agent has no path to Zones 0 or 2. | Over-provisioned AIM (customer configuration risk) |
| B —Prompt Injection-via-logs | CCR content tagged untrusted; analyst copilot cannot take actions; full provenance on every claim. | Misleading advisory text (no action path) |
| C — Malicious insider | Zone separation; dual-control on destructive actions; all admin actions logged to immutable ledger. | Two-party collusion (forensically evident) |
| D — Infrastructure / DB attacker | HSM/KMS keys (never on disk); CCR hash-chain detects tampering; INSERT-only DB grants prevent silent rewrite. | Superuser + key compromise simultaneously |
| E — Model poisoner | Human review gate; label-distribution drift detection; shadow-mode evaluation; signed models; rollback. | Novel slow-poison below drift threshold |
| F — Availability attacker | Reserved enforcement budget; per-agent fairness load shedding; fail-closed under overload. | Sustained volumetric DDoS (shared with infra) |
| G — Rogue AI appdata source proxy | Provisioning-as-gate: no AI app receives direct DB credentials; three-layer defense (no creds / proxy rejects / network policy). | Un-migrated legacy service account credentials |
| H — Credential thiefdata source proxy | Token scoping; AIM enforcement; behavioral anomaly detection; short TTL. | Theft + use within TTL from consistent IP |
| I — Lateral movementdata source proxy | Per-data-source AIM capability check; new-destination behavioral detection. | Over-scoped AIM |
| J — Query exfiltrationdata source proxy | Volume ceiling; query behavioral scoring; output inspection; bulk-query AIM declaration required. | Bulk-legitimate vs. bulk-malicious ambiguity |
Accent rows: adversaries specific to the Data Source Proxy Layer (G–J)
14-day bootstrap window · per-agent fairness shed · sustained shed fires capacity alert — never silent
Human oversight: AEGIS-powered intelligence proposes; never applies. Model updates require human approval before deployment.
ALIGNEDAIM = model documentation. Behavioral baselines = ongoing performance monitoring. CCR = tamper-evident model activity log. Agent Registry = model inventory.
ALIGNEDGOVERN: AIM = governance documentation per agent. MEASURE: Fusion Engine behavioral baselines = ongoing risk measurement. MANAGE: Zone 1 enforcement = runtime risk management at the action layer.
ALIGNEDTamper-evident CCR with hash-chain. Every enforcement decision, admin action, and Copilot session in the immutable record.
IN PROCESSAll 9 recommendations addressed. 7 built-in from initial design; 6 targeted enhancements added in direct response to the CSI publication.
13 / 14 ADDRESSEDAlso aligned · SEC / FINRA AI guidance · multi-tenant isolation with auditable contribution record
Every forwarded event carries a ccr_record_id linking back to the authoritative tamper-evident CCR record. PII redaction rules that govern CCR output apply identically to the syslog stream — the forwarder cannot be configured to bypass redaction.
| Platform | What it governs | Relationship to AEGIS |
|---|---|---|
| ServiceNow GRC / IRM | Risk register, control library, audit workflows. AI governance module manages AI use cases as IT assets and tracks model metadata. Documentation and workflow management. | Complementary. ServiceNow is the policy documentation layer; AEGIS is the runtime enforcement and proof layer. CCR events populate ServiceNow audit workflows. |
| OneTrust | Data privacy, consent, GDPR/CCPA, data inventory, privacy impact assessments. AI governance handles AI use case intake and bias documentation. Policy-centric, assessment-driven. | Adjacent. OneTrust governs the data the AI accesses. AEGIS governs what the AI agent does with that data at runtime. Different regulatory questions. |
| MetricStream | Enterprise GRC, operational risk, audit management. Strong in financial services. AI governance is assessment and documentation oriented. | Complementary. MetricStream captures the operational risk framework; AEGIS produces the enforcement evidence that makes the framework provably true at the AI agent layer. |
| Credo AI · Arthur AI | AI model risk — fairness, bias, accuracy, model cards, responsible AI assessments. Purpose-built for model-layer governance. | Different layers. Credo / Arthur evaluate whether your models are fair and safe. AEGIS governs what deployed agents do with those models — whether the agent acts within its declared scope. |
| AEGISruntime enforcement | AI agent runtime behavior — enforced at execution, recorded in a tamper-evident ledger. | The white space. GRC platforms document what AI agents are governed by. AEGIS enforces it at the moment of execution and produces cryptographic evidence that enforcement happened. |
Runtime behavioral governance enforces what an AI agent is authorized to do at the moment it acts — for every action, on every agent — rather than only at provisioning time. AEGIS declares a signed behavioral contract for each agent up front, enforces it before every action executes, and proves the outcome in a tamper-evident record.
Yes — and most of it was built in before the guidance existed. When the NSA published its Cybersecurity Information Sheet on MCP Security (CSI U/OO/6030316-26, May 2026), AEGIS already addressed 7 of the 13 applicable recommendations by design. Six additional controls were implemented in direct response to the publication: message replay detection, parameter validation against declared tool schemas, indirect prompt injection scanning of MCP responses, MCP server manifest change detection with automatic traffic suspension, per-agent rate limiting, and AIM-as-RBAC as the authoritative access control layer.
Network controls, identity platforms, and PAM operate at provisioning time — they govern who is acting and what credentials they hold. AEGIS sits above those controls — it enforces capability and behavior at runtime, governing what they were never designed to govern. It does not replace those controls; it governs the surface they cannot reach.
Network controls, identity platforms, and PAM operate at provisioning time — they govern who is acting and what credentials they hold. AEGIS sits above those controls — it enforces capability and behavior at runtime, governing what AI agents actually do once connected. It does not replace those controls; it governs what they were never designed to govern.
Every consequential decision is written to the CCR — a tamper-evident audit ledger with a hash-chain. Enterprise tier can publish daily global anchor hashes to a customer-controlled WORM store, transparency log, or notary, making retroactive tampering detectable even by a fully-compromised system.
AEGIS maps directly to regulatory frameworks: the AIM serves as model documentation, behavioral baselines provide ongoing monitoring, the CCR is a tamper-evident model-activity log, and the Agent Registry is a model inventory — aligning with OCC SR 11-7. For EU AI Act Article 14, AEGIS-powered intelligence proposes but never applies; model updates require human approval before deployment. It is also aligned with NIST AI RMF and SEC/FINRA AI guidance.
Instrument an agent in under 15 minutes with one import: pip install aegis-sdk, then wrap your client with aegis.protect(). Every action is then governed by the agent's signed behavioral contract and witnessed in the audit record.
Every component has a defined failure mode — no silent fail-opens. Under any failure, AEGIS either enforces from cache or fails closed. The CLI interceptor (AEGIS Shell) never fails open under any condition: an unparseable command does not run. Degradation is always observable and always logged.
Get a technical briefing with the founding team.