Research

Technical notes for evaluating AI agent security.

Public research for security and platform teams comparing runtime controls for AI coding agents, productivity agents, and MCP tool use. Each brief includes a technical model, concrete evaluation criteria, and the signals a buyer should expect to inspect in a real deployment.

Field guide18 minMay 6, 2026

Securing Claude Cowork: An Enterprise Runtime PrimerClaude Cowork is not just chat in a desktop shell. It is a local agent runtime with file access, plugins, connectors, MCP reach, browser paths, and delegated work that can outlive a single prompt.Cowork can remain useful while high-impact actions become visible, explainable, and enforceable.Cowork action latticeClaude Coworkruntime securitypluginsMCPRead research brief

Cowork action lattice

identity

host

repo

tool

proposal

policy kernel

verdict

Comparative analysis16 minMay 6, 2026

Claude Code vs. Claude Cowork: Security Model, Evidence, and Control PlaneClaude Code and Claude Cowork share agentic DNA, but they do not create the same security evidence. One lives close to the terminal and repository. The other expands into desktop work, plugins, connectors, files, and asynchronous collaboration.Normalize for policy. Preserve provenance for investigation and buyer trust.Evidence overlayClaude CodeClaude Coworksecurity modelevidenceRead research brief

Evidence overlay

agent intent layer

t+00

prompt

t+01

tool

t+02

process

endpoint behavior layer

exec

open

connect

Architecture17 minMay 6, 2026

MCP Server Security 101: Inventory, Tool Calls, and Pre-Execution ControlMCP turns an AI agent from a text generator into an operator for files, SaaS apps, databases, internal APIs, browsers, and memory stores. An allowlist alone is not a security model.MCP governance is strongest when it evaluates the structured call before the server performs a side effect.MCP control pathMCPtool securityinventorygatewayRead research brief

MCP control path

agent

policy gateway

servertoolargs

upstream

deny before side effect

Incident analysis15 minMay 6, 2026

When a Coding Agent Deletes a Database: Control Design After the Replit IncidentA public database-deletion incident showed the real failure mode for coding agents: not that agents are useless, but that powerful tools need pre-execution controls, production boundaries, and evidence when instructions fail.The right control blocks or escalates the proposed operation before the irreversible side effect.Destructive-action chainClaude Codedatabase safetyruntime controlsincident responseRead research brief

Destructive-action chain

content

model

action

block

Threat model9 minMay 1, 2026

AI Agent Runtime Threat ModelThe security boundary moved from chat to action. Useful controls evaluate the exact file, command, MCP call, destination, identity, and policy before the agent completes the operation.A runtime event should explain what happened and why it was allowed.Decision latticeruntime securityworkstationspolicyRead research brief

Decision lattice

identity

host

repo

tool

proposal

policy kernel

verdict

Architecture8 minMay 1, 2026

Why MCP Tool Calls Need Pre-Execution PolicyMCP turns agents into operators for files, SaaS apps, internal APIs, memory stores, and local services. Inventory is helpful, but the security event is the call.The policy decision happens before the external capability is exercised.Pre-execution gatewayMCPtool callsgovernanceRead research brief

Pre-execution gateway

agent

policy gateway

servertoolargs

upstream

deny before side effect

Security operations7 minMay 1, 2026

Why EDR Misses Agent IntentEndpoint security sees process behavior. Agent security needs the missing layer: prompt, model, tool, repository, user, policy, and verdict in one chain.The two layers answer different questions and become stronger together.Intent overlay timelineEDRinvestigationsauditRead research brief

Intent overlay timeline

agent intent layer

t+00

prompt

t+01

tool

t+02

process

endpoint behavior layer

exec

open

connect

Detection8 minMay 1, 2026

From Prompt Injection to Tool ExecutionPrompt injection becomes operational risk when hidden instructions influence a tool action. The defensive surface has to connect content detection to execution.Detection quality improves when the next action determines severity.Injection kill chainprompt injectiondetectionstool useRead research brief

Injection kill chain

content

model

action

block

Field guide9 minMay 1, 2026

Enterprise Rollout Checklist for Coding AgentsA safe rollout does not start with blocking everything. It starts with inventory, audit mode, policy fit, identity, and a path from one workstation to managed coverage.The rollout should get safer as adoption grows, not slower.Coverage control boardenterprise rolloutcoding agentsenablementRead research brief

Coverage control board

inventory

ring 1

audit

ring 2

warn

ring 3

block

ring 4

Control design8 minMay 1, 2026

Policy Packs as the Agent Governance UnitPolicy packs turn agent controls into reusable operating rules: who they apply to, which surfaces they govern, what they observe, and what they block.Policy feels enterprise-ready when admins can predict the result before rollout.Resolver graphpolicy packsidentitygovernanceRead research brief

Resolver graph

pack resolver

Identity8 minMay 1, 2026

Directory Sync for Agent Policy DecisionsEntra ID, Google Workspace, and Okta are not just login plumbing. For agent governance, directory data decides who gets which controls and who loses access when identity changes.Policy should update when identity changes, not when someone remembers to edit a rule.Identity delta loopEntra IDGoogle WorkspaceOktaRead research brief

Identity delta loop

normalizer

Operations7 minMay 1, 2026

From Audit Mode to Enforcement Without Breaking DevelopersThe best first policy is often observe. Passed actions, warnings, and blocks show teams where enforcement will help and where it will create avoidable friction.Developers keep moving while risky behavior becomes easier to govern.Enforcement impact rampaudit moderolloutdeveloper experienceRead research brief

Enforcement impact ramp

policy impact preview

Incident response8 minMay 1, 2026

Agent Evidence for Incident ResponseWhen an agent touches sensitive systems, incident responders need more than an alert. They need the prompt, action, policy, output summary, identity, host, and timeline in one place.Evidence should be complete enough for response and restrained enough for privacy.Evidence envelopeevidencedeep captureinvestigationsRead research brief

Evidence envelope

correlation

action

policy

evidence