AI agents as infrastructure risk

Thesis

Once agents use tools, the risk surface moves from text generation to operational control. That makes logging, permissions, containment, and failure recovery infrastructure problems.

Outline

01
Agents as operators
How tool use turns model behavior into system action.
02
Infrastructure failure modes
Credential misuse, runaway loops, stale memory, action ambiguity, and partial execution.
03
Reliability contracts
What agent platforms should guarantee before touching production-adjacent systems.
04
Observability requirements
Minimum traces needed for incident review and rollback.
05
Eval implications
How public eval harnesses can measure infrastructure risk without unsafe demonstrations.

Status

Planned

This roadmap page frames the planned essay and remains intentionally non-confidential and public-source oriented.

Public framing

This roadmap page frames a planned piece and remains intentionally non-confidential and public-source oriented.

Writing index The Observability Stack