Risk Management

Balance agent autonomy with appropriate safety measures.

Risk vs. Utility Tradeoff

Risk management is a tradeoff between utility and safety

More autonomy means more utility — but also more risk. You control this balance through:

Job scope — How broad is the agent’s responsibility?
Tool access — What capabilities does the agent have?
Guardrails — What constraints are enforced?
Oversight — How much monitoring and approval is required?

Guiding Principles

Principle of Least Privilege

Give agents only the tools and data they need for their job.

Principle of Earned Trust

Start with narrow scope and expand as the agent proves itself.

Risk Factors

Factor	Lower Risk	Higher Risk
Scope	Specific task	Broad responsibilities
Tools	Read-only access	Write/send/call capabilities
External contact	Internal only	Customer-facing
Data sensitivity	Public data	Confidential information
Reversibility	Easy to undo	Permanent actions

Mitigation Strategies

Better models

Use the most capable models for high-stakes tasks. Default selection is optimized for agentic behavior.

Better instructions

Spend more time on clear, detailed instructions with explicit boundaries.

More testing

Extensive testing before deployment, especially for edge cases.

Guardrails

Configure technical constraints (whitelists, limits) that are enforced by code.

Human approval

Require manual approval for sensitive actions.

Monitoring

Watch what agents do, especially early on.

The Security Agent

The platform includes a security agent that works in the background:

Reviews incoming events and agent plans
Operates independently of the agent’s context
Can veto actions that seem unsafe
Prevents prompt injection attacks

The security agent's assessment in the activity log

Practical Recommendations

Start conservative

Narrow scope, limited tools, close monitoring.

Test thoroughly

Verify behavior before expanding capabilities.

Expand gradually

Add tools and scope incrementally.

Monitor continuously

Watch for unexpected patterns even after deployment.

Our experience shows that advanced agents with broad scope can be safe — it just requires more careful setup and ongoing attention.

Working with Agents

​Risk vs. Utility Tradeoff

​Guiding Principles

Principle of Least Privilege

Principle of Earned Trust

​Risk Factors

​Mitigation Strategies

​The Security Agent

​Practical Recommendations

Risk vs. Utility Tradeoff

Guiding Principles

Risk Factors

Mitigation Strategies

The Security Agent

Practical Recommendations