# Simulations

Amigo's simulation system is an evaluation and testing framework for validating agent behavior before deploying to production. You define simulated users (personas), test scenarios, and success criteria, then run automated conversations to measure how your agent performs.

## How Simulations Work

The simulation system uses five building blocks that compose together:

{% @mermaid/diagram content="%%{init: {"flowchart": {"useMaxWidth": true, "nodeSpacing": 30, "rankSpacing": 40}, "theme": "base", "themeVariables": {"primaryColor": "#D4E2E7", "primaryTextColor": "#100F0F", "primaryBorderColor": "#083241", "lineColor": "#575452", "textColor": "#100F0F", "clusterBkg": "#F1EAE7", "clusterBorder": "#D7D2D0"}}}%%
flowchart TB
P\[Persona] --> UT\[Unit Test]
S\[Scenario] --> UT
SVC\[Service + Version Set] --> UT
M\[Metrics + Success Criteria] --> UT
UT --> UTS\[Unit Test Set]
UTS --> R\[Unit Test Set Run]
R --> A\[Artifacts / Results]

```
style P fill:#DDE3DB,stroke:#2c3827,color:#100F0F,stroke-width:2px
style S fill:#DDE3DB,stroke:#2c3827,color:#100F0F,stroke-width:2px
style UT fill:#F0DDD9,stroke:#AA412A,color:#100F0F,stroke-width:2px
style UTS fill:#F0DDD9,stroke:#AA412A,color:#100F0F,stroke-width:2px
style R fill:#D4E2E7,stroke:#083241,color:#100F0F,stroke-width:2px
style A fill:#E8E2EB,stroke:#C5BACE,color:#100F0F,stroke-width:2px" %}
```

### Building Blocks

| Component                                                                                                    | Purpose                                                                                                                                                       |
| ------------------------------------------------------------------------------------------------------------ | ------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| [**Personas**](/developer-guide/classic-api/core-api/simulations/simulation-personas.md)                     | Simulated user profiles with a background, role, and preferred language. Versioned so you can iterate on persona definitions without breaking existing tests. |
| [**Scenarios**](/developer-guide/classic-api/core-api/simulations/simulation-scenarios.md)                   | Conversation scripts that define the objective, instructions for the simulated user, and how the conversation starts. Also versioned.                         |
| [**Unit Tests**](/developer-guide/classic-api/core-api/simulations/simulation-unit-tests.md)                 | Combine a persona, a scenario, a service (with version set), and success criteria (metrics with thresholds) into a single test case.                          |
| [**Unit Test Sets**](/developer-guide/classic-api/core-api/simulations/simulation-unit-test-sets.md)         | Group multiple unit tests together, each with a configurable run count, to form a test suite.                                                                 |
| [**Unit Test Set Runs**](/developer-guide/classic-api/core-api/simulations/simulation-unit-test-set-runs.md) | Execute a unit test set. The platform runs all unit tests, evaluates metrics, and produces downloadable artifacts with the results.                           |

### Typical Workflow

1. **Define personas** that represent different user archetypes (for example, "confused new user", "expert power user", "frustrated customer").
2. **Define scenarios** that describe what the simulated user is trying to accomplish and how the conversation should start.
3. **Create unit tests** that pair a persona with a scenario, target a specific service and version set, and set success criteria based on conversation metrics.
4. **Group unit tests into sets** with run counts (for example, run each test 5 times for statistical significance).
5. **Execute runs** and review artifacts to see whether your agent meets the defined success criteria.

{% hint style="info" %}
**Versioning**

Personas and scenarios are versioned independently. When you update a persona's background or a scenario's instructions, you create a new version. Unit tests reference the persona and scenario by ID and always use the latest version at run time. This lets you iterate on test definitions without recreating unit tests.
{% endhint %}

{% hint style="success" %}
**Tool Execution Modes**

During simulations, tools are invoked with `invocation_mode: "conversation-simulation"` instead of `"regular"`. This lets your tools mock external calls and avoid side effects. See [Tools: Execution Modes](/developer-guide/classic-api/core-api/tools.md#execution-modes) for implementation details.
{% endhint %}

## API Categories

### Personas

[**Simulation Personas**](/developer-guide/classic-api/core-api/simulations/simulation-personas.md): create, list, search, update, delete, and version simulated user profiles.

### Scenarios

[**Simulation Scenarios**](/developer-guide/classic-api/core-api/simulations/simulation-scenarios.md): create, list, search, update, delete, and version conversation test scenarios.

### Unit Tests

[**Simulation Unit Tests**](/developer-guide/classic-api/core-api/simulations/simulation-unit-tests.md): create, list, search, update, and delete individual test cases.

### Unit Test Sets

[**Simulation Unit Test Sets**](/developer-guide/classic-api/core-api/simulations/simulation-unit-test-sets.md): create, list, search, update, and delete grouped test suites.

### Unit Test Set Runs

[**Simulation Unit Test Set Runs**](/developer-guide/classic-api/core-api/simulations/simulation-unit-test-set-runs.md): execute test suites, monitor progress, cancel runs, and download result artifacts.

## CLI Testing Tools (Agent Forge)

The Agent Forge SDK provides CLI commands that build on top of the simulation APIs for automated testing:

* **`forge simulation run`**: coverage-optimized multi-session simulation that scores recommended responses against the context graph to systematically explore states, behaviors, and tools.
* **`forge simulation bridge`**: Claude-driven multi-scenario testing from a natural language objective, with pass^k consistency testing.
* **`forge simulation plan`**: generate target specs from natural language objectives or metric stress tests.
* **`forge simulation evaluate`**: compare metric scores across simulation runs (before/after diff mode).
* **`forge conversation simulate-step`**: agent-driven step-by-step simulation with interaction insights (current state, behaviors, tools called).

{% hint style="info" %}
These CLI commands use ephemeral test users for parallel execution. See the [Agent Forge README](https://github.com/amigo-ai/agent-forge) for setup and usage.
{% endhint %}

## Related

* Core API: [Services](/developer-guide/classic-api/core-api/services.md)
* Core API: [Tools](/developer-guide/classic-api/core-api/tools.md)
* Data Access: [Simulation Tables](/developer-guide/classic-api/data-access/organization-tables/simulation.md)
* Getting Started: [Authentication](/developer-guide/getting-started/authentication.md)


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.amigo.ai/developer-guide/classic-api/core-api/simulations.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.