Get Agent Test Run

Retrieve a single test run by ID. Poll this endpoint until status reaches a terminal state (passed, failed, or error). The result field is populated on terminal states.
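The polling behaviour described above can be sketched as follows. This is a minimal illustration, not a reference client: `fetch_run` is a hypothetical callable that performs the GET request and returns the decoded JSON body, and the interval and timeout defaults are arbitrary assumptions rather than documented values.

```python
import time
from typing import Any, Callable, Dict

# Terminal states named in the endpoint description.
TERMINAL_STATUSES = {"passed", "failed", "error"}


def poll_test_run(
    fetch_run: Callable[[str], Dict[str, Any]],  # hypothetical: GETs the run, returns parsed JSON
    run_id: str,
    interval_s: float = 2.0,    # assumed polling interval, not a documented value
    timeout_s: float = 300.0,   # assumed overall deadline, not a documented value
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Fetch the run repeatedly until its status is terminal, then return it."""
    deadline = time.monotonic() + timeout_s
    while True:
        run = fetch_run(run_id)
        if run["status"] in TERMINAL_STATUSES:
            return run
        if time.monotonic() >= deadline:
            raise TimeoutError(f"run {run_id} still {run['status']!r} after {timeout_s}s")
        sleep(interval_s)
```

Passing the fetcher and the sleep function as arguments keeps the loop independent of any particular HTTP library and makes it easy to exercise without network access.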

Authentication

Authorization Bearer

Enter your API key with the Bearer prefix, e.g. ‘Bearer sk_…’.
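As a small illustration of the header format, the helper below builds the `Authorization` header from an API key. The function name `auth_headers` is hypothetical, and tolerating a key that already carries the prefix is a convenience assumed here, not documented behaviour.

```python
from typing import Dict


def auth_headers(api_key: str) -> Dict[str, str]:
    """Return request headers carrying the API key with the Bearer prefix."""
    token = api_key.strip()
    # Assumption: accept keys pasted with the prefix already attached,
    # so the header never ends up as "Bearer Bearer ...".
    if token.lower().startswith("bearer "):
        token = token[len("bearer "):].strip()
    if not token:
        raise ValueError("API key must not be empty")
    return {"Authorization": f"Bearer {token}"}
```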

Path parameters

id string Required
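Because the path accepts only the prefixed identifier form described under Response, a client may want to check an ID locally before sending the request. The sketch below is one way to do that. The helper name `is_prefixed_id` is hypothetical, and accepting both upper- and lower-case characters is an assumption, since the reference does not state the wire casing.

```python
import re

# Crockford base32 alphabet: digits plus letters, excluding I, L, O and U.
# Assumption: either letter case is accepted; the reference does not specify casing.
_CROCKFORD_26 = re.compile(r"[0-9A-HJKMNP-TV-Z]{26}", re.IGNORECASE)


def is_prefixed_id(value: str, prefix: str) -> bool:
    """Return True if value looks like '<prefix>_<26 Crockford base32 chars>'."""
    head, sep, body = value.partition("_")
    return sep == "_" and head == prefix and _CROCKFORD_26.fullmatch(body) is not None
```

The same check applies to the `test_` and `agent_` identifiers by passing `"test"` or `"agent"` as the prefix. A legacy UUID fails the check, which matches the 404 the endpoint returns for that form.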

Response

The test run.
id string

Prefixed wire identifier (run_<26 char Crockford base32>). Under the ADR 0015 Cluster 3 hard break, URL paths accept only this prefixed form; legacy UUID path parameters are rejected with 404.

test_id string

Prefixed wire identifier (test_<26 char Crockford base32>) of the parent test. Uses the prefixed form for foreign-key consistency under ADR 0015.

agent_id string

Prefixed wire identifier (agent_<26 char Crockford base32>) of the agent this run executed against. Uses the prefixed form for foreign-key consistency under ADR 0015.

status enum

Lifecycle of a test run: queued → running → terminal state.

Terminal states:

  • passed - the agent behaviour met the success criteria.
  • failed - the agent behaviour did not meet the success criteria.
  • error - the runner itself could not complete (LLM outage, network error, etc.). This is distinct from failed, which means the agent behaviour was judged and found lacking.
created_at datetime
started_at datetime or null
completed_at datetime or null
result object or null
Populated on terminal status only.
error string

Human-readable error message when status is error.
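The response fields above suggest a simple way to interpret a fetched run: branch on `status`, read `result` only for terminal states, and read `error` only when the status is error. The following is a minimal sketch under those assumptions. The function name `describe_run` and the wording of the returned messages are illustrative, and the internal shape of `result` is not documented here, so it is left untouched.

```python
from typing import Any, Dict, Tuple


def describe_run(run: Dict[str, Any]) -> Tuple[bool, str]:
    """Return (is_terminal, summary) for a decoded test run response."""
    status = run["status"]
    if status == "passed":
        return True, "passed: the agent behaviour met the success criteria"
    if status == "failed":
        return True, "failed: the agent behaviour did not meet the success criteria"
    if status == "error":
        # The runner could not complete; the agent was not judged.
        message = run.get("error") or "no error message provided"
        return True, f"error: {message}"
    # queued or running: result is expected to be null at this point.
    return False, f"in progress: {status}"
```

Keeping error separate from failed in client code preserves the distinction the API draws: an error run says nothing about the agent's behaviour and may be worth retrying, whereas a failed run reflects a judged outcome.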

Errors

401
Unauthorized Error
404
Not Found Error
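A client can map the two documented error statuses to distinct exceptions so callers can tell a bad API key from a missing or legacy-format run ID. The sketch below uses only the Python standard library. The exception classes and the `get_test_run` function are hypothetical names, and the full request URL is passed in by the caller because this page does not state the endpoint path.

```python
import json
import urllib.error
import urllib.request
from typing import Any, Callable, Dict


class UnauthorizedError(Exception):
    """Raised on HTTP 401: the API key is missing or invalid."""


class RunNotFoundError(Exception):
    """Raised on HTTP 404: unknown run ID, or a legacy UUID in the path."""


def get_test_run(
    url: str,       # full URL of the run resource, supplied by the caller
    api_key: str,
    opener: Callable[..., Any] = urllib.request.urlopen,
) -> Dict[str, Any]:
    """GET a test run and translate the documented error statuses."""
    request = urllib.request.Request(url, headers={"Authorization": f"Bearer {api_key}"})
    try:
        with opener(request) as response:
            return json.load(response)
    except urllib.error.HTTPError as exc:
        if exc.code == 401:
            raise UnauthorizedError("API key rejected") from exc
        if exc.code == 404:
            raise RunNotFoundError(f"no test run at {url}") from exc
        raise  # any other status is left for the caller to handle
```

The `opener` parameter exists only so the function can be exercised with a stand-in for `urllib.request.urlopen`.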