Get Test Run

Retrieve a single test run by ID. Poll this endpoint until status reaches a terminal state (passed, failed, or error). The result field is populated on terminal states.
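The polling pattern described above can be sketched as follows. This is a minimal sketch, not an official client: `fetch_run` is a hypothetical caller-supplied function that performs the authenticated GET and returns the decoded JSON body as a dict, and the interval and timeout defaults are assumptions. Only the status values are taken from this page.

```python
import time
from typing import Any, Callable, Dict

# Terminal states as listed in this reference.
TERMINAL_STATUSES = {"passed", "failed", "error"}


def wait_for_test_run(
    fetch_run: Callable[[str], Dict[str, Any]],  # hypothetical: does the GET, returns JSON
    run_id: str,
    interval_s: float = 2.0,    # assumed polling interval
    timeout_s: float = 300.0,   # assumed overall deadline
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Poll until the run reaches a terminal status, then return the run."""
    deadline = time.monotonic() + timeout_s
    while True:
        run = fetch_run(run_id)
        if run.get("status") in TERMINAL_STATUSES:
            return run
        if time.monotonic() >= deadline:
            raise TimeoutError(
                f"test run {run_id} still {run.get('status')!r} after {timeout_s}s"
            )
        sleep(interval_s)
```

Passing the fetch and sleep functions in keeps the loop independent of any particular HTTP library and makes it easy to exercise without network access.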

Authentication

Authorization Bearer

Send an API key or access token with the Bearer prefix, e.g. 'Bearer API_KEY' or 'Bearer ACCESS_TOKEN'.
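As an illustration, the header could be built like this. The helper name `auth_headers` is hypothetical; the only detail taken from this page is the `Authorization` header carrying a `Bearer`-prefixed credential.

```python
from typing import Dict


def auth_headers(credential: str) -> Dict[str, str]:
    """Build the Authorization header from an API key or access token."""
    token = credential.strip()
    if not token:
        raise ValueError("credential must not be empty")
    # Avoid doubling the prefix if the caller already included it.
    if not token.lower().startswith("bearer "):
        token = f"Bearer {token}"
    return {"Authorization": token}
```

The returned dict can be passed as the headers argument of whichever HTTP client is in use.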

Path parameters

id string Required

Response

The test run.
id string
test_id string
agent_id string
status enum

Lifecycle of a test run: queued → running → terminal.

Terminal states:

  • passed - the agent behaviour met the success criteria.
  • failed - the agent behaviour did not meet the success criteria.
  • error - the runner itself could not complete (LLM outage, network error, etc.), distinct from failed which means the agent behaviour was judged and found lacking.
Allowed values: queued, running, passed, failed, error
created_at datetime
started_at datetime or null
completed_at datetime or null
result object or null
Populated on terminal status only.
error string

Human-readable error message when status is error.
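The response fields above could be modeled on the client side roughly as follows. This is a sketch under assumptions: the class and function names are hypothetical, timestamps are kept as raw strings because the datetime format is not stated here, and `error` is treated as possibly absent when status is not error.

```python
from dataclasses import dataclass
from typing import Any, Dict, Optional


@dataclass
class TestRun:
    id: str
    test_id: str
    agent_id: str
    status: str
    created_at: str                    # raw string; format not specified on this page
    started_at: Optional[str]
    completed_at: Optional[str]
    result: Optional[Dict[str, Any]]   # populated on terminal status only
    error: Optional[str]               # assumed absent unless status is "error"


def parse_test_run(body: Dict[str, Any]) -> TestRun:
    """Map a decoded JSON response body onto a TestRun."""
    return TestRun(
        id=body["id"],
        test_id=body["test_id"],
        agent_id=body["agent_id"],
        status=body["status"],
        created_at=body["created_at"],
        started_at=body.get("started_at"),
        completed_at=body.get("completed_at"),
        result=body.get("result"),
        error=body.get("error"),
    )


def describe_outcome(run: TestRun) -> str:
    """Distinguish a judged failure from a runner error."""
    if run.status == "passed":
        return "agent met the success criteria"
    if run.status == "failed":
        return "agent did not meet the success criteria"
    if run.status == "error":
        return f"runner error: {run.error or 'no message'}"
    return f"still in progress ({run.status})"
```

Keeping failed and error on separate branches mirrors the distinction drawn above: failed is a verdict on the agent, while error means the runner never reached a verdict.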