Get Evaluation Results

Fetch evaluation results for a batch of events.

Pass either eventIDs (preferred; the externally stable V3 identifier) or transformationIDs as a comma-separated query parameter. Exactly one of the two must be provided, with up to 100 IDs per request.

For each requested ID, the response reports one of three states: completed, pending, or failed. Completed results, pending entries, and failed entries are all keyed by event KSUID, regardless of which input form was used.
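The three states can be collapsed into a single lookup table once a response has been decoded. The following is a minimal sketch, not an official client: `summarize_states` is a hypothetical helper, and the shape of the `failed` entries is an assumption (the example response on this page only shows `results` and `pending`).

```python
from typing import Any


def summarize_states(body: dict[str, Any]) -> dict[str, str]:
    """Map each event ID in a decoded response body to its state."""
    states: dict[str, str] = {}
    # "results" is an object keyed by event ID.
    for event_id in body.get("results", {}):
        states[event_id] = "completed"
    # "pending" is a list of objects that carry an "eventID" field.
    for entry in body.get("pending", []):
        states[entry["eventID"]] = "pending"
    # ASSUMPTION: "failed" mirrors the shape of "pending"; the documented
    # example does not show a failed entry, so adjust this if it differs.
    for entry in body.get("failed", []):
        states[entry["eventID"]] = "failed"
    return states
```

A caller could poll again for every ID whose state is still "pending" and stop once none remain.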

GET /v3/eval/results

x-api-key: <token>

Authenticate with an API key sent in the x-api-key request header.

In: header
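As an illustration of header-based authentication, the sketch below prepares (but does not send) a GET request using only the Python standard library. `make_request` is a hypothetical helper name, and the base URL is taken from the curl example on this page.

```python
import urllib.request

# Endpoint URL as shown in the curl example on this page.
BASE_URL = "https://api.bem.ai/v3/eval/results"


def make_request(api_key: str, query: str) -> urllib.request.Request:
    """Build a GET request that carries the API key in the x-api-key header."""
    req = urllib.request.Request(f"{BASE_URL}?{query}", method="GET")
    # The key travels in a request header, never in the query string.
    req.add_header("x-api-key", api_key)
    return req
```

The prepared request can then be sent with `urllib.request.urlopen(req)` and the body decoded with `json.load`.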

Query Parameters

eventIDs (string, optional)

Comma-separated list of event KSUIDs to fetch results for. Between 1 and 100 IDs per request. Mutually exclusive with transformationIDs.

transformationIDs (string, optional)

Comma-separated list of transformation IDs to fetch results for. Between 1 and 100 IDs per request. Mutually exclusive with eventIDs. Prefer eventIDs for new integrations.

evaluationVersion (string, optional)

Optional evaluation version filter.
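The parameter rules above (exactly one ID list, 1 to 100 IDs per request, optional version filter) can be enforced on the client before any request is made. This is a sketch under those stated rules; `build_queries` and `MAX_IDS` are hypothetical names, and splitting larger ID lists into several requests is a client-side choice, not something the API is documented to do.

```python
from typing import Optional
from urllib.parse import urlencode

MAX_IDS = 100  # documented per-request limit


def build_queries(
    event_ids: Optional[list[str]] = None,
    transformation_ids: Optional[list[str]] = None,
    evaluation_version: Optional[str] = None,
) -> list[str]:
    """Return one query string per batch of at most MAX_IDS IDs."""
    # The two ID parameters are mutually exclusive and one is required.
    if bool(event_ids) == bool(transformation_ids):
        raise ValueError("provide exactly one of event_ids or transformation_ids")
    if event_ids:
        name, ids = "eventIDs", event_ids
    else:
        name, ids = "transformationIDs", transformation_ids
    queries = []
    for start in range(0, len(ids), MAX_IDS):
        params = {name: ",".join(ids[start:start + MAX_IDS])}
        if evaluation_version:
            params["evaluationVersion"] = evaluation_version
        # Keep commas literal so the list stays readable in the URL.
        queries.append(urlencode(params, safe=","))
    return queries
```

Each returned string can be appended to the endpoint URL after a `?`.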

Response Body

application/json

curl -X GET "https://api.bem.ai/v3/eval/results?eventIDs=<eventID1>,<eventID2>" \
  -H "x-api-key: <token>"
{
  "results": {
    "evt_01HXAB...": {
      "fieldMetrics": {
        "/invoice/number": {
          "confidenceScore": 0.97,
          "reasoning": "Matches canonical invoice number in the source document.",
          "hallucination": false,
          "relevanceScore": 1
        }
      },
      "overallConfidence": 0.97,
      "runtime": 3.42,
      "hasHallucinations": false,
      "evaluationVersion": "0.1.0-gemini",
      "createdAt": "2026-04-23T18:05:00Z"
    }
  },
  "pending": [
    {
      "eventID": "evt_01HXCD...",
      "createdAt": "2026-04-23T18:04:55Z"
    }
  ]
}

Error response:
{
  "message": "string",
  "code": 0,
  "details": {}
}
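To show how the per-field metrics in a completed result might be consumed, the sketch below lists the field paths that deserve manual review. `flag_fields` is a hypothetical helper, and the 0.9 threshold is an arbitrary assumption rather than a documented recommendation; only the `fieldMetrics`, `confidenceScore`, and `hallucination` keys come from the example response.

```python
from typing import Any


def flag_fields(result: dict[str, Any], min_confidence: float = 0.9) -> list[str]:
    """Return sorted field paths that are hallucinated or below min_confidence."""
    flagged = []
    for path, metrics in result.get("fieldMetrics", {}).items():
        low = metrics.get("confidenceScore", 0.0) < min_confidence
        # A field is flagged if either signal fires.
        if metrics.get("hallucination") or low:
            flagged.append(path)
    return sorted(flagged)
```

Applied to the single field in the example response (confidence 0.97, no hallucination), this returns an empty list.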
