Get Evaluation Results
Fetch evaluation results for a batch of events.
Pass either eventIDs (preferred, since it is the externally stable V3
identifier) or transformationIDs as a comma-separated query
parameter. Exactly one of the two must be provided, with at most 100 IDs
per request.
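As a minimal sketch of the rule above, the following Python helper builds the request URL from exactly one of the two ID lists. The base URL is taken from the curl example on this page; the helper name `build_results_url` and its error messages are illustrative assumptions, not part of the API.

```python
from urllib.parse import urlencode

# Base URL as shown in this page's curl example.
BASE_URL = "https://api.bem.ai/v3/eval/results"
MAX_IDS = 100  # documented per-request limit


def build_results_url(event_ids=None, transformation_ids=None):
    """Return the GET URL for one batch (hypothetical helper, not an official client)."""
    # Exactly one of the two ID lists must be supplied.
    if bool(event_ids) == bool(transformation_ids):
        raise ValueError("provide exactly one of event_ids or transformation_ids")
    if event_ids:
        name, ids = "eventIDs", event_ids
    else:
        name, ids = "transformationIDs", transformation_ids
    if len(ids) > MAX_IDS:
        raise ValueError(f"at most {MAX_IDS} IDs per request, got {len(ids)}")
    # IDs travel as a single comma-separated query parameter;
    # safe="," keeps the commas readable instead of percent-encoding them.
    return f"{BASE_URL}?{urlencode({name: ','.join(ids)}, safe=',')}"
```

Validating on the client side avoids spending a round trip on a request the server would reject anyway.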
For each requested ID, the response reports one of three states:
completed (with a result), still pending, or failed. Result, pending,
and failed entries are all keyed by event KSUID, regardless of which
input form was used.
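The three states can be collapsed into a single lookup on the client side. The sketch below assumes the shapes shown in the sample response on this page: "results" is an object keyed by event ID and "pending" is a list of objects carrying an "eventID" field. The "failed" key and its shape are not shown on this page, so treating it like "pending" is an assumption; `classify_entries` is an illustrative name.

```python
def classify_entries(body):
    """Map each event ID to 'completed', 'pending', or 'failed'."""
    states = {}
    # "results" is an object keyed by event ID, as in the sample response.
    for event_id in body.get("results", {}):
        states[event_id] = "completed"
    # "pending" is a list of objects carrying "eventID", as in the sample response.
    for entry in body.get("pending", []):
        states[entry["eventID"]] = "pending"
    # ASSUMPTION: failed entries live under "failed" and mirror the shape of
    # "pending"; neither the key name nor the shape is confirmed by this page.
    for entry in body.get("failed", []):
        states[entry["eventID"]] = "failed"
    return states
```

A caller could then re-request only the IDs whose state is still "pending".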
Authorization
API Key: Authenticate using an API key in the request header.
In: header
Query Parameters
eventIDs: Comma-separated list of event KSUIDs to fetch results for.
Between 1 and 100 IDs per request. Mutually exclusive with
transformationIDs.
transformationIDs: Comma-separated list of transformation IDs to fetch
results for. Between 1 and 100 IDs per request. Mutually exclusive with
eventIDs. Prefer eventIDs for new integrations.
Optional evaluation version filter.
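Callers holding more than 100 IDs need to spread them across several requests. A minimal sketch of that batching in Python follows; `chunk_ids` is an illustrative name, and the default size of 100 simply mirrors the documented per-request limit.

```python
def chunk_ids(ids, size=100):
    """Yield consecutive slices of at most `size` IDs from a list."""
    if size < 1:
        raise ValueError("size must be at least 1")
    # Step through the list in strides of `size`; the last slice may be shorter.
    for start in range(0, len(ids), size):
        yield ids[start:start + size]
```

Each yielded batch can be joined with commas and sent as a single eventIDs (or transformationIDs) value.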
Response Body
application/json
curl -X GET "https://api.bem.ai/v3/eval/results"

{
  "results": {
    "evt_01HXAB...": {
      "fieldMetrics": {
        "/invoice/number": {
          "confidenceScore": 0.97,
          "reasoning": "Matches canonical invoice number in the source document.",
          "hallucination": false,
          "relevanceScore": 1
        }
      },
      "overallConfidence": 0.97,
      "runtime": 3.42,
      "hasHallucinations": false,
      "evaluationVersion": "0.1.0-gemini",
      "createdAt": "2026-04-23T18:05:00Z"
    }
  },
  "pending": [
    {
      "eventID": "evt_01HXCD...",
      "createdAt": "2026-04-23T18:04:55Z"
    }
  ]
}

{
  "message": "string",
  "code": 0,
  "details": {}
}

See also
- System overview — evaluating extraction quality