Trigger Transformation Evaluations

Hand off to an LLM

Queue evaluation jobs for a batch of transformations.

Evaluations run asynchronously and score each transformation's output against the function's schema for confidence, hallucination detection, and relevance. Transformations must belong to events of a supported type: extract, transform, analyze, or join.

Returns immediately with a summary of queued vs. skipped transformations and per-transformation errors. Poll POST /v3/eval/results or GET /v3/eval/results to retrieve results once evaluations complete.

POST
/v3/eval
x-api-key<token>

Authenticate using API Key in request header

In: header

Request Body

application/json

TypeScript Definitions

Use the request body type in TypeScript.

Response Body

application/json

application/json

curl -X POST "https://api.bem.ai/v3/eval" \  -H "Content-Type: application/json" \  -d '{    "transformationIDs": [      "tr_01HXAB...",      "tr_01HXCD..."    ],    "evaluationVersion": "0.1.0-gemini"  }'

{
  "queued": 1,
  "skipped": 0,
  "errors": {
    "tr_01HXCD...": "unsupported event type for evaluation: split. Only 'analyze', 'transform', 'extract', and 'join' are supported"
  }
}

{
  "message": "string",
  "code": 0,
  "details": {}
}

See also