Function Accuracy
Overview of the Function Accuracy API
Trigger and retrieve evaluations for completed transformations.
Evaluations run asynchronously and score each transformation's output against
the function's schema for confidence, per-field hallucination detection, and
relevance. Evaluations are supported for extract, transform, analyze,
and join events.
Lifecycle
- Trigger: POST /v3/eval queues jobs for a batch of transformation IDs and returns immediately with queued/skipped counts plus per-ID errors.
- Poll: POST /v3/eval/results (body) or GET /v3/eval/results (query) returns the current state of each requested transformation, partitioned into results (completed), pending (still running), and failed (terminal failures or unknown transformation IDs).
Up to 100 transformation IDs may be submitted per request.
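The trigger-then-poll lifecycle above can be sketched in Python as follows. This is a minimal illustration, not the official client. The endpoint paths and the results/pending/failed keys come from the lifecycle description. The base URL, the `x-api-key` header, and the `transformationIDs` body field are assumptions made for the example and may not match the real API. The sketch splits IDs into batches of at most 100, triggers each batch, and re-polls a batch until its pending partition is empty or a poll budget runs out.

```python
import json
import time
import urllib.request
from typing import Callable, Iterable

MAX_IDS_PER_REQUEST = 100  # per-request limit stated in the documentation

# ASSUMPTION: placeholder host; substitute the real API base URL.
BASE_URL = "https://api.example.com"

# A sender takes (path, json_body) and returns the decoded JSON response.
Sender = Callable[[str, dict], dict]


def make_sender(api_key: str) -> Sender:
    """Build a sender that POSTs JSON using only the standard library."""
    def send(path: str, body: dict) -> dict:
        request = urllib.request.Request(
            BASE_URL + path,
            data=json.dumps(body).encode("utf-8"),
            # ASSUMPTION: the auth header name is hypothetical.
            headers={"Content-Type": "application/json", "x-api-key": api_key},
            method="POST",
        )
        with urllib.request.urlopen(request) as response:
            return json.load(response)
    return send


def chunk_ids(ids: Iterable[str], size: int = MAX_IDS_PER_REQUEST) -> list:
    """Split transformation IDs into batches no larger than the request limit."""
    ids = list(ids)
    return [ids[i:i + size] for i in range(0, len(ids), size)]


def trigger(batch: list, send: Sender) -> dict:
    """Queue evaluations for one batch; the call returns without waiting for results."""
    # ASSUMPTION: "transformationIDs" is a hypothetical field name.
    return send("/v3/eval", {"transformationIDs": batch})


def poll_until_done(batch: list, send: Sender, interval: float = 2.0,
                    max_polls: int = 30, sleep=time.sleep) -> dict:
    """Re-poll one batch until nothing is pending or the poll budget is spent."""
    state: dict = {}
    for attempt in range(max_polls):
        if attempt:
            sleep(interval)  # wait between polls, but not before the first one
        state = send("/v3/eval/results", {"transformationIDs": batch})
        if not state.get("pending"):
            break
    return state  # the caller should check state.get("pending") for leftovers
```

A caller would typically build `send = make_sender(api_key)`, then for each batch from `chunk_ids(all_ids)` call `trigger(batch, send)` followed by `poll_until_done(batch, send)`. Passing the sender in as a parameter keeps the batching and polling logic testable without network access.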
See also
- System overview — evaluating extraction quality