Function Accuracy

Overview of the Function Accuracy API

Hand off to an LLM

Monitor, analyze, and improve function accuracy with metrics, review estimation, and regression testing.

Metrics

Track function performance with accuracy, precision, recall, and F1 scores calculated from labeled transformation data (ground truth).

Review Estimation

Analyze function performance to estimate human review requirements:

  • Calculate sample sizes needed for statistical confidence
  • Find optimal confidence thresholds
  • Estimate review effort for quality assurance workflows

Regression Testing

Compare function versions against historical data:

  • Measure performance improvements or regressions between versions
  • Test new configurations against ground truth data
  • Validate changes before deploying to production

See also

On this page