Function Accuracy
Overview of the Function Accuracy API
Hand off to an LLM
Monitor, analyze, and improve function accuracy with metrics, review estimation, and regression testing.
Metrics
Track function performance with accuracy, precision, recall, and F1 scores calculated from labeled transformation data (ground truth).
Review Estimation
Analyze function performance to estimate human review requirements:
- Calculate sample sizes needed for statistical confidence
- Find optimal confidence thresholds
- Estimate review effort for quality assurance workflows
Regression Testing
Compare function versions against historical data:
- Measure performance improvements or regressions between versions
- Test new configurations against ground truth data
- Validate changes before deploying to production