Overview
The Error Analysis section of Evals helps you understand what’s going wrong with your tools. It shows error codes, frequencies, and patterns that help you diagnose and fix issues.Accessing Error Analysis
In the Evaluation Dashboard:- Select the tool you want to analyze (using filters)
- Scroll to the Error Analysis section
- Review error codes and frequencies
Error Explorer
The Error Explorer lists all error codes that occurred and how often each one happened.Understanding Error Codes
Error codes indicate different types of failures:- 400: Bad Request - Invalid parameters or query syntax
- 404: Not Found - Resource doesn’t exist
- 500: Internal Server Error - Server-side issues
- Timeout: Query execution exceeded time limit
- Permission: Access denied or insufficient permissions
Example Error Explorer
- Error code 400 occurred 3 times
- Error code 500 occurred 1 time
- Timeout errors occurred 2 times
The Error Explorer shows the most common errors first. Focus on fixing high-frequency errors first, as they have the biggest impact.
Understanding Error Types
Error Code 400: Bad Request
What it means: The request was invalid. Common causes:- Invalid parameter values
- SQL syntax errors in query
- Parameter type mismatches
- Missing required parameters
- Check parameter definitions match query placeholders
- Verify parameter types are correct
- Test with valid parameter values
- Review SQL query syntax
Error Code 500: Internal Server Error
What it means: Server-side error occurred. Common causes:- Database connection issues
- Query execution failures
- View definition problems
- Data type mismatches
- Verify view queries are correct
- Check database connections are active
- Test queries manually in SQL IDE
- Review error messages in raw logs
Timeout Errors
What it means: Query took too long to execute. Common causes:- Large result sets
- Complex joins
- Missing indexes
- Inefficient queries
- Add LIMIT to restrict result size
- Optimize query performance
- Add indexes to source views
- Refine WHERE conditions
Permission Errors
What it means: Access denied. Common causes:- Insufficient database permissions
- View access restrictions
- Token authentication issues
- Verify database connection permissions
- Check view access controls
- Regenerate authentication tokens if needed
Error Frequency Analysis
High-Frequency Errors
Errors that occur frequently need immediate attention:- High impact: Affect many agent interactions
- User experience: Cause frustration for agents
- Priority: Fix these first
Low-Frequency Errors
Errors that occur occasionally:- Lower priority: But still worth fixing
- Edge cases: May indicate boundary conditions
- Monitor: Track to see if frequency increases
Error Patterns
Look for patterns in errors:- Time-based: Do errors occur at specific times?
- Parameter-based: Do certain parameter values cause errors?
- Tool-based: Do errors affect specific tools more?
Use the time-series graphs to see when errors spike. This can help identify patterns like specific times of day, after deployments, or related to usage volume.
Using Raw Logs for Error Analysis
Raw logs provide detailed error information:- Scroll to Raw Logs at the bottom of the dashboard
- Filter for errors (look for non-empty error messages)
- Review error messages for each failed invocation
- Identify common patterns
What to Look For
In raw logs, check:- Error messages: Detailed descriptions of what went wrong
- Query executed: See the exact query that failed
- Parameters used: What values caused the error
- Timestamp: When the error occurred
Error messages in raw logs are your best source of information for understanding what went wrong. Always review them when investigating errors.
Fixing Common Errors
Fix 1: Parameter Mismatch
Error: Parameter placeholder doesn’t match parameter name Solution:Fix 2: SQL Syntax Error
Error: Invalid SQL in query Solution:- Test query manually in SQL IDE
- Replace placeholders with actual values
- Fix syntax errors
- Update tool query
Fix 3: Type Mismatch
Error: Parameter type doesn’t match query expectations Solution:- Check parameter type definition
- Ensure query handles type correctly
- Use CAST if needed for type conversion
Fix 4: Timeout
Error: Query execution timeout Solution:Monitoring Error Improvements
After fixing errors:- Monitor Evals: Check if error rates decrease
- Verify Fixes: Confirm specific error codes disappear
- Track Trends: Watch graphs to see improvement
- Iterate: Continue improving based on new errors
After fixing errors, monitor Evals for a few days to confirm the fixes work and no new errors appear.
Best Practices
Regular Monitoring
- ✅ Check Evals regularly (daily or weekly)
- ✅ Set alerts for high error rates
- ✅ Review error trends over time
- ✅ Investigate spikes immediately
Error Prioritization
- High-frequency errors: Fix first (biggest impact)
- Critical errors: Address quickly (data loss, security)
- Low-frequency errors: Fix when convenient
- Edge cases: Document and monitor
Documentation
- Document common errors and fixes
- Share learnings with team
- Update tool documentation based on errors
- Create runbooks for frequent issues
Next Steps
Now that you understand error analysis:- Understanding Query Shapes - Learn about query patterns
- Understanding Query Logs - Dive into detailed logs
- Improving Tools with Evals - Use error insights to improve tools
Understand Query Patterns
Learn how to analyze query patterns from Evals