What Are Evals?
Evals in Pylar show how AI agents interact with your MCP tools. They give you visibility into tool usage, performance, errors, and query patterns, so you can optimize your tools and views for better agent performance.
Why Use Evals?
Evals help you:
- ✅ Monitor Performance: Track success rates and error patterns
- ✅ Identify Issues: Find tools that fail frequently or have problems
- ✅ Understand Usage: See how agents are using your tools
- ✅ Optimize Queries: Identify slow or inefficient queries
- ✅ Improve Tools: Refine tools based on real agent behavior
- ✅ Track Trends: See how usage changes over time
Evals are your window into production agent behavior. They show you exactly what’s happening when agents use your tools.
Accessing Evals
To open the Evaluation Dashboard:
- Navigate to your project in Pylar
- Click the “Eval” button in the top-right corner of the screen
- The Evaluation Dashboard opens
What You’ll See
The Evaluation Dashboard shows:
Summary Metrics
- Total Count: How many times tools were invoked
- Success Count: Successful invocations
- Error Count: Failed invocations
- Success Rate: Percentage of successful calls
- Error Rate: Percentage of failed calls
Visual Insights
- Time-Series Graphs: See how usage, successes, and errors change over time
- Trend Analysis: Understand patterns in tool performance
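To make the time-series view concrete, here is a minimal sketch of how per-day success and error counts could be derived from tool-call records. The record shape (`timestamp`, `ok`) and the `daily_counts` helper are assumptions for illustration only; they are not a Pylar API or export format.

```python
from collections import defaultdict
from datetime import datetime

def daily_counts(records):
    """Group hypothetical tool-call records into per-day success/error counts."""
    buckets = defaultdict(lambda: {"success": 0, "error": 0})
    for record in records:
        # Reduce the ISO timestamp to its calendar day, e.g. "2024-05-01".
        day = datetime.fromisoformat(record["timestamp"]).date().isoformat()
        buckets[day]["success" if record["ok"] else "error"] += 1
    # Sort by day so the result reads like a time series.
    return dict(sorted(buckets.items()))

# Hypothetical records, invented for this example.
records = [
    {"timestamp": "2024-05-02T08:00:00", "ok": False},
    {"timestamp": "2024-05-01T09:15:00", "ok": True},
    {"timestamp": "2024-05-01T17:40:00", "ok": True},
    {"timestamp": "2024-05-02T11:30:00", "ok": True},
]
counts = daily_counts(records)
print(counts)
```

Plotting the `success` and `error` values per day gives the same kind of trend line the dashboard graphs show.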
Error Analysis
- Error Explorer: See what errors occurred and how often
- Query Shape: Understand patterns in query types
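As a rough illustration of what "what errors occurred and how often" means, the sketch below ranks error messages by frequency. The record fields (`tool`, `error`) and the `top_errors` helper are hypothetical and stand in for whatever the Error Explorer aggregates internally.

```python
from collections import Counter

def top_errors(records, n=3):
    """Return the n most frequent error messages as (message, count) pairs."""
    messages = [r["error"] for r in records if r.get("error")]
    return Counter(messages).most_common(n)

# Hypothetical tool-call records; "error" is None for successful calls.
records = [
    {"tool": "get_orders", "error": None},
    {"tool": "get_orders", "error": "timeout"},
    {"tool": "get_users", "error": "column not found"},
    {"tool": "get_orders", "error": "timeout"},
]
ranked = top_errors(records)
print(ranked)
```

Starting with the most frequent message is usually the quickest way to decide which tool or view to fix first.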
Raw Logs
- Detailed Records: Every tool call with full context
- Query Details: See exactly what queries were executed
- Error Messages: Understand why failures occurred
Key Concepts
Success vs. Error
- Success: Tool invocation returned a valid result
- Error: Tool invocation failed to return a result
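Success Rate
The percentage of tool invocations that succeeded: Success Count divided by Total Count, multiplied by 100.
Error Rate
The percentage of tool invocations that failed: Error Count divided by Total Count, multiplied by 100.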
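The relationship between the counts and the two rates can be sketched in a few lines. This assumes every invocation is either a success or an error, so the two rates sum to 100%; the `summarize` function and its input format are illustrative and not part of Pylar.

```python
def summarize(outcomes):
    """Compute summary metrics from a list of booleans (True = success)."""
    total = len(outcomes)
    success = sum(1 for ok in outcomes if ok)
    error = total - success
    # Avoid dividing by zero when no tools have been invoked yet.
    success_rate = 100.0 * success / total if total else 0.0
    error_rate = 100.0 * error / total if total else 0.0
    return {
        "total": total,
        "success": success,
        "error": error,
        "success_rate": success_rate,
        "error_rate": error_rate,
    }

# Hypothetical outcomes: three successful calls and one failed call.
summary = summarize([True, True, True, False])
print(summary)
```

With three successes out of four calls, the success rate is 75% and the error rate is 25%.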
Using Evals to Improve
Iterative Improvement Process
- Monitor: Use Evals to see how tools perform
- Identify Issues: Find errors, slow queries, or recurring failure patterns
- Refine: Update tools or views based on insights
- Verify: Check Evals again to confirm improvements
- Repeat: Continuously improve based on real usage
Use Evals regularly to catch issues early and continuously improve your tools. Don’t wait for problems to be reported—monitor proactively.
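For the Verify step, one simple check is to compare the error rate before and after a change. The sketch below assumes you read the Total Count and Error Count off the dashboard for two time windows; the helper names and the one-percentage-point threshold are arbitrary choices for illustration.

```python
def error_rate(total, errors):
    """Error rate as a percentage; 0.0 when there were no calls."""
    return 100.0 * errors / total if total else 0.0

def improved(before, after, min_drop=1.0):
    """True if the error rate fell by at least min_drop percentage points."""
    drop = error_rate(*before) - error_rate(*after)
    return drop >= min_drop

# Hypothetical (total, errors) counts for two periods.
before_change = (200, 30)  # 15.0% error rate
after_change = (220, 11)   # 5.0% error rate
is_better = improved(before_change, after_change)
print(is_better)
```

Comparing rates rather than raw error counts keeps the check meaningful when usage volume differs between the two periods.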
Next Steps
Ready to explore your Evals?
- Evals Dashboard - Navigate the dashboard and understand metrics
- Analyzing Errors - Understand error patterns and how to fix them
- Understanding Query Shapes - Learn about query patterns
- Improving Tools with Evals - Use insights to optimize your tools