package aiagents

v0.2.0
Published: Oct 20, 2025 · License: Apache-2.0 · Imports: 4 · Imported by: 0

README

AI Agent Business Metrics Examples

This directory contains OpenSLO examples for AI agent platforms, covering both end-user experience metrics and business performance indicators.

What are AI Agent Metrics?

AI agent metrics track the health, quality, and business impact of AI-powered agents that assist users with tasks. These metrics go beyond traditional service metrics to include AI-specific concerns like response quality, accuracy, task completion, and cost efficiency.

Files in this Directory

agent-availability.go

Service availability and reliability:

  • ExampleAgentAvailabilitySLO - Overall agent service availability (aggregated)
  • ExamplePerUserAgentAvailabilitySLO - Per-user availability to ensure consistent service
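
To make the shape of these definitions concrete, below is a minimal sketch of what an availability SLO such as ExampleAgentAvailabilitySLO plausibly expresses: a ratio of successful to total session starts over a rolling window. The struct types here are hypothetical stand-ins that mirror the OpenSLO v1 YAML spec, not this package's actual v1 types, and the 99.9% target and 28-day window are illustrative.

package main

import "fmt"

// Hypothetical types mirroring the OpenSLO v1 spec; the real v1.SLO
// types used by this package may be shaped differently.
type RatioMetric struct {
	Good  string // query counting good events
	Total string // query counting all events
}

type Objective struct {
	DisplayName string
	Target      float64 // required fraction of good events, e.g. 0.999
}

type SLO struct {
	Name            string
	Service         string
	Indicator       RatioMetric
	TimeWindow      string // rolling window, e.g. "28d"
	BudgetingMethod string // "Occurrences" or "Timeslices" in OpenSLO v1
	Objectives      []Objective
}

// agentAvailabilitySLO sketches what ExampleAgentAvailabilitySLO might
// define: successful session starts over all session starts.
func agentAvailabilitySLO() SLO {
	return SLO{
		Name:    "agent-availability",
		Service: "ai-agent-platform",
		Indicator: RatioMetric{
			Good:  `sum(rate(agent_session_starts_total{status="success"}[5m]))`,
			Total: `sum(rate(agent_session_starts_total[5m]))`,
		},
		TimeWindow:      "28d",
		BudgetingMethod: "Occurrences",
		Objectives: []Objective{
			{DisplayName: "availability", Target: 0.999},
		},
	}
}

func main() {
	slo := agentAvailabilitySLO()
	fmt.Printf("%s: target %.3f over %s\n", slo.Name, slo.Objectives[0].Target, slo.TimeWindow)
}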

response-quality.go

AI response quality and accuracy:

  • ExampleAgentResponseQualitySLO - User satisfaction ratings (aggregated)
  • ExamplePerUserResponseQualitySLO - Per-user quality tracking
  • ExampleAgentAccuracySLO - Hallucination rate and factual correctness

response-time.go

Agent performance and latency:

  • ExampleAgentResponseTimeSLO - P95 response time (aggregated)
  • ExamplePerUserResponseTimeSLO - Per-user response time consistency
  • ExampleAgentFirstTokenLatencySLO - Time to first token for streaming responses
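
Latency SLOs are threshold-based rather than ratio-based: the objective asserts that a duration statistic stays under a bound. Below is a compact sketch extending the hypothetical types from the availability example above; the query matches the P95 example later in this README, and the 2-second bound is illustrative.

// ThresholdMetric is another hypothetical type mirroring OpenSLO v1's
// thresholdMetric: a single query compared against a bound.
type ThresholdMetric struct {
	Query string
	Op    string  // comparison operator, e.g. "lte"
	Value float64 // bound, in the query's units (seconds here)
}

// agentResponseTimeSLO sketches what ExampleAgentResponseTimeSLO might
// assert: P95 response duration at or under two seconds.
func agentResponseTimeSLO() ThresholdMetric {
	return ThresholdMetric{
		Query: `histogram_quantile(0.95, rate(agent_response_duration_seconds_bucket[5m]))`,
		Op:    "lte",
		Value: 2.0,
	}
}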

task-completion.go

Task success and completion metrics:

  • ExampleTaskCompletionRateSLO - Overall task completion rate
  • ExamplePerUserTaskCompletionSLO - Per-user task success tracking
  • ExampleTaskAbandonmentRateSLO - User frustration indicator
  • ExampleMultiStepTaskSuccessSLO - Complex task completion tracking

user-engagement.go

User activity and retention:

  • ExampleDailyActiveUsersSLO - DAU tracking
  • ExampleUserRetentionSLO - 7-day retention rate
  • ExampleSessionDurationSLO - Average session length
  • ExampleConversationTurnsSLO - Conversation depth and engagement
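
Retention is a ratio as well, but computed from warehouse data rather than real-time series. Below is a hedged sketch of the kind of BigQuery SQL that could back ExampleUserRetentionSLO, held as a Go constant; the agent_events table and its columns are illustrative, not this package's actual schema.

// retentionRatioSQL computes 7-day retention: users active again within
// seven days of their first use, over all users in the cohort.
// Hypothetical schema; compare the BigQuery example later in this README.
const retentionRatioSQL = `
WITH cohort AS (
  SELECT user_id, MIN(DATE(event_time)) AS first_day
  FROM agent_events
  GROUP BY user_id
),
returned AS (
  SELECT DISTINCT e.user_id
  FROM agent_events e
  JOIN cohort c ON e.user_id = c.user_id
  WHERE DATE(e.event_time) BETWEEN DATE_ADD(c.first_day, INTERVAL 1 DAY)
                               AND DATE_ADD(c.first_day, INTERVAL 7 DAY)
)
SELECT COUNTIF(r.user_id IS NOT NULL) / COUNT(*) AS retention_7d
FROM cohort c
LEFT JOIN returned r ON r.user_id = c.user_id`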

cost-efficiency.go

AI operational costs and efficiency:

  • ExampleTokenUsagePerTaskSLO - Token efficiency per task
  • ExamplePerUserCostSLO - Per-user cost tracking
  • ExampleCostPerSuccessfulTaskSLO - ROI measurement
  • ExampleCacheHitRateSLO - Response caching efficiency

ai_agents_test.go

Validation tests for all 20 AI agent SLOs

Key Metric Categories

1. Availability & Reliability

Monitor whether agents are accessible and functional when users need them.

2. Quality & Accuracy

Track response quality, user satisfaction, and accuracy (hallucination detection).

3. Performance

Measure response times, first-token latency, and overall system responsiveness.

4. Task Success

Monitor task completion rates, abandonment, and multi-step task success.

5. User Engagement

Track DAU, retention, session depth, and conversation metrics.

6. Cost Efficiency

Monitor token usage, per-user costs, and cache hit rates for sustainability.

Aggregated vs Per-User Metrics

Many of these metrics are defined in both an aggregated and a per-user form (a query sketch of the difference follows the lists below):

Aggregated Metrics: Platform-wide view

  • Total success rate
  • Average response time
  • Overall DAU

Per-User Metrics: Individual experience view

  • Percentage of users with >99% success rate
  • Percentage of users with acceptable latency
  • Distribution of costs per user

This dual approach helps identify:

  • Overall platform health
  • Users with poor experiences
  • Outliers requiring intervention
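
Concretely, the per-user variant changes the shape of the query rather than the metric. The sketch below contrasts the two availability ratios as Go constants, reusing the agent_session_starts_total metric from the Prometheus examples below; the 99% cutoff is illustrative.

// Aggregated: one platform-wide ratio of successful session starts.
const aggregatedAvailability = `
sum(rate(agent_session_starts_total{status="success"}[5m]))
/
sum(rate(agent_session_starts_total[5m]))`

// Per-user: the fraction of users whose own success ratio clears 99%.
// A PromQL comparison without "bool" drops non-matching series, so the
// outer count()s give users-above-threshold over all active users.
const perUserAvailability = `
count(
    sum by (user_id) (rate(agent_session_starts_total{status="success"}[5m]))
  /
    sum by (user_id) (rate(agent_session_starts_total[5m]))
  > 0.99
)
/
count(sum by (user_id) (rate(agent_session_starts_total[5m])))`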

Example Metric Sources

The examples use various data sources:

Prometheus (real-time metrics):

# good events: successful agent session starts per second
sum(rate(agent_session_starts_total{status="success"}[5m]))

# P95 response latency over the last 5 minutes
histogram_quantile(0.95, rate(agent_response_duration_seconds_bucket[5m]))

BigQuery (historical/analytical):

-- users who came back within their first 7 days (numerator of the retention rate)
SELECT COUNT(DISTINCT user_id) FROM returning_users WHERE days_since_first_use <= 7

When to Use These Metrics

AI agent metrics are essential for:

  • AI chatbots and assistants: Customer support, internal tools
  • Agent platforms: Multi-agent orchestration systems
  • Workflow automation: AI-powered task execution
  • Copilot features: Code assistants, writing assistants
  • Autonomous agents: Research agents, data analysis agents

Critical Success Factors for AI Agents

  1. Availability: Agents must be reliably accessible
  2. Quality: Responses must be accurate and helpful
  3. Performance: Fast responses maintain user engagement
  4. Task Success: Users must accomplish their goals
  5. Cost Control: LLM costs must be sustainable
  6. User Retention: Users must find ongoing value

Documentation

Constants

This section is empty.

Variables

This section is empty.

Functions

func ExampleAgentAccuracySLO

func ExampleAgentAccuracySLO() v1.SLO

ExampleAgentAccuracySLO measures the accuracy of agent responses.

func ExampleAgentAvailabilitySLO

func ExampleAgentAvailabilitySLO() v1.SLO

ExampleAgentAvailabilitySLO measures the availability of AI agent services for end users: whether users can successfully start and interact with AI agents (successes vs. errors).

func ExampleAgentFirstTokenLatencySLO

func ExampleAgentFirstTokenLatencySLO() v1.SLO

ExampleAgentFirstTokenLatencySLO measures time to first token (duration) for streaming responses.

func ExampleAgentResponseQualitySLO

func ExampleAgentResponseQualitySLO() v1.SLO

ExampleAgentResponseQualitySLO measures the quality of agent responses based on user feedback.

func ExampleAgentResponseTimeSLO

func ExampleAgentResponseTimeSLO() v1.SLO

ExampleAgentResponseTimeSLO measures the response time (duration) for agent interactions.

func ExampleCacheHitRateSLO

func ExampleCacheHitRateSLO() v1.SLO

ExampleCacheHitRateSLO measures efficiency of response caching.

func ExampleConversationTurnsSLO

func ExampleConversationTurnsSLO() v1.SLO

ExampleConversationTurnsSLO measures the number of turns in agent conversations.

func ExampleCostPerSuccessfulTaskSLO

func ExampleCostPerSuccessfulTaskSLO() v1.SLO

ExampleCostPerSuccessfulTaskSLO measures cost efficiency for successful outcomes.

func ExampleDailyActiveUsersSLO

func ExampleDailyActiveUsersSLO() v1.SLO

ExampleDailyActiveUsersSLO measures daily active users engaging with agents.

func ExampleMultiStepTaskSuccessSLO

func ExampleMultiStepTaskSuccessSLO() v1.SLO

ExampleMultiStepTaskSuccessSLO measures success rate for complex multi-step tasks.

func ExamplePerUserAgentAvailabilitySLO

func ExamplePerUserAgentAvailabilitySLO() v1.SLO

ExamplePerUserAgentAvailabilitySLO measures agent availability for individual users.

func ExamplePerUserCostSLO

func ExamplePerUserCostSLO() v1.SLO

ExamplePerUserCostSLO measures cost efficiency on a per-user basis.

func ExamplePerUserResponseQualitySLO

func ExamplePerUserResponseQualitySLO() v1.SLO

ExamplePerUserResponseQualitySLO tracks response quality on a per-user basis.

func ExamplePerUserResponseTimeSLO

func ExamplePerUserResponseTimeSLO() v1.SLO

ExamplePerUserResponseTimeSLO tracks response time (duration) on a per-user basis.

func ExamplePerUserTaskCompletionSLO

func ExamplePerUserTaskCompletionSLO() v1.SLO

ExamplePerUserTaskCompletionSLO tracks task completion on a per-user basis.

func ExampleSessionDurationSLO

func ExampleSessionDurationSLO() v1.SLO

ExampleSessionDurationSLO measures average user session duration.

func ExampleTaskAbandonmentRateSLO

func ExampleTaskAbandonmentRateSLO() v1.SLO

ExampleTaskAbandonmentRateSLO measures how often users abandon tasks.

func ExampleTaskCompletionRateSLO

func ExampleTaskCompletionRateSLO() v1.SLO

ExampleTaskCompletionRateSLO measures how often agents successfully complete user tasks.

func ExampleTokenUsagePerTaskSLO

func ExampleTokenUsagePerTaskSLO() v1.SLO

ExampleTokenUsagePerTaskSLO measures token efficiency per completed task.

func ExampleUserRetentionSLO

func ExampleUserRetentionSLO() v1.SLO

ExampleUserRetentionSLO measures user retention over time.

func SLOs

func SLOs() []v1.SLO

SLOs returns all of the example AI agent SLOs defined in this package.
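
A minimal consumer might enumerate the examples via SLOs(), for instance to validate or export them. The import path below is hypothetical, and reading Metadata.Name assumes v1.SLO follows the OpenSLO convention that every object carries named metadata.

package main

import (
	"fmt"

	aiagents "example.com/openslo-examples/aiagents" // hypothetical import path
)

func main() {
	// List all 20 example SLOs by name.
	for _, slo := range aiagents.SLOs() {
		fmt.Println(slo.Metadata.Name) // assumed field, per the OpenSLO spec's metadata.name
	}
}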

Types

This section is empty.
