agentsUpdated·Falk Gottlob··updated ·11 min read

Build Your Product Health Agent

End-of-day pulse check on your product. Key metrics, customer satisfaction signals, performance health, and support trends - delivered at 4pm so you know exactly where things stand.

agentsmetricsproduct healthhow-to
Helpful?

Try it live
See this agent running in the sandbox

Stream a simulated run, inspect the notifications it would send on Slack and email, and see exactly where it sits in the 7-stage PM OS flow. No password required.

The short version

The Product Health agent runs daily at 4 PM and gives you one synthesized story across seven dashboards: engagement (30% of health score), conversion (25%), retention (20%), performance (15%), and customer satisfaction (10%). It pulls from Amplitude or Mixpanel, Sentry, Zendesk, PagerDuty, and Slack, and produces a narrative summary with green/yellow/red flags. The point is to stop checking seven tools and start the evening knowing whether your product got healthier or sicker today. The agent catches three things you'd miss otherwise: silent performance creep, support volume that correlates with error rates, and underperforming cohorts hidden inside healthy global metrics.

You leave the office at 5:30pm without knowing whether your product got healthier or sicker today. You know DAU went up or down - maybe. You have a vague sense that something's wrong with the checkout flow. You didn't check NPS. You didn't check error rates. You hope the support team isn't drowning, but you don't actually know.

This is the problem the Product Health Agent solves.

The Real Problem (It's Not the Data, It's the Synthesis)

You have data everywhere. Mixpanel tells you engagement metrics. Datadog tells you error rates. Slack tells you support volume. Amplitude tells you retention. Intercom tells you NPS. Your CRM tells you churn. But you're not checking all of them every day. Even if you did, they'd tell you separate stories.

What you need is a synthesis: One report that tells you the real story of product health today.

The gap looks like this:

What you need to know: Is my product getting better or worse?

What you actually have: Seven different dashboards. No unified view. A 2pm meeting where someone says "I saw in Slack that error rates are up," but you never confirmed it. A customer called at 4:45pm upset about performance, and you don't know if it's a systemic issue or one-off. Support says they're busy but you don't know if it's a surge in volume or a systemic problem they're surfacing.

What you'll do: Send an email asking "Hey, someone check our error rates?" or manually pull data from three tools and stitch it together. By the time you have a picture, it's 6pm and the day is over.

The Product Health Agent synthesizes all of this automatically. Every day at 4pm, you get one report that answers: Is my product healthy?

What This Agent Does

At 4pm every weekday, the agent delivers a comprehensive product health snapshot. It's not a dump of metrics. It's a synthesized narrative that tells you what changed, why it matters, and where to look if something's off.

Key Metrics Dashboard

The agent pulls your core health metrics and shows them alongside yesterday's numbers:

Engagement

  • DAU (Daily Active Users): 45,200 (↑ 2% from yesterday)
  • Session length: 12m 34s (↓ 1% from yesterday)
  • Feature adoption: X% of users using feature Y (→ flat)

Activation & Conversion

  • Signup-to-activation rate: 42% (↓ 3% from yesterday - worth investigating)
  • Free trial conversion: 18% (↑ 1% from yesterday)
  • Onboarding completion: 67% (↓ 2% from yesterday)

Retention

  • 7-day retention: 52% (↓ 1% from yesterday)
  • 30-day retention: 28% (↑ 0.5% from yesterday)
  • Churn cohort [Q1 sign-ups]: 8% monthly churn

Revenue

  • MRR: $847k (↑ $2.4k from yesterday)
  • ARPU: $18.40 (→ flat)
  • Upgrade rate: 12% of free users (↓ 0.5% from yesterday)

Each metric shows: Today's number, direction of change, and trend severity (green for good, yellow for concerning, red for alarm).

Metric Movements (What Changed & Why)

The agent doesn't just show numbers. It explains what moved and why:

↑ Major movement: "DAU up 2% - above usual 0.5% daily growth. Likely due to [email campaign shipped yesterday / new feature launch / external press]."

↓ Concerning movement: "Activation rate down 3%. 2-day average is 45%, so this is notable. Possible causes: [checkout flow change / new user cohort quality / signup volume from lower-intent channel]. Recommend checking."

→ Flat with pattern: "Retention flat but lagging cohort [March 20] is underperforming. 30-day retention is 3% below historical average. Watch this cohort closely."

The agent cross-references data sources to infer cause:

  • Did we ship a feature yesterday? (From commit logs / roadmap) → Might explain metric change
  • Did we run a campaign? (From marketing Slack channel) → Might explain DAU spike
  • Did we change onboarding? (From product Slack) → Might explain activation drop
  • Did we see a surge in support tickets? (From Zendesk) → Might indicate a systemic issue

Customer Satisfaction Pulse

NPS Trend

  • Current NPS: 42 (↓ 2 points from last week)
  • New detractors this week: 3 (customers who were promoters, now detractors)
  • New promoters this week: 5
  • Average response rate: 18% of users surveyed

Specific feedback themes

  • Promoters mention: "Easy to use," "Great support," "Shipped feature X"
  • Detractors mention: "Performance issues," "Confusing UX," "Missing feature Y"

CSAT by feature

  • Feature X: 4.2/5 stars (last month: 4.5) - Declining satisfaction, investigate
  • Feature Y: 3.8/5 stars (new feature, ramping) - Below target, consider UX adjustments

Performance Health

Application performance

  • P99 latency: 1,240ms (↑ 60ms from yesterday)
  • Error rate: 0.8% (↑ 0.1% from yesterday, elevated but not critical)
  • Uptime: 99.9% (→ on target)

Critical errors

  • [Auth flow timeout]: 120 errors in past 24 hours (↑ from 40 yesterday) - INVESTIGATE
  • [Database timeout]: 45 errors (↓ from 60 yesterday) - Improving
  • [Checkout validation]: 22 errors (→ flat)

Infrastructure signals

  • Database query time: 45ms median (↑ 5ms from yesterday)
  • Cache hit rate: 92% (↓ 1% from yesterday)
  • Queue depth: 450 jobs (normal under 500)

If latency or error rates spike, the report highlights it: "🚨 Auth errors up 200% since yesterday. Possible cause: [recent deployment / traffic surge / infrastructure issue]. Recommend checking logs."

Support Volume & Trend

Today's ticket volume

  • New tickets opened: 23 (average 18)
  • Tickets closed: 19 (average 16)
  • Current backlog: 34 tickets (average 28)

Trend

  • 🔴 Backlog growing (opened 23, closed 19)
  • 🟡 Slightly elevated volume but manageable
  • Response time: avg 2.3 hours (SLA: 4 hours)

Top support topics

  • Billing questions: 6 tickets
  • Auth issues: 4 tickets (Correlates with error spike above)
  • Onboarding questions: 3 tickets
  • Performance complaints: 2 tickets (Alert: matches NPS detractor feedback)

The agent flags correlations: "Support volume up 28% and performance complaints are rising. Possible systemic issue: Check error logs and performance metrics."

Today's Summary (Narrative)

The report ends with a 2-3 sentence narrative:

Example 1 (Healthy day): "Product health is solid. DAU up 2%, activation flat, retention stable. Auth errors spiked earlier but resolved. Support volume normal. Confidence: High."

Example 2 (Concerning day): "Mixed day. DAU up slightly but activation dropping (concerning signal). Detractor feedback mentions performance issues. Error logs show auth and checkout slowness. Support backlog growing. Confidence: Medium - recommend check-in with eng on performance issues."

Example 3 (Alert day): "⚠️ Critical alert. Error rate elevated 200% (mainly auth flow). Support backlog growing. NPS down. Onboarding completions dropped 5%. Potential systemic issue. Recommend emergency response. Confidence: High."

How It Works: The Synthesis Logic

The agent doesn't pull random metrics. It builds the health picture from five sources, weighted by importance:

1. Engagement trajectory (30% of health score)

  • Is DAU trending up or down?
  • Is session length healthy?
  • Are active users returning?

2. Conversion health (25% of health score)

  • Signup quality is up (more conversions)?
  • Onboarding is strong (high completion)?
  • Free trial to paid conversion is healthy?

3. Retention & churn (20% of health score)

  • Cohorts retaining well?
  • Churn rate stable?
  • Are we losing previous gains?

4. Performance health (15% of health score)

  • Error rates low?
  • Latency acceptable?
  • Uptime solid?

5. Customer satisfaction (10% of health score)

  • NPS trending up?
  • Support volume reasonable?
  • No systemic complaints?

The agent weighs these together to produce an overall health score: Green (healthy), Yellow (watch it), or Red (act now).

Why This Actually Works

I built this because I was checking seven dashboards at different times of day and getting conflicting pictures. On Tuesday, "DAU is up so we're doing great." On Wednesday, "Activation dropped so something broke." On Thursday, "Support queue is long but I don't know if it's us or just a busy day."

The Product Health Agent catches three things you'd normally miss:

The silent performance problem: Latency creeps up from 800ms to 1,240ms over a week. If you only check daily, you see a small change and miss it. But the agent compares to yesterday AND to a rolling 7-day average, so you see the trend.

The support signal: Support tickets spike. But maybe it's just a busy day, or maybe it's a systemic issue. The agent correlates support volume with error rates, NPS feedback, and feature changes to tell you if it's a real problem or noise.

The cohort weakness: Your overall retention looks fine. But the March 20 cohort is underperforming. If you only look at global metrics, you'd miss it. The agent breaks down by cohort and flags outliers.

Data sources and setup

Prerequisites: Complete the Claude setup guide first. This agent needs the following MCP connections active:

  • Amplitude or Mixpanel - reads DAU, session length, retention, and feature adoption
  • Sentry - reads error rates and performance metrics
  • Zendesk - reads support ticket volume and trends
  • PagerDuty - reads incident data and on-call status (optional)
  • Slack - reads #product and #support channels for context

Schedule: Runs daily at 4:00 PM via cron. Output posts to Slack.

Quick test: Open Claude and ask: "Give me today's product health score: error rates, support volume trends, key feature adoption, and performance metrics."

For the full agent fleet and scheduling details, see Your AI Agent Fleet.

What Good Looks Like

After running this agent for two weeks:

Week 1: You start getting a daily snapshot. You notice it catches things you wouldn't have checked manually (cohort underperformance, error rate creep). You share the report with leadership. They start asking fewer generic questions because the report answers them.

Week 2: You're acting on yellow flags earlier. Before a problem becomes a crisis. When the agent flags "activation dropping" on Tuesday, you investigate. You find the checkout flow change from the new design broke something subtle. You fix it by Wednesday. Without the daily report, you wouldn't have noticed until Friday.

Week 3: Your team is more responsive. The report surfaces real problems (not noise), so when you say "we need to look at X," the team trusts it. Support and eng are less defensive because they can see when they're solving problems.

Week 4: You're telling a better story to leadership. Your exec reviews have data backing them up. "Product health is improving" isn't vague anymore. You show the specific metrics, the trend, the correlation with launches.

The Full Agent Prompt

Ready to build this? The complete agent instruction file is available at /artifacts/agent-product-health.md. It includes:

  • All metrics and data sources
  • Health scoring logic
  • Threshold definitions
  • Correlation rules for cross-source insights
  • Output format
  • Complete test prompt to validate

Copy it into your agent platform, connect your analytics and monitoring tools, and run it. Setup takes 45 minutes. It saves you 20 minutes daily in manual dashboard checking.

Final Thought

You can't fix what you can't see. And you can't see it if you're checking seven different dashboards at seven different times of day, each with their own story.

The Product Health Agent gives you one story. Every day at 4pm. Based on all your data. Synthesized into a narrative. Flagging what matters. Explaining what changed.

Your product's health is always changing. This report makes sure you know what direction it's moving.


Ready to build? Start with the full agent prompt at /artifacts/agent-product-health.md. Copy-paste-ready, includes all metrics, thresholds, health scoring logic, and a test prompt.

Share this post

Download the artifact

Ready to use. Copy into your project or share with your team.

Download

Also on Medium

Full archive →

Keep Reading

Posts you might find interesting based on what you just read.