Putting It All Together: Debugging in Action

Follow a real investigation: "Person creation is slow sometimes" - from problem to solution.

The Scenario

User Report: "Creating a person is slow sometimes - takes 5+ seconds!"

Without observability, this would be a nightmare to debug. Let's see how our tools help us solve it.

Action: Look at person.service.execution.time metrics

Discovery:

Pattern: Every 3rd request is slow (round-robin load balancing)

Conclusion: Only service-2 is affected!

Action: Check service health status

Discovery:

Conclusion: service-2 is struggling, confirmed by health checks

Action: Search for warnings from service-2

Query: service_name:"service-2" AND level:"WARN"

Discovery: Multiple warnings about connection pool exhaustion:

2024-04-23 14:23:45 WARN [abc-123] HikariPool
  "Connection pool exhausted, waiting..."

Pattern: Happens every ~5 minutes on service-2

Conclusion: Root cause identified! 🎯

Check configuration:

application.yml (service-2):
  datasource:
    hikari:
      maximum-pool-size: 5  ❌ TOO SMALL!

Other services have: maximum-pool-size: 20 ✅

Fix: Update service-2 config to match (maximum-pool-size: 20)

Problem Solved! 🎉