devops, advanced

Monitoring and Alerting Setup

Generates monitoring configuration with dashboards, alerts, and runbooks.

Prompt

Design a monitoring and alerting setup for the following system:

**System**: {{system}}
**Stack**: {{stack}}
**Monitoring tool**: {{tool}} (Prometheus/Datadog/Grafana/CloudWatch)
**SLOs**: {{slos}}

Provide:
1. **Metrics to collect**:
   - RED metrics (Rate, Errors, Duration) for each service
   - USE metrics (Utilization, Saturation, Errors) for infrastructure
   - Business metrics specific to {{system}}
2. **Dashboard layout**: panels, queries, and thresholds
3. **Alert rules**: with severity levels (page/warn/info), thresholds, and evaluation windows
4. **Runbooks**: for each alert -- what it means, how to diagnose, how to fix
5. **SLO tracking**: error budget calculation and burn rate alerts
6. **On-call routing**: escalation paths based on severity

Include the actual configuration (PromQL queries, dashboard JSON, or alert YAML).
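
Note for readers (not part of the prompt text): item 5 asks for error budget and burn rate calculations, so it helps to have the arithmetic at hand when reviewing the model's output. The following is a minimal Python sketch, assuming a ratio-based availability SLO and a 30-day SLO period. The function names and sample numbers are illustrative and do not come from any particular monitoring tool.

```python
def error_budget(slo_target: float) -> float:
    """Fraction of requests allowed to fail, e.g. 0.001 for a 99.9% SLO."""
    return 1.0 - slo_target


def burn_rate(error_ratio: float, slo_target: float) -> float:
    """How fast the budget is being spent; 1.0 means exactly on budget."""
    return error_ratio / error_budget(slo_target)


def budget_consumed(error_ratio: float, slo_target: float,
                    window_hours: float,
                    slo_period_hours: float = 30 * 24) -> float:
    """Fraction of the whole period's error budget used during the window."""
    return burn_rate(error_ratio, slo_target) * window_hours / slo_period_hours


if __name__ == "__main__":
    # Hypothetical numbers: 99.9% SLO, 1.44% of requests failing for one hour.
    slo = 0.999
    observed = 0.0144
    print(f"burn rate: {burn_rate(observed, slo):.1f}")
    print(f"budget used in 1h: {budget_consumed(observed, slo, 1):.1%}")
```

A burn rate of 1.0 spends the budget exactly over the SLO period, so a burn rate alert fires when the value stays above a chosen multiple for the evaluation window. The multiple and window are policy choices to set per service.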

Variables

{{system}}, {{stack}}, {{tool}}, {{slos}}
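
The placeholders can be filled by hand or programmatically. The following is a minimal Python sketch of the substitution, assuming the `{{name}}` syntax shown in the prompt. The `fill_template` helper and the sample values are hypothetical and exist only for illustration.

```python
import re

# Matches placeholders of the form {{name}}.
PLACEHOLDER = re.compile(r"\{\{(\w+)\}\}")


def fill_template(template: str, values: dict) -> str:
    """Replace every {{name}} in template; fail loudly if a value is missing."""
    missing = sorted(set(PLACEHOLDER.findall(template)) - set(values))
    if missing:
        raise KeyError("missing values for: " + ", ".join(missing))
    return PLACEHOLDER.sub(lambda m: values[m.group(1)], template)


if __name__ == "__main__":
    # Excerpt of the prompt header; the values below are made-up examples.
    template = (
        "**System**: {{system}}\n"
        "**Stack**: {{stack}}\n"
        "**Monitoring tool**: {{tool}}\n"
        "**SLOs**: {{slos}}\n"
    )
    values = {
        "system": "checkout API",
        "stack": "Go services on Kubernetes with PostgreSQL",
        "tool": "Prometheus",
        "slos": "99.9% availability over 30 days",
    }
    print(fill_template(template, values))
```

Raising on a missing value is a deliberate choice here: a prompt sent with a literal `{{slos}}` left in it tends to produce generic output rather than an error.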

Use Cases

  • Setting up monitoring for new services
  • Improving observability coverage
  • SRE onboarding documentation

Compatible Models

claude-sonnet-4-20250514, gpt-4o

Tags

monitoring, alerting, observability, sre

Details

Author: PromptIndex
Updated: 2026-04-01
Difficulty: advanced
