Use when defining SLIs/SLOs, managing error budgets, or building reliable systems at scale. Invoke for incident management, chaos engineering, toil reduction, capacity planning.
6.4
Rating
0
Installs
DevOps & Infrastructure
Category
Excellent SRE skill with comprehensive coverage of reliability engineering practices. The description clearly articulates when to invoke the skill (SLO/SLI definition, error budgets, incident management, chaos engineering). Structure is exemplary with a clean reference table pointing to detailed guidance files. Task knowledge is strong, covering the full SRE workflow from assessment through automation. The constraint lists (MUST DO/MUST NOT DO) provide actionable guardrails. Novelty is solid - SRE tasks like error budget calculation, chaos experiment design, and toil automation are token-intensive and benefit from codified expertise. Slightly less novel than highly specialized domains since SRE principles are well-documented, but the integrated workflow and decision framework add meaningful value. A well-crafted skill that would significantly assist an agent in reliability engineering tasks.
Loading SKILL.md…