Skip to main content

AI Skills for PagerDuty

Discover 28+ Incident management

Install any skill with /learn

/learn @owner/skill-name

Browse AI Skills for PagerDuty

wshobson wshobson / incident-runbook-templates

28.6K

Create structured incident response runbooks with step-by-step procedures, escalation paths, and recovery actions. Use when building runbooks, responding to incidents, or establishing incident response procedures.

plugin developmentslackpagerdutyfrontendstripe +3

openclaw openclaw / pagerduty

996

Manage incidents, on-call schedules, and alerts via PagerDuty API. Trigger and resolve incidents programmatically.

operationspagerdutyfrontendapi

majiayu000 majiayu000 / ai-mlops

80

Production MLOps and ML/LLM/agent security skill for deploying and operating ML systems in production (registry + CI/CD, serving, monitoring/drift, evaluation loops, incident response/runbooks, and governance), including GenAI security (prompt injection, jailbreaks, RAG security, privacy, and sup...

developmentpagerdutyfrontenddesign +3

majiayu000 majiayu000 / alerting-rules-agent

80

Designs and configures alerting rules for monitoring systems

developmentpagerdutyfrontenddesign +2

majiayu000 majiayu000 / architecture-discipline

80

Use when designing/modifying system architecture or evaluating technology choices. Enforces 7-section TodoWrite with 22+ items. Triggers: "design architecture", "system design", "architectural decision", "should we use [tech]", "compare [A] vs [B]", "add new service", "microservices", "database c...

developmentawsgcpfrontenddesign +7

majiayu000 majiayu000 / chaos-engineer

80

Expert in resilience testing, fault injection, and building anti-fragile systems using controlled experiments.

developmentkubernetesawsfrontenddesign +6

majiayu000 majiayu000 / clay-cost-tuning

80

Optimize Clay costs through tier selection, sampling, and usage monitoring. Use when analyzing Clay billing, reducing API costs, or implementing usage monitoring and budget alerts. Trigger with phrases like "clay cost", "clay billing", "reduce clay costs", "clay pricing", "clay expensive", "clay ...

marketingslackemailfrontenddocx +2

majiayu000 majiayu000 / clay-cost-tuning

80

Optimize Clay costs through tier selection, sampling, and usage monitoring. Use when analyzing Clay billing, reducing API costs, or implementing usage monitoring and budget alerts. Trigger with phrases like "clay cost", "clay billing", "reduce clay costs", "clay pricing", "clay expensive", "clay ...

marketingslackemailfrontenddocx +2

majiayu000 majiayu000 / coderabbit-cost-tuning

80

Optimize CodeRabbit costs through tier selection, sampling, and usage monitoring. Use when analyzing CodeRabbit billing, reducing API costs, or implementing usage monitoring and budget alerts. Trigger with phrases like "coderabbit cost", "coderabbit billing", "reduce coderabbit costs", "coderabbi...

marketingslackemailfrontenddocx +2

majiayu000 majiayu000 / Cost Observability and Monitoring

80

Techniques for gaining visibility into cloud spending, attributing costs to business units, and detecting financial anomalies.

marketingawspagerdutystripeapi +3

majiayu000 majiayu000 / Data Quality Monitoring

80

Techniques and tools for ensuring the accuracy, completeness, and reliability of data across the pipeline.

data-analyticsslackpagerdutyfrontenddatabase

majiayu000 majiayu000 / deployment-automation-enforcer

80

Use when designing deployment pipelines, CI/CD, terraform, or infrastructure automation. Enforces rollback checkpoint then TodoWrite with 19+ items. Triggers: "deploy", "CI/CD", "kubernetes", "terraform". If thinking "rollback later" - use this first.

developmentgithubgitlabfrontenddesign +7

majiayu000 majiayu000 / devops-sre

80

Use this skill when designing or reviewing CI/CD pipelines, deployment strategies, observability systems, incident response, or any system involving production operations and reliability. Applies operational thinking to specifications, designs, and implementations.

developmentdockerkubernetesfrontenddesign +10

majiayu000 majiayu000 / error-coordinator

80

Expert error coordinator specializing in distributed error handling, failure recovery, and system resilience. Masters error correlation, cascade prevention, and automated recovery strategies across multi-agent systems with focus on minimizing impact and learning from failures.

developmentsentrypagerdutyfrontenddesign +4

majiayu000 majiayu000 / Escalation and Ownership

80

Defining clear ownership models and escalation procedures to ensure incidents are handled by the right people at the right time.

developmentpagerdutyfrontendstripe +2

majiayu000 majiayu000 / exa-cost-tuning

80

Optimize Exa costs through tier selection, sampling, and usage monitoring. Use when analyzing Exa billing, reducing API costs, or implementing usage monitoring and budget alerts. Trigger with phrases like "exa cost", "exa billing", "reduce exa costs", "exa pricing", "exa expensive", "exa budget".

data-analyticsslackemailfrontenddocx +2

majiayu000 majiayu000 / exa-cost-tuning

80

Optimize Exa costs through tier selection, sampling, and usage monitoring. Use when analyzing Exa billing, reducing API costs, or implementing usage monitoring and budget alerts. Trigger with phrases like "exa cost", "exa billing", "reduce exa costs", "exa pricing", "exa expensive", "exa budget".

data-analyticsslackemailfrontenddocx +2

majiayu000 majiayu000 / monitoring-error-rates

80

This skill enables Claude to monitor and analyze application error rates to improve reliability. It is used when the user needs to track and understand errors occurring in their application, including HTTP errors, application exceptions, database errors, external API errors, background job errors...

claudedevelopmentpagerdutyapidatabase

majiayu000 majiayu000 / alert-management

80

Implement comprehensive alert management with PagerDuty, escalation policies, and incident coordination. Use when setting up alerting systems, managing on-call schedules, or coordinating incident response.

developmentpagerdutyfrontenddesign +3

majiayu000 majiayu000 / alerting

80

Real-time alerting and notification system for Univers infrastructure. Use this when you need to monitor system health, service status, and send proactive alerts when thresholds are exceeded or services fail.

developmentslackmicrosoft-teamsfrontenddocx +4