Remove use-cases automated-diagnostics
article thumbnail

Debugging Kubernetes with Automated Runbooks & Ephemeral Containers by Jake Cohen

PagerDuty

In our previous blog , we discussed the difficulty in capturing all relevant diagnostics during an incident before a “band-aid” fix is applied. In Kubernetes, this is done using the kubectl exec command. For these reasons, it is best to use automation that removes the need for users to exec into running pods.

article thumbnail

Automate Major Incident Management Step-by-Step for Better, Faster Response by Hannah Culver

PagerDuty

This can be done by embedding automation across the incident management lifecycle for major incidents, and bringing in humans where it makes sense. Before you know there’s an incident Before responders know an incident is happening, there’s a great opportunity to let machines take the brunt of the work via event-driven automation.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Automation Seasons Freezings Wrap Up and New Year’s Resolutions by Madeline Stack

PagerDuty

Hopefully you have had the chance to follow along with us for the month of December for Seasons Freezings , the time of year you are locked out of production, so you have time to explore new ideas like automation. Customers still tell us that toil is plaguing them. Design principles for creating self-service automation.

article thumbnail

Extending Automation Actions Across the PagerDuty Platform by Joseph Mandros

PagerDuty

While many PagerDuty products and features exist to make this mission a reality, we are going to focus on the latest and greatest with PagerDuty Automation Actions ® , part of the PagerDuty Process Automation ® portfolio. New Updates and Integrations With Automation Actions . Automation Actions for Customer Service Ops.

article thumbnail

Automating Common Diagnostics for Kubernetes, Linux, and other Common Components by Joseph Mandros

PagerDuty

Register for our Automated Diagnostics webinar event on August 16th to learn about common diagnostics for common components and how we provide out-of-the-box job templates for you to get started right away. In this blog, we’re going to talk about some basic diagnostics examples for components that are most relevant to our users.

Alert 65
article thumbnail

What is Automated Diagnostics and Why Should You Care? by Joseph Mandros

PagerDuty

Given this statistic, this means that 50% of an incident’s lifespan is spent on the beginning stages of an incident (the diagnostic and triage phases), rather than on actual remediative actions. Automation in Incident Response. Before we go any further, let’s first define diagnostic data. Applying Automated Diagnostics.

article thumbnail

APAC Retrospective: Learnings from a Year of Tech Outages – Dismantling Knowledge Silos by David Ridge

PagerDuty

This gap in knowledge, skill and access is known as “The Automation Gap”. Using an automation orchestration tool to enable event-driven automation, organisations can empower on-call responders with immediate access to automated runbooks, personally crafted by subject matter experts.

Outage 52