February 9, 2021 By Andrea Sayles 3 min read

Business continuity and disaster recovery (BCDR) plans need to keep pace with increasing business demands and growth in physical and compute infrastructures. For a hyperconnected digital business, even a small disruptive event can ripple through the entire organization. Today most businesses have BCDR plans. But do these plans deliver operational resilience during the moment of truth?

With the world becoming increasingly uncertain and risks proliferating, IT leaders must look beyond crisis management to be able to achieve operational resilience. A review of an organization’s operational resilience posture must prioritize the following four areas.

Data center resilience

Many organizations still have aging data center facilities that aren’t well aligned with current business and technology demands. Cloud migration – even cloud repatriation – in many cases takes place in multiple stages and is seldom planned or governed in an integrated manner. As complexity increases, there may be blind spots that could be easily missed.

For example, power distribution and cooling systems in data centers are often weak links in BCDR plans. In many legacy data centers, cooling systems are connected separately to diesel generators, not to the uninterrupted power supply. This puts compute infrastructures at risk of overheating should there be a generator failure. With business growth and changes in compute infrastructures, power equipment and capacities can become out of alignment, exposing your business to huge risk.

Modern businesses need next-generation data centers that are failsafe, responsive and workload-aware, while complying with industry standards, regulations and green/energy norms.

Integrated business continuity strategy

Closely aligned with a data center strategy should be a holistic BCDR strategy that considers all types of risks (system failure, natural disaster, human error or cyberattack) and outage scenarios, and provides plans for mitigation with minimal or no impact to the business. The strategy must also consider organization and culture, business processes, technology, standards and regulations. And no strategy or plan can be effective unless it is regularly tested. A well-planned data center design, integrated system testing and a regular functional test of the BCDR plan can help IT managers quickly detect equipment fault or vulnerabilities in near real time.

Recoverability and reliability

Business continuity best practices suggest that backup sites be built at physically different locations, in different seismic zones. Cloud-based data protection and recovery allows organizations to back up and store critical data and applications off-site, so they are protected from local disruptions. However, managing backup and storage as well as disaster and cyber recovery for a hybrid environment with hundreds or thousands of applications isn’t easy. Organizations simply don’t have the resources or the skills and expertise to do so.

Recovery at scale within minutes or seconds of an outage in such complex environments can only be achieved with an orchestrated recovery platform – a platform that also allows frequent tests to establish recovery reliability. While manual tests are slow, error-prone and dependent on availability of skilled resources, an orchestrated recovery platform can help eliminate human error and improve recoverability and recovery reliability.

Rapid response and recovery

While many organizations have robust BCDR plans, the need for planned production downtime inhibits their test schedules. Some of them use manual runbooks to perform failover/failbacks. This requires a significant amount of training and experience. By automating the runbook, tests and failover/failback processes, organizations can conduct regular disaster and cyber recovery drills that can keep the runbooks current and the execution smooth during real disasters.

It’s not enough to have data backups or IT infrastructure components available in real time. Organizations need the ability to quickly recover critical applications and data supporting business operations. Increasing cases of cyberattacks put the highest emphasis on the integrity of data being replicated in real time, as the backup data itself can also be corrupted.

As the stakes get higher, achieving operational resilience is a business imperative. Many organizations have witnessed devastating outages over the past few years, some of which could have been avoided. The cost of ignoring these situations will be increasingly dear in a post-COVID, hyper-digitized era.

To learn more about how a small disruptive event can have a ripple effect across your company, and what you can do to prevent it, explore the Moment of Truth.

Was this article helpful?
YesNo

More from Cloud

A clear path to value: Overcome challenges on your FinOps journey 

3 min read - In recent years, cloud adoption services have accelerated, with companies increasingly moving from traditional on-premises hosting to public cloud solutions. However, the rise of hybrid and multi-cloud patterns has led to challenges in optimizing value and controlling cloud expenditure, resulting in a shift from capital to operational expenses.   According to a Gartner report, cloud operational expenses are expected to surpass traditional IT spending, reflecting the ongoing transformation in expenditure patterns by 2025. FinOps is an evolving cloud financial management discipline…

IBM Power8 end of service: What are my options?

3 min read - IBM Power8® generation of IBM Power Systems was introduced ten years ago and it is now time to retire that generation. The end-of-service (EoS) support for the entire IBM Power8 server line is scheduled for this year, commencing in March 2024 and concluding in October 2024. EoS dates vary by model: 31 March 2024: maintenance expires for Power Systems S812LC, S822, S822L, 822LC, 824 and 824L.ha 31 May 2024: maintenance expires for Power Systems S812L, S814 and 822LC. 31 October…

24 IBM offerings winning TrustRadius 2024 Top Rated Awards

2 min read - TrustRadius is a buyer intelligence platform for business technology. Comprehensive product information, in-depth customer insights and peer conversations enable buyers to make confident decisions. “Earning a Top Rated Award means the vendor has excellent customer satisfaction and proven credibility. It’s based entirely on reviews and customer sentiment,” said Becky Susko, TrustRadius, Marketing Program Manager of Awards. Top Rated Awards have to be earned: Gain 10+ new reviews in the past 12 months Earn a trScore of 7.5 or higher from…

IBM Newsletters

Get our newsletters and topic updates that deliver the latest thought leadership and insights on emerging trends.
Subscribe now More newsletters