Contingency Planning

Description   Related Tools    
Toggle All | Print Page Print Page
Background
Contingency planning can be defined in a number of ways. The National Institute of Standards and Technology (NIST) defines contingency planning as management policies and procedures designed to maintain or restore business operations, including computer operations, possibly at an alternate location, in the event of emergency, system failure, or disaster. The Information Technology Infrastructure Library (ITIL) defines disaster recovery as a series of processes that focus only upon the recovery processes, principally in response to physical disaster, that are contained within business continuity management (BCM). The Department of Health and Human Services (HHS) Enterprise Performance Life Cycle (EPLC) defines a contingency/disaster recovery plan as the strategy and organized course of action that is to be taken if things don't go as planned or if there is a loss of use of the established business product or system due to a disaster such as a flood, fire, computer virus, or major failure.
 

Overview
Contingency planning is one component of a much broader emergency preparedness process that includes items such as business practices, operational continuity, and disaster recovery planning. Preparing for such events often involves implementing policies and processes at an organizational level and may require numerous plans to properly prepare for, respond to, recover from, and continue activities if impacted by an event. Project managers must also consider the impacts of disruptions and plan, in alignment with organizational standards and policies, for such events. As one component of a comprehensive risk management approach, contingency planning should identify potential vulnerabilities and threats and then implement approaches to either prevent such incidents from happening or limit their potential impact. CDC's Office of Security and Emergency Preparedness (OSEP) defines a vulnerability as anything that would represent CDC's susceptibility to harm, be it loss of capability, staff, property, reputation, etc. OSEP defines a threat as representing an event that can adversely impact CDC personnel, property or operations. Threats can generally be grouped into three category types:

Although contingency planning sometimes is thought of as an Operations and Maintenance Phase activity, contingency measures should be identified and integrated at all phases of the project life cycle. NIST defines a seven-step contingency planning process to developing and maintaining a viable contingency planning program. These seven progressive steps are designed to be integrated throughout a project's life cycle and help guide stakeholders in the planning, development, implementation, key success factors, and maintenance of contingency plans.

  1. Identify any specific regulatory requirements related to contingency planning. Develop a formal contingency planning policy statement that provides stakeholders the authority and guidance necessary to develop an effective contingency plan. Obtain executive approval, and publish policies such policies.
  2. Conduct a business impact analysis (BIA) to identify and prioritize critical systems, business processes, and components. Include impact of events, allowable outage durations, and recovery priorities.
  3. Identify and implement preventive controls and measures to reduce the effects of disruptions, increase availability, and reduce contingency costs.
  4. Develop recovery strategies ensuring critical systems, business processes, infrastructure, etc can be recovered quickly and effectively following a disruption. Integrate them into system architecture.
  5. Develop contingency plans containing detailed guidance and procedures to recover from disruptions.
  6. Plan testing, training, and exercises to reinforce, validate, and test contingency plans to identify gaps and to prepare recovery personnel for unforeseen events. Document lessons learned and incorporate them into updates to contingency plans.
  7. Maintain contingency plans as living documents. Update them regularly to reflect changes in any influencing factors.

Contingency plan development is a critical component in the process of developing and implementing a comprehensive emergency preparedness program. In general, as defined by NIST, there are five main components of a project contingency plan:

For contingency planning to be successful, stakeholders must continuously reexamine areas of operational importance with a focus on things such as business processes, systems, and alternatives analysis; recovery strategies, maintenance, training, and plan execution. These activities occur at both an organization and project level. Information gained is used to develop plans addressing specific areas of importance. Types of contingency plans that should be considered may include:

NIST
The image above shows how NIST relates the various types of contingency plans

Contingency plans are developed to facilitate responses to anything that may impact normal operations. These plans should contain information and strategies designed to guide stakeholders in the restoration of normal operations and describe strategies for ensuring the recovery of business products and operations in accordance with defined objectives and timeframes. The actual type(s) of plan(s) created, the information they contain, and the defined response(s) are dependant upon factors such as:

For projects, the development of a strong contingency plan must begin early in a project's life with the identification of items such as related organizational and operational policies and procedures, project requirements, and availability requirements of the project's product or service Planning activities should continue throughout the project's life as concepts evolve into designs and solutions are incorporated throughout the product's development, testing, and implementation. For example, NIST identifies that:

NIST identifies three high-level phases that should be considered when planning how post disruption/disaster activities should be executed.

  1. Notification/Activation Phase includes the process of beginning the recovery process through the notification of recovery personnel and stakeholders, performing damage assessments, etc
  2. Recovery Phase includes the actions performed by the recovery teams to repair and/or restore operations in accordance with defined contingency/disaster recovery plans
  3. Reconstitution Phase includes actions necessary to restore normal operating conditions

What actions are taken, details of those actions, processes for executing them, and response activities associated with these three phases should be detailed within the appropriate contingency plan(s). It is then the responsibility of the System Owner to ensure that copies of such plans are distributed and details of which are communicated to the appropriate stakeholders which may include, but is not limited to:

Additional information on contingency planning can be found in the NIST Contingency Planning Guide for information Technology Systems. Information on risk management can be found in the CDC Unified Process Risk Management Practices Guide and the NIST Special Publication 800-30, Risk Management Guide to Information Technology Systems.
 


Best Practices

Practice Activities