Professional Documents
Culture Documents
Module 3.1
Reliability
Information should be reliable and correct
Timeliness
Information must be available whenever required
Impact of Downtime
Lost Productivity Number of employees impacted (x hours out * hourly rate)
Know the downtime costs (per hour, day, two days...)
Lost Revenue Direct loss Compensatory payments Lost future revenue Billing losses Investment losses
Financial Performance Revenue recognition Cash flow Lost discounts (A/P) Payment guarantees Credit rating Stock price
Other Expenses Temporary employees, equipment rental, overtime costs, extra shipping costs, travel expenses...
2009 EMC Corporation. All rights reserved. Introduction to Business Continuity - 6
Response Time
Recovery Time
Detection
Repair
Restoration
Incident
Diagnosis
Recovery
Time Incident
Repair time
MTBF: Average time available for a system or component to perform its normal operations between failures MTTR: Average time required to repair a failed component
2% 1% 0.2%
99.9%
99.99%
0.1%
0.01%
8 hrs 45 min
52.5 min
10 min 5 sec
1 min
99.999%
99.9999%
2009 EMC Corporation. All rights reserved.
0.001%
0.0001%
5.25 min
31.5 sec
6 sec
0.6 sec
Introduction to Business Continuity - 8
BC Terminologies
Disaster recovery
Coordinated process of restoring systems, data, and infrastructure required to support ongoing business operations in the event of a disaster Restoring previous copy of data and applying logs to that copy to bring it to a known point of consistency Generally implies use of backup technology
Disaster restart
Process of restarting from disaster using mirrored consistent copies of data and applications
BC Terminologies (Cont.)
Recovery Point Objective (RPO) Point in time to which systems and data must be recovered after an outage Amount of data loss that a business can endure Recovery Time Objective (RTO) Time within which systems, applications, or functions must be recovered after an outage Amount of downtime that a business can endure and survive
Weeks
Days
Hours Minutes
Minutes Seconds
Synchronous Replication
Seconds
Global Cluster
Recovery-point objective
2009 EMC Corporation. All rights reserved.
Recovery-time objective
Introduction to Business Continuity - 10
Designing and developing contingency plans and disaster recovery plan (DR Plan)
BC Technology Solutions
Following are the solutions and supporting technologies that enable business continuity and uninterrupted data availability:
Single point of failure Multi-pathing software Backup and replication
Backup recovery Local replication
Remote replication
Client
FC Switches
IP
Storage Array Storage Array Remote Site Redundant Network
Multi-pathing Software
Configuration of multiple paths increases data availability Even with multiple paths, if a path fails I/O will not reroute unless system recognizes that it has an alternate path Multi-pathing software helps to recognize and utilizes alternate I/O path to data
Remote Replication
Data from the production devices is copied to replica devices on a remote array In the event of a failure, applications can continue to run from the target device
Backup/Restore
Backup to tape has been a predominant method to ensure business continuity Frequency of backup is depend on RPO/RTO requirements
2009 EMC Corporation. All rights reserved. Introduction to Business Continuity - 15
Module Summary
Key points covered in this module: Importance of Business Continuity
Resides between application and SCSI device driver Provides Intelligent I/O path management Transparent to the application Automatic detection and recovery from host-to-array path failures
2009 EMC Corporation. All rights reserved.
Storage Network
STORAGE