Professional Documents
Culture Documents
EMC Solutions
Abstract
This white paper demonstrates how you can use EMC Data Domain Boost for
Databases and Applications (DDBDA) to protect SAP HANA by enabling backup,
recovery, and migration across physical and virtualized environments. DDBDA
provides you with a simple and effective way to accomplish these operations.
January 2015
Copyright 2015 EMC Corporation. All Rights Reserved.
For the most up-to-date listing of EMC product names, see EMC Corporation
Trademarks on EMC.com.
All trademarks used herein are the property of their respective owners.
Introduction.......................................................................................................................................... 7
Purpose ........................................................................................................................................... 7
Scope .............................................................................................................................................. 7
Audience ......................................................................................................................................... 7
Conclusion ......................................................................................................................................... 27
Summary ....................................................................................................................................... 27
Findings ......................................................................................................................................... 27
Appendix ............................................................................................................................................ 29
Since its launch, SAP HANA has been an extremely successful product. Initially used
for SAP Business Warehouse application analytics, it now supports SAP Business
Suite, the flagship enterprise resource planning (ERP) application. As well as
supporting a wider range of applications, SAP HANA now has more deployment
options, such as a factory-configured appliance, tailored datacenter integration (TDI),
and virtualization with VMware.
Because SAP HANA is so widely used, customers face many challenges as they
deploy and expand their SAP HANA environment. IT organizations must ensure that:
SAP HANA has the same levels of data protection as a traditional storage-
based database management system, while accommodating shrinking
backup timeframes.
CIOs increase productivity and efficiency in maintaining SAP systems while
minimizing the impact of backup processes that storage administrators and
infrastructures must support.
Administrators address the efficiency challenges posed by the distributed
storage of data across different SAP software components and systems (ERP,
CRM, and so on).
EMC has developed Data Domain Boost for Databases and Applications (DDBDA), a
software tool that empowers database administrators to use SAP HANA Studio for
managing backup operations, performing restores, and leveraging all the benefits
offered by EMCs Data Domain and Data Domain Boost products.
In this White Paper, we show how DDBDA can enable simple, flexible, efficient, and
optimized backups for your SAP HANA environment with the following benefits:
A 99 percent reduction in network utilization and backup storage
requirements by transferring and storing only unique, deduplicated data
Easier migration to virtualized SAP HANA, thus realizing the TCO, agility, and
availability advantages of the vSphere environment
Compatibility between the DDBDA as your SAP HANA backup solution and the
virtual HANA database, including backup, specific backup set restore, most-
recent restore, and point-in-time restore
Simpler, more efficient manual or automated backup and restore processes
using EMC Data Domain
This solution was tested and validated in several scenarios, including migration,
backup, and restore between both physical and virtual HANA environments.
Scope This document provides test validation of migration, backup, and recovery of virtual
SAP HANA with EMC DDBDA and describes the following:
Key solution technologies
Key business benefits of using DDBDA with virtual SAP HANA
How to configure DDBDA for virtual SAP HANA migration and backup
How to recover virtual SAP HANA with DDBDA
SAP does not currently support incremental or differential data backups. For more
information on SAP HANA backup and recovery, please refer to the SAP HANA
Administration Guide and SAP Note 1642148.
SAP does not currently support Backint to copy an existing SAP system on SAP HANA.
EMC Data Domain supports multiple protocols, such as NFS, and it can be used as a
destination for the SAP HANA backup activity with type FILE to restore an SAP
application with a different hostname or schema in the target system. This is not
included in the scope of this paper.
Audience This document is intended for infrastructure architecture and operation owners, as
well as SAP HANA datacenter integration architects, and SAP HANA operations teams,
who are responsible for defining and implementing backup and recovery solutions for
SAP HANA that span both physical and virtualized environments.
In this solution, EMC VNX provides the persistence for data and log volumes for both
the physical and virtual HANA systems. EMC Data Domain protects the HANA
databases and migrates the databases from the physical environment to the
virtualized environment.
Software resources
Table 2 describes the software components used in the deployment.
Data Domain deduplication storage systems reduce the amount of data to process by
only backing up data that have not previously been processed, skipping all
unnecessary duplicates in the process. This can reduce backup storage requirements
by up to 30 times and can protect up to 28.5 PB (petabytes) of logical capacity.
EMC Data Domain (DD) Boost is a software option available for all Data Domain
systems, and is a requirement for cloud-enabled infrastructures. DD Boost has two
components: a plug-in that runs on the backup server or client and a component that
runs on the Data Domain system.
Data Domain Boost Data Domain Boost for Databases and Applications (DDBDA) is an EMC product that
for Databases and enables you to perform backups and restores of DB2, SAP HANA, or SAP with Oracle
Applications database data with a Data Domain system. You can use database-specific backup
(DDBDA) and recovery tools to perform other DBA operations.
DDBDA performs backups to, and restores from, a Data Domain system by using the
DD Boost interface.
A backup to a Data Domain system takes advantage of the DD Boost feature by using
the following components:
The DD Boost library API, which enables the backup software to communicate
with the Data Domain system
The Distributed Segment Processing (DSP) component, which reviews the
data that is already stored on the Data Domain system and sends only unique
data for storage. DSP enables the backup data to be deduplicated on the
database or application host to reduce the amount of data transferred over
the network. During the restore of a backup to the client, the Data Domain
system converts the stored data to its original non-deduplicated state before
sending the data over the network.
DDBDA integration DDBDA is integrated with the SAP HANA BACKINT interface to back up and recover
with SAP HANA SAP HANA database data and redo log files. The SAP HANA BACKINT agent enables
DD Boost for SAP HANA to directly connect to the HANA database.
1. The SAP HANA database server runs the hdbbackint program, installed as part
of DDBDA, and passes a list of pipes to back up.
2. The hdbbackint program processes the SAP HANA parameters from the
configuration file and starts the child hdbbackint processes that back up the
required data.
3. The child processes send the database data and tracking information to the
Data Domain system for storage.
The following supported SAP HANA tools with DD Boost for SAP HANA can be used to
perform backup and recovery operations in an SAP HANA environment:
SAP HANA command line interface (CLI) and the hdbsql commands
SAP HANA Studio GUI
SAP HANA SAP HANA is an in-memory database that combines transactional data processing,
database analytical data processing, and application logic processing in memory. This design
enables real-time online application processing (OLAP) analysis on an online
transaction processing (OLTP) data structure.
In a distributed scale-out environment, one node (usually the first one installed) is
the master node. It handles the workload of the NetWeaver stack, the statistics and
the tables located in the row store. For a HANA scale-out database, it is a requirement
to have at least three nodes. In a two-node landscape, the table distribution and the
degree of parallel processing do not usually provide worthwhile benefits over a
single-node configuration.
Data and log information are automatically saved to disk at regular savepoints as
shown in Figure 5 and described in the following sections.
Data volume
Data in memory is asynchronously written to the persistence mechanism as a
savepoint every five minutes by default. This data volume capture method uses a
locking mechanism to prevent modification of pages while copying modified pages
and capturing open transactions before finally increasing the savepoint version and
releasing the lock.
SAP HANA on SAP HANA single host now runs in production on vSphere, the foundation of the
vSphere VMware vCloud Suite. You can combine the power of the SAP HANA in-memory
platform with vSphere to achieve faster time-to-value, better service levels, and lower
TCO for your production environments, and take the next step to the software-defined
datacenter.
For detailed information on virtualizing SAP HANA with VMware vSphere, refer to SAP
Note 1788665 and to the VMware documentation section.
A variety of techniques and tools are available to customers when migrating SAP
HANA on vSphere. In Migrating from a physical to a virtualized environment with
DDBDA, we show how to migrate SAP HANA on vSphere using Data Domain Boost for
SAP HANA.
DDBDA for SAP You can use Data Domain Boost for SAP HANA for an online backup of the entire
configuration database. After you install the DDBDA software on the operating system of the
database host, use the following steps to configure it:
1. Manually create the following subdirectory, if it does not already exist:
/usr/sap/<SID>/SYS/global/hdb/opt
2. Copy the /opt/ddbda/bin/hdbbackint file to that directory or create a
symbolic link that points from the following directory to the executable file:
/usr/sap/<SID>/SYS/global/hdb/opt/hdbbackint
3. Modify the parameter settings of the following SAP HANA template
configuration file:
/opt/ddbda/config/sap_hana_ddbda.utl
In this solution, we set the parameters as shown in Table 3.
Parameter Value Comments
DEVICE_HOST e92-dd670-2 Data Domain server hostname
Migrating from a This test scenario validates DDBDA for migrating SAP HANA from a physical to a
physical to a virtual platform with a different OS patch level.
virtualized
environment with As shown in Figure 9, we successfully restored to a virtual SAP HANA database from
the backup of a physical HANA saved on Data Domain, using DDBDA.
DDBDA
In this solution, the OS patch of the virtual SAP HANA database host is upgraded from
SUSE Linux 11 SP2 to SP3 by migration.
Note: To install SAP HANA on vSphere, follow the SAP HANA Server Installation and
Update Guide and Best Practices and Recommendations for Scale-up Deployments
of SAP HANA on VMware vSphere documentation.
Note: Follow the DDBDA for SAP configuration section to install and configure
DDBDA for virtual SAP HANA.
Test validation
You can use the following steps to validate the migration:
1. Query the record count of the physical HANA database table and the OS
version, as shown in Figure 10. You use this later to validate the results after
the virtual HANA restore.
Figure 10. Query record count of physical HANA database table and OS version
2. With HANA Studio, initiate a data backup for the physical HANA database,
select Backint for Destination Type, and type 4migration for Backup Prefix, as
shown in Figure 11. Record the backup name, because you need it next for
the virtual HANA restore.
Validation results
We verified the table records count and OS version from the recovered virtual HANA
database. Figure 13 shows the table records count matching the physical HANA with
the OS patch upgraded. The HANA database is migrated to a virtual machine with a
different OS version using DDBDA and the data is validated.
Figure 13. Validate table records count and OS version on virtual HANA
Virtual HANA This test scenario validates that DDBDA integrated with the SAP HANA BACKINT
backup and backup solution is compatible with the virtual HANA database. The testing includes:
recovery with Reducing network bandwidth requirements with DD Boost
DDBDA
Recovering virtual HANA to its most recent time
Recovering virtual HANA to a specific point-in-time
Note: Mebibyte per second (MiB/s) is a unit of data transfer rate equal to 1,048,576 bytes
per second. This is almost 5 percent more than the more commonly used megabyte per
second (MB/s), equal to 1,000,000 bytes per second.
After DD Boost is integrated with the SAP HANA BACKINT, only unique data is
transferred through the network. This significantly reduces the network bandwidth
requirements. When we performed virtual HANA backup with DD Boost, as shown in
Figure 15, the average network usage is only 0.5 MiB/s.
Test procedure
You can use the following test validation steps for your recovery:
1. Query how many table records are in the virtual HANA database after 1,000
records are inserted, as shown in Figure 16.
You use this later to validate the restore results.
2. Delete one HANA data file manually to simulate a data file crash.
3. Restart the virtual HANA database. Due to the missing data file, the restart
fails.
4. In the SAP HANA Studio recovery window, select Recover the database to its
most recent state for Specify Recover Type.
5. Specify the log backups location and select the latest available data backup.
Figure 17 shows an example of this.
Figure 17. Recover virtual HANA database to its most recent time
Test procedure
You can use the following test validation steps for your recovery:
1. Query the current table records count and timestamp. For example, as shown
in Figure 19.
2. Drop one table to simulate a database logic error, as shown in Figure 20.
3. In the SAP HANA Studio recovery window, select Recover the database to the
following point in time as the recovery type and specify the point-in-time from
step 1 in the Date and Time fields, as shown in Figure 21.
4. Specify a log backups location and select a latest available data backup.
5. Start the recover operation.
Validation results
After the recovery, we started the recovered virtual HANA database and verified the
table records count. Figure 22 shows that the table records match the values
recorded before the table records were deleted.
Figure 22. Validate table records number after database point-in-time recovery
As our tests validated, DDBDA protects the SAP HANA database with much less
network bandwidth consumption and successfully recovers databases to the most
recent time or to a specific point-in-time.
For full SAP environment protection that includes applications, such as ERP or BI,
EMC recommends that you use additional backup software to regularly back up the
operating system, instance profile, transport domain configuration file, and other
critical files that are located under the /sapmnt, /usr/sap/, and /hana/ shared file
systems.
Perform the following steps to configure an SAP HANA scheduled backup of database
data with Data Domain:
1. Set the database base parameters in the backup script, as shown in the
Appendix, and near the beginning of the script, set the following:
hana_SID: Specify the SAP HANA System ID (SID)
hana_instance: Specify the SAP HANA instance number
hdbsql_cmd: Specify the full path to hdbsql
The SAP Basis administrator can also schedule a backup of SAP Business Suite on
HANA through transaction code DB13 in the SAP GUI:
1. Select BACKINT for Destination Type in the Action Parameters tab, as shown
in Figure 24.
Test validation
The Backup Catalog in HANA Studio logs the status of the scheduled database
backups. The green squares indicate a successful backup, as shown in Figure 26.
Findings The virtual SAP HANA backup and restore with DDBDA for SAP HANA test results
presented in this paper confirm that this solution:
Reduces network utilization by up to 99 percent, because DDBDA avoids
network resource conflicts, especially for multiple virtual SAP HANA servers
on a single ESXi server environment.
Easily migrates existing physical SAP HANA databases to virtual HANA
databases for more rapid provisioning, zero downtime, and low TCO benefits.
Easily restores virtual HANA backups with the most recent or a specific point-
in-time backup, including manual and automated backups through CLI and
SAP GUI.
SAP For additional information, refer to the following on the SAP website:
documentation
SAP Note 1995460 - Single SAP HANA VM on VMware vSphere in production
SAP Note 1788665 - SAP HANA Support for VMware Virtualized Environments
SAP Note 1642148 - FAQ: SAP HANA Database Backup & Recovery
SAP Note 1844468 - Homogeneous System Copy Steps on SAP HANA
SAP HANA Administration Guide - SAP HANA Platform SPS 08
SAP HANA Server Installation and Update Guide - SAP HANA Platform SPS 08
SAP HANA Guidelines for Being Virtualized with VMware vSphere
################################################
# Specify the SAP HANA System ID, SID. For example:
# hana_SID=PV2
#
# Specify the SAP HANA instance number. For example:
# hana_instance=00
#
# Specify the full path to hdbsql, for example:
# hdbsql_cmd=/usr/sap/PV2/HDB00/exe/hdbsql
################################################
hana_SID=
hana_instance=
hdbsql_cmd=
################################################
# Specify the SAP HANA user key for backup. This key
# is created by the hdbuserstore tool. For example:
# hana_key=BACKUP
################################################
hana_key=