REAL TIME DATA WAREHOUSING
Presented by
Yash Mishra
Ronit Mukherjee
WHAT IS REAL TIME DATA WAREHOUSING?
• Real Time Data Warehousing is the historical and analytic component of an enterprise-level data
stream. In other words, data moves straight from the originating source to every use that does not
require some form of staging, and this movement takes place soon after the original data is written.
A Real Time Data Warehouse can therefore be defined as a system that reflects the current
characteristics and the actual situation of the organization.

• The data warehouse component contains both a real-time and a static partition. The real-time partition
receives its data one instance at a time from a transformation engine. At regular intervals, consistent
subsets of the real-time data are written to the static partition. The static partition maintains the
history and serves as the base for supplying dependent data marts with periodic data.
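
The two-partition idea can be illustrated with a small in-memory sketch. This is a toy model, not a description of any particular product: the class `TwoPartitionWarehouse`, its method names, and the sample rows are all hypothetical, and the "consistent subset" is simplified to "everything ingested so far".

```python
from dataclasses import dataclass, field
from typing import Any

Row = dict[str, Any]


@dataclass
class TwoPartitionWarehouse:
    """Toy model: a real-time partition flushed periodically into a static one."""
    realtime: list[Row] = field(default_factory=list)
    static: list[Row] = field(default_factory=list)

    def ingest(self, row: Row) -> None:
        # The transformation engine delivers one instance at a time.
        self.realtime.append(row)

    def flush(self) -> int:
        # At a regular interval, move a consistent subset into history.
        # Assumption: the subset is simply every row ingested so far.
        moved = len(self.realtime)
        self.static.extend(self.realtime)
        self.realtime.clear()
        return moved

    def query(self) -> list[Row]:
        # Analytic queries see the history plus the freshest rows.
        return self.static + self.realtime


wh = TwoPartitionWarehouse()
wh.ingest({"order_id": 1, "amount": 20.0})
wh.ingest({"order_id": 2, "amount": 35.5})
moved = wh.flush()                      # periodic write to the static partition
wh.ingest({"order_id": 3, "amount": 12.0})
total = sum(r["amount"] for r in wh.query())
```

In a real system the flush would run on a schedule and the static partition would feed the dependent data marts; here it is called by hand to keep the sketch short.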
CHALLENGES OF REAL TIME DATA WAREHOUSING

• Challenge 1: Enabling real-time ETL

• Challenge 2: Modelling real-time fact tables

• Challenge 3: OLAP queries vs. changing data

• Challenge 4: Scalability and query contention

• Challenge 5: Real-time alerting
REQUIREMENT FOR REAL TIME DATA WAREHOUSING
Adopting real-time data warehousing means coping with at least two radical changes in data state.
First, continuous data integration requires continuous data update actions, which should mostly be
row insertions. Second, these update actions must be performed in parallel with the execution of OLAP,
which, due to its new real-time nature, will probably be requested more often. This leads to three
main goals:

• Maximizing the freshness of data by efficiently and rapidly integrating the most recent OLTP data
into the data warehouse;

• Minimizing OLAP response time while simultaneously performing continuous data integration;

• Maximizing the data warehouse’s availability by reducing its update time window, in which users
and OLAP applications are off-line.
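
One way to picture these three goals together is sketched below with Python's built-in `sqlite3` module. It is an illustration under stated assumptions, not a prescribed method: the table names `sales_fact` and `sales_recent`, the helper functions, and the sample rows are hypothetical. New rows are appended to a small staging table with no keys or indexes, OLAP queries read history and staging together, and a single short transaction later merges the staged rows.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE sales_fact (sale_id INTEGER PRIMARY KEY, amount REAL);
    -- Staging table: no keys or indexes, so row insertions stay cheap.
    CREATE TABLE sales_recent (sale_id INTEGER, amount REAL);
    INSERT INTO sales_fact VALUES (1, 100.0), (2, 50.0);
""")


def integrate(rows):
    # Continuous integration: append-only row insertions.
    with conn:
        conn.executemany("INSERT INTO sales_recent VALUES (?, ?)", rows)


def olap_total():
    # The OLAP query sees history plus the most recent rows.
    cur = conn.execute("""
        SELECT SUM(amount) FROM (
            SELECT amount FROM sales_fact
            UNION ALL
            SELECT amount FROM sales_recent
        )
    """)
    return cur.fetchone()[0]


def merge_recent():
    # Short update window: one transaction moves staged rows into the fact table.
    with conn:
        conn.execute(
            "INSERT INTO sales_fact SELECT sale_id, amount FROM sales_recent"
        )
        conn.execute("DELETE FROM sales_recent")


integrate([(3, 25.0), (4, 10.0)])
fresh_total = olap_total()      # includes rows that are not yet merged
merge_recent()
merged_total = olap_total()     # same answer after the merge
staged_left = conn.execute("SELECT COUNT(*) FROM sales_recent").fetchone()[0]
```

The design choice here is that freshness comes from the staging table, while the fact table is only touched during the brief merge, which keeps the update window small.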
BENEFITS OF REAL TIME DATA WAREHOUSING
1. FASTER DECISIONS: Make decisions more quickly, based on more current, more accurate, and
transactionally consistent data. Why wait until tomorrow to make a decision you can make today, if you
have the data?
2. IDEAL DATA LOAD: Lower the load on the data source. Many organizations, especially those
operating 24×7, have a hard time identifying an ideal data load window. Low-impact, log-based change
data capture (CDC) eliminates the need for one.
3. FASTER RECOVERY: Recover more quickly from data transformation or load issues. If a nightly
batch job fails, recovery may have to wait for the next batch window, whereas more frequent updates
allow quicker intervention.
4. NO BATCH WINDOWS: Eliminate the batch window, which requires the source database and possibly
the data warehouse to be dormant during the load so that (temporarily) inconsistent data is not reflected
in queries.
5. OPTIMIZATION: Make better use of the powerful data warehouse or data mart database by running
transformations in the database rather than in a separate run-time environment.
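
The last benefit, running transformations inside the database, can be sketched as follows. This is a minimal example using Python's built-in `sqlite3` module as a stand-in for the warehouse database; the tables `raw_orders` and `region_revenue` and the sample data are hypothetical. The aggregation and unit conversion happen in a single SQL statement executed by the database engine, so no rows are pulled into a separate run-time environment.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE raw_orders (order_id INTEGER, region TEXT, amount_cents INTEGER);
    CREATE TABLE region_revenue (region TEXT PRIMARY KEY, revenue REAL);
    INSERT INTO raw_orders VALUES
        (1, 'north', 1250), (2, 'south', 800), (3, 'north', 2750);
""")

# The transformation (cents to currency units, grouped by region) runs
# entirely inside the database as one INSERT ... SELECT statement.
with conn:
    conn.execute("""
        INSERT INTO region_revenue (region, revenue)
        SELECT region, SUM(amount_cents) / 100.0
        FROM raw_orders
        GROUP BY region
    """)

revenue = dict(conn.execute("SELECT region, revenue FROM region_revenue"))
```

The alternative would be to fetch every raw row into the application, aggregate it there, and write the result back, which moves more data and leaves the database engine idle.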