You are on page 1of 26

An Introduction to

Data Warehousing and


Business Intelligence

By James Sabarimuthu
Cognizant DWBI & PM
Do You know...?
• UNIX / Linux / Windows
• Databases and its types
• Application
• Forms
• ER - Model
• Batch process
Learning Offerings...
• Concepts
• Products
• Solutions
• Consulting
• Services
What is DW?

• “A copy of transaction data specifically


structured for querying and analysis”
- Ralph Kimball
More about DW
• Highly Integrated
• Historical and Non volatile
• Contains Raw and Summary data
• For Decision making process
• Usually huge and big
What is BI?

• Business Intelligence is the technology


infrastructure for supporting the
business decision making process
using maximum information from data
warehouse
DW Databases
• Oracle 11g
• ESSbase - Multi Dimensional Analysis
• Exadata - Extreme performance
• Teradata - For Big Data Analysis(Aster
data)
• IBM Netezza - Datawarehouse
Appliance
• Endeca - A hybrid search-analytical DB
Foo’s Form Data
Parcel ID From Address To Address Sender Sender Phone Bill Payee Invoice ID Shipment Amt Employee ID Received Date

10001 Chennai Address FK 5 Kumar 9442222222 Kumar Invoice 1 95 12003 Mar 4, 2011

10002 Coimbatore Address FK 6 Ram +91 983333333 Ram Invoice 1 200 12003 Mar 4, 2011

10003 Trichi Address FK 10 Bob 044 22323453 Jim Invoice 3 30 30003 Mar 20, 2011

10004 Address FK 1 Address FK 11 Sender FK 1 9999999999 Sender FK 1 Invoice 4 50 10030 May 1, 2011

10005 Address FK 2 Address FK 12 Sender FK 2 9999999999 Sender FK 2 Invoice 5 30 20003 Apr 5, 2011

10006 Address FK 3 Address FK 13 Sender FK 3 Sender FK 3 Invoice 6 60 40042 May 4, 2011

10007 Address FK 4 Address FK 1 Sender FK 4 9999999999 Sender FK 15 Invoice 7 400 12004 Jun 6, 2011

10008 Address FK 5 Address FK 15 Sender FK 5 9999999999 Sender FK 17 Invoice 7 55 12004 Apr 3, 2011

10009 Address FK 6 Address FK 5 Sender FK 6 9999999999 Sender FK 30 Invoice 7 655 12004 Jun 2, 2011
Facts and
Date SK Date
Dimensions
Month Quarter Year
20110304 Mar 4, 2011 March 1 2011
20110320 Mar 20, 2011 March 1 2011
20110405 Apr 5, 2011 April 2 2011
20110504 May 4, 2011 May 2 2011
20110501 May 1, 2011 May 2 2011
20110606 Jun 6, 2011 June 2 2011
20110602 Jun 2, 2011 June 2 2011

Date SK Employee SK Parcel Count Total Amount


20110304
20110320
10000001
10000002
3
1
325
30 Star Schema
20110501 10000002 1 50
20110504 10000003 1 60
20110606 10000004 2 1055
20110602 10000004 1 55

Employee Employee ID Employee Store Store


SK Name Location

10000001 12003 Kavitha Murugan Traders Chennai

10000002 30003 Raju Store Name 1 Trichi

10000003 40042 Kevin Store Name 2 Coimbatore

10000004 12004 Geetha Store Name 3 Madurai


Business Intelligence
DW Architecture
OLTP Apps
OLAP
Data -1 Data
Analysis
Warehouse

OLTP Apps ETL Raw Data BI


Data 2 Summary Data Tool Reports
Meta Data
Data Marts
Third Party Data Mining
Data
Star Schema
Snowflake Schema
Data Integration tools
• Informatica PowerCenter
• Informatica PowerExchange
• IBM Datastage
• Oracle Data Integrator(ODI)
• Microsoft SSIS
• Endaca - Latitude Information
Integration Suite
BI Tools
• Oracle Business Intelligence (OBIEE)
• Oracle Hyperion Performance Suit
• IBM Cognos
• BI Publisher(BIP)
• SAS BI
• SAP Business Objects
• Actuate
• Microsoft SSRS
Information Lifecycle
Management
• Data Archival
• Data Integration
• Data Cleansing
• Data Quality
• Data Purging
• Test Data management
Data Quality
Management
• Data Cleansing
• Data Profiling
• Data Standardization (Rules Engine)
• DQ Monitoring
• Location Intelligence
• Trillium Software (TS Quality, TS
Director...)
Master Data
Management
• Master of all data
• Need of the hour for big companies
• Multidomain MDM
• Reduce Cost of ownership
• Achieve single source of truth
Test Data
Management

• Data Subsets
• Data Masking
• Data Builder
Major Challenges

• ‘I have all data from existing sources


dumped into one place. But i don’t know
what is correct and believable!!!’
Major Challenges

• Merger of two company results in data


which are not matching with each other
and result in integration issue.
Major Challenges

• Non Standard development results in


maintenance nightmare
Major Challenges

• Latency in receiving the data is always


overlooked while developing the system
which results in incorrect information
DW & BI Jobs
• ETL / DI Specialist (A Tool Expert)
• BI Specialist (A Tool Expert)
• Data Analyst / Data Modeler
• DW Application Database Administrator
• Infrastructure Administrator
• DW and BI Leads
• Data Stewards
Famous Authors

• Ralph Kimball
• W. H. Inmon
Websites
• Forrester.com
• Gartner.com
• Dwinfocenter.org
• Sites of Oracle, Teradata, Netezza,
Endeca, Informatica, Trillium Softwares
etc...

You might also like