Professional Documents
Culture Documents
GE Confidential
TCS Internal
DATA WAREHOUSE
Data Warehouse is
Subject-Oriented
Integrated
Time-Variant
Non-volatile
collection of data in support of managements decision.
GE Confidential
TCS Internal
DW Components
Metadata Layer
Extraction
FS1
FS2
.
.
.
Transmission
FSn
N
E
T
W
O
R
K
Legacy System
GE Confidential
Cleansing
S
T
A
G
I
N
G
Transformation
Data Mart
Population
Aggregation
Summarization
ODS
DM1
DW
DM2
DMn
A
R
E
A
OLAP ANALYSIS
Knowledge Discovery
TCS Internal
TCS Internal
TCS Internal
Financial
Transactions
Time
Customer
GE Confidential
Organization
Product
TCS Internal
Contd.
Product
Financial
Transactions
Channel
Organization
Customer
Segment
GE Confidential
Geography
TCS Internal
Contd.
Dimension1
Fact1
Dimension4
Dimension3
Fact2
Dimension5
Dimension6
GE Confidential
Fact3
Dimension7
TCS Internal
TCS Internal
GE Confidential
TCS Internal
Types of Facts
There are three types of facts:
Additive: Additive facts are facts that can be summed up through all of the
dimensions in the fact table.
Semi-Additive: Semi-additive facts are facts that can be summed up for some of
the dimensions in the fact table, but not the others.
Non-Additive: Non-additive facts are facts that cannot be summed up for any of
the dimensions present in the fact table.
GE Confidential
TCS Internal
GE Confidential
TCS Internal
TCS Internal
TCS Internal
GE Confidential
TCS Internal
1.
2.
3.
At this level, the data modeler will specify how the logical data
model will be realized in the database schema.The steps for
physical data model design are as follows:
Convert entities into tables.
Convert relationships into foreign keys.
Convert attributes into columns.
Modify the physical data model based on physical constraints /
requirements.
GE Confidential
TCS Internal
GE Confidential
TCS Internal
Dimensional Hierarchy
Geography Dimension
World Level
America
Europe
Asia
Pa
re
nt
Continent Level
Re
la
tio
n
World
Country Level
USA
State Level
FL
City Level
Miami
Canada
GA
VA
Tampa
CA
Orlando
Attributes: Population,
Tourists Place
GE Confidential
TCS Internal
Argentina
WA
NaplesDimension
Member / Business
Entity
Dimensional Modeling
STEP 1
Identify Subjects (Dimensions)
Identify Hierarchies of a Dimension
Identify Attributes of levels in Hierarchies
Define Grain
Country
Industry Segment
State
Industry Type
City
Fin. Class
Customer
GE Confidential
TCS Internal
Contd.
Dimensional Modeling
STEP 2
Use KPIs to identify the Facts
Group the Facts in a logical set
Financial
Transactions
Trans. Amount
No. of Bonds
No. of Transactions
Service Cost
...
GE Confidential
Non-Financial
Transactions
...
TCS Internal
Contd.
Dimensional Modeling
STEP 3
Link the Group of Facts to the Dimensions that
participate in the Facts
Customer
Time
Product
Financial
Transactions
Channel
GE Confidential
TCS Internal
Organization
Dimensional Modeling
STEP 4
Define Granularity for each Group of Facts
Customer
(Customer)
Time
(Day-Hour)
Product
(Scheme)
Financial
Transactions
Channel
(Channel)
GE Confidential
TCS Internal
Organization
(Branch)
Legacy
Select
Client/
Server
Extract
Transform
OLTP
Integrate
External
Maintain
Metadata
Repository
Data Mart
DATA
WAREHOUSE
Data Mart
Operational Systems
Enterprise wide Data
GE Confidential
TCS Internal
A
P
I
U
S
E
R
S
Select
Data Mart
Metadata
Repository
Extract
Transform
Data Mart
DATA
WAREHOUSE
Integrate
Maintain
Data Mart
Data Preparation
Operational Systems
Data
GE Confidential
TCS Internal
A
P
I
U
S
E
R
S
Select
Extract
Transform
OLTP
External
Data Mart
Data Mart
Integrate
Maintain
Data Mart
Data Preparation
Operational Systems
Data
GE Confidential
TCS Internal
A
P
I
U
S
E
R
S
Data Marts
Legacy
Client/
Server
Select
Extract
Transform
OLTP
External
Metadata
Repository
DATA MART
Integrate
Maintain
Data Preparation
Operational Systems
Data
GE Confidential
TCS Internal
A
P
I
U
S
E
R
S