WAREHOUSING
►Subject Oriented
►Non Volatile
►Integrated
►Time Variant
Subject Oriented
• Example for an insurance company:
Applications Area: Auto and Fire Policy Processing Systems; Commercial and Life Insurance Systems; Claims Processing System; Accounting System; Billing System
Data Warehouse (data organized by subject): Customer; Policy; Losses; Premium
Integrated
• Data is stored once in a single integrated location
(e.g. insurance company)
[Figure: customer data stored in several databases (Auto Policy Processing System, Fire Policy Processing System, and the FACTS, LIFE, Commercial, and Accounting applications) is brought together in the Data Warehouse Database under Subject = Customer.]
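The integration idea can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the three source extracts, their field names, the gender codes, and the helper `integrate_customers` are all hypothetical and do not come from the slides.

```python
# Hypothetical extracts from three operational systems; each uses its own
# field names and formats for the same customer.
auto_policy_rows = [{"cust_id": 101, "name": "A. Rao", "sex": "M"}]
fire_policy_rows = [{"customer_no": 101, "cust_name": "A. Rao", "gender": "male"}]
life_rows = [{"cid": 102, "full_name": "B. Singh", "gender": "F"}]

# Assumed mapping from source-specific codes to one warehouse code.
GENDER_CODES = {"m": "M", "male": "M", "f": "F", "female": "F"}

def integrate_customers(*sources):
    """Merge source rows into one customer record per customer id.

    Each source is a (rows, id_field, name_field, gender_field) tuple.
    """
    warehouse = {}
    for rows, id_field, name_field, gender_field in sources:
        for row in rows:
            cust_id = row[id_field]
            # Store each customer once, in a single agreed format.
            warehouse.setdefault(cust_id, {
                "customer_id": cust_id,
                "name": row[name_field],
                "gender": GENDER_CODES[row[gender_field].lower()],
            })
    return warehouse

customers = integrate_customers(
    (auto_policy_rows, "cust_id", "name", "sex"),
    (fire_policy_rows, "customer_no", "cust_name", "gender"),
    (life_rows, "cid", "full_name", "gender"),
)
```

Customer 101 appears in two source systems but ends up as a single record under the Customer subject.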
Time-Variant
• Data is stored as a series of snapshots or views which record how it is collected across time.
[Figure: Data Warehouse Data, where the Key of each record includes Time alongside the Data.]
Data is available on-line for long periods of time for trend analysis and
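A small Python sketch may make the snapshot idea concrete. It assumes a hypothetical policy whose premium changes once a year; the names `record_snapshot` and `premium_as_of` and all sample values are illustrative only.

```python
from datetime import date

# Each row is a snapshot: the business key plus the date it was captured.
snapshots = []

def record_snapshot(policy_id, premium, as_of):
    """Append a new snapshot instead of replacing the previous value."""
    snapshots.append({"policy_id": policy_id, "as_of": as_of, "premium": premium})

def premium_as_of(policy_id, when):
    """Return the premium from the latest snapshot taken on or before `when`."""
    candidates = [s for s in snapshots
                  if s["policy_id"] == policy_id and s["as_of"] <= when]
    if not candidates:
        return None
    return max(candidates, key=lambda s: s["as_of"])["premium"]

record_snapshot("P-1", 500.0, date(2020, 1, 1))
record_snapshot("P-1", 550.0, date(2021, 1, 1))
record_snapshot("P-1", 610.0, date(2022, 1, 1))

# Trend across time: every historical value is still available.
trend = [(s["as_of"].year, s["premium"])
         for s in snapshots if s["policy_id"] == "P-1"]
```

Because time is part of the key, a question such as "what was the premium in mid-2021?" can still be answered after later snapshots arrive.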
Non-Volatile
• Existing data in the warehouse is not overwritten or updated.
[Figure: data flows from External Sources and Production Databases into the Data Warehouse Database.]
Production Applications:
• Update
• Insert
• Delete
Data Warehouse Environment:
• Load
• Read-Only
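One way to picture non-volatility is a table that only accepts loads and reads. The class below is a hypothetical sketch, not a real warehouse API: `WarehouseTable` and its methods are names invented for this illustration.

```python
class WarehouseTable:
    """Append-only table: rows can be loaded and read, never changed."""

    def __init__(self):
        self._rows = []

    def load(self, rows):
        # Loading adds new rows; existing rows are left untouched.
        self._rows.extend(dict(row) for row in rows)

    def read(self):
        # Return copies so callers cannot modify stored rows in place.
        return [dict(row) for row in self._rows]

    def update(self, *args, **kwargs):
        raise PermissionError("warehouse data is not updated in place")

    def delete(self, *args, **kwargs):
        raise PermissionError("warehouse data is not deleted")

table = WarehouseTable()
table.load([{"claim_id": 1, "amount": 1200.0}])
table.load([{"claim_id": 2, "amount": 300.0}])

rows = table.read()
rows[0]["amount"] = 0.0  # changes only the caller's copy
```

The stored rows are unaffected by the caller's edit, and any attempt to call `update` or `delete` raises an error.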
OLAP
Online Analytical Processing
A model through which users can slice and dice data.
For analytical and reporting purposes.
Faster response.
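"Slice and dice" can be illustrated with plain Python lists. In this sketch a slice fixes one dimension to a single value, and a dice keeps a sub-cube by restricting several dimensions at once. The fact rows and the helpers `slice_cube` and `dice_cube` are assumptions made for the example.

```python
# A tiny "cube": each fact has three dimensions and one measure.
facts = [
    {"product": "Auto", "region": "East", "year": 2021, "premium": 100},
    {"product": "Auto", "region": "West", "year": 2021, "premium": 150},
    {"product": "Fire", "region": "East", "year": 2021, "premium": 80},
    {"product": "Fire", "region": "East", "year": 2022, "premium": 90},
    {"product": "Life", "region": "West", "year": 2022, "premium": 200},
]

def slice_cube(rows, dimension, value):
    """Slice: fix one dimension to a single value."""
    return [r for r in rows if r[dimension] == value]

def dice_cube(rows, **allowed):
    """Dice: keep rows whose dimensions fall in the given value sets."""
    return [r for r in rows
            if all(r[dim] in values for dim, values in allowed.items())]

east = slice_cube(facts, "region", "East")
sub_cube = dice_cube(facts, product={"Auto", "Fire"}, year={2021})

east_total = sum(r["premium"] for r in east)
sub_cube_total = sum(r["premium"] for r in sub_cube)
```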
Example: OLTP Model and OLAP Model
Differences:
OLTP: "Transaction Based Process" systems.
OLAP: "Warehouse Based Process"; decision support for management use.
[Figure: data moves from the OLTP systems to the warehouse by Batch Load, with Summarize & Refine and Transform steps.]
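The batch-load path from transaction systems to the warehouse can be sketched as three small functions. This is a simplified illustration, assuming hypothetical transaction rows with text fields; the names `transform`, `summarize`, and `batch_load` mirror the labels in the figure but are otherwise invented here.

```python
from collections import defaultdict

# Hypothetical transactions captured by an operational (OLTP) system.
transactions = [
    {"policy": "auto", "date": "2022-01-03", "amount": "120.50"},
    {"policy": "AUTO", "date": "2022-01-17", "amount": "80.00"},
    {"policy": "fire", "date": "2022-02-02", "amount": "200.00"},
]

def transform(row):
    """Transform: standardize codes and convert text to typed values."""
    return {
        "policy": row["policy"].strip().upper(),
        "month": row["date"][:7],  # keep year and month only
        "amount": float(row["amount"]),
    }

def summarize(rows):
    """Summarize: total amount per (policy, month)."""
    totals = defaultdict(float)
    for row in rows:
        totals[(row["policy"], row["month"])] += row["amount"]
    return dict(totals)

def batch_load(source_rows, warehouse):
    """Batch load: process the whole extract, then add it to the warehouse."""
    summary = summarize(transform(r) for r in source_rows)
    for key, total in summary.items():
        warehouse[key] = warehouse.get(key, 0.0) + total
    return warehouse

warehouse = batch_load(transactions, {})
```

The warehouse ends up holding one summarized row per policy and month rather than the individual transactions.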
DWH Life Cycle
Business Analyst
Data Modeler
ETL Developer
Report Developer
Testing
DWH Architecture
Ralph Kimball
Bottom-up approach
Data marts are first created to provide reporting and analytical capabilities for
specific business processes.
Data marts contain atomic data and, if necessary, summarized data.
These data marts can eventually be unioned together to create a comprehensive
data warehouse.
Bill Inmon
Top-down approach
Data warehouse as a centralized repository for the entire enterprise.
Data warehouse is designed using a normalized enterprise data model.
"Atomic" data, that is, data at the lowest level of detail, are stored in the data
warehouse.
Dimensions & Measures
A data warehouse consists of dimensions and measures.
Dimensions allow data analysis from various perspectives. For example, a Product
dimension could help you see which products bring in the most revenue.
Measures are numeric representations of a set of facts that have
occurred. Examples of measures include dollars of sales.
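The relationship between a dimension and a measure can be shown with a short aggregation. The sketch below assumes a handful of made-up sales facts and a hypothetical helper `total_by`; it totals the "dollars of sales" measure along the Product dimension.

```python
from collections import defaultdict

# Each fact row carries dimension values (product, region) and a measure
# (sales in dollars). All values are made up for illustration.
sales_facts = [
    {"product": "Auto", "region": "East", "sales_dollars": 1000.0},
    {"product": "Fire", "region": "East", "sales_dollars": 400.0},
    {"product": "Auto", "region": "West", "sales_dollars": 700.0},
    {"product": "Life", "region": "West", "sales_dollars": 1500.0},
]

def total_by(rows, dimension, measure):
    """Sum a measure for each distinct value of a dimension."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[dimension]] += row[measure]
    return dict(totals)

revenue_by_product = total_by(sales_facts, "product", "sales_dollars")
top_product = max(revenue_by_product, key=revenue_by_product.get)
```

Passing "region" instead of "product" gives the same measure from a different perspective, which is the role a dimension plays.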
Dimensional Data Modeling
► Conceptual Modeling
► Logical Modeling
► Physical Modeling
Before starting to implement the schema design, a
data modeler should understand the following
process: