Professional Documents
Culture Documents
A data warehouse is a subject-oriented, integrated, time-variant, and nonupdatable collection of data in support of managements decision-making process.
Subject-Oriented
High level Entities like Customers, Patients, Students, Products and time. Data gathered from several internal system of records or from sources external to the organization.
Integrated
Time-Variant
Time dimension is used in Data Warehousing to study the trends and changes.
Nonupdatable
New data is always added as a supplement to DB, rather than replacement. The DB continually absorbs this new data, incrementally integrating it with previous data.
In Simple Words A data warehouse is simply a single, complete, and consistent store of data obtained from a variety of sources and made available to end users in a way they can understand and use it in a business context.
Scientific Databases
Digital Libraries
Integration System
Personal Databases
Collects and combines information Provides integrated view, uniform user interface Supports sharing
Why a Warehouse
For analysis and decision support, end users require access to data captured and stored in an organizations operational or production systems. This data is stored in multiple formats, on multiple platforms, in multiple data structures, with multiple names, and probably created using different business rules
IMS
Mainframe Applications
M anage me nt Re porting Sale s/M arke ting Custome r Re lations Re se rv e Analysis Risk Analysis
DB2/2
PC Applications
???
Extract Programs Data Cleansers/Scrubbers Translators/Transformers Timing Tools Data Loading File Transfer
Reserv es
Customers
Rates
Policies
External Sources
Claims Premiums
DB/6000
Midrange
DB/400
auditing data warehouse usage to provide user chargeback information replicating, subsetting, and distributing data maintaining effient data storage management archiving and backing-up data implementing recovery following failure security management
In computers, the path of data from source document to data entry to processing to final reports. Data changes format and sequence (within a file) as it moves from program to program. Is known as Data flow
Data Flow
Inflow- The processes associated with the extraction,
cleansing, and loading of the data from the source systems into the data warehouse.
Architectures
Many database architectures has been implemented 2 architectures need to be quoted: 1. 2. OLTP (OnLine Transaction Processing) Data Warehouse (OLAP)(online analytical processing) OLTP is used to store data and query it frequently and is based on normalized schemas. Data warehouse is used to store data history and is based on fact tables and dimension tables.
Special Thanks to
Google.com
and other sites.
Thank You