DATA WAREHOUSES
DATA WAREHOUSES
= central repositories of ==integrated data== that stores current and historical data in ==one place== optimised for analytics previous model:
- data scored in numerous separate databases within one organization
- these databases were optimised for storage and retrieval
- tables with minimum redundancy and smallest no. of fields make updates fast, but requires multiple operations to extract useful analytics.
DATA MARTS
=subset of data stored in a data warehouse orientated for a specific business need.
DATA WAREHOUSES VS DATA MARTS
SAME:
- designed for structured data clearly defined data types easily searchable
- and speed of access = read-only
DIFFERENCES
differences | DATA WAREHOUSE | DATA MART |
---|---|---|
stores data from | multiple subject areas | one subject area |
contains | detailed info | detailed info + summaries |
integrates | all data sources | only info for a given subject |
uses a dimensional model | not necessarily | yes, e.g. Star schema, snowflake |