Dimension modeling and Data normalization |
|
|
A significant difference of dimension modeling when compared to transaction data modeling is the normalization process. While OLTP stresses on need for normalization, OLAP database emphasises on retrieval performance and hence allows data redundancy. This article helps to understand what is normalization and what is dimension modeling. Data NormalizationThis is a technique used in data modeling that emphasizes on avoiding storing the same data element at multiple places. We follow the 3 rules of normalization called the First Normal Form, Second Normal Form, and Third Normal Form to achieve a normalized data model.
Theoretically We have further rules called the Boyce-Codd Normal Form, Fourth Normal Form and the Fifth Normal form. In practice we don’t use the rules beyond 3NF. Dimensional ModelingThis is the form of data modeling used in data warehousing where we store business related attributes in tables called Dimension Tables and the business related metrics/measures in tables called Fact Tables. The business metrics are associated with the business attributes by relating the dimension tables with the fact table. The dimension tables are not related to one another. |
Free tutorials on fundamental concepts of business intelligence and data warehousing: