Here is a basic process in Business Intelligence.
Maybe some people will be confused, why we need data warehouse?
Without data warehouse, we can also analyze the data.
We can get the data and create the report directly.
So, what the benefits of data warehouse in an organization?
Here we list 4 reasons why we need data warehouse.
- Integrate data from various data sources and centralize the data into one place.
- Have data loaded into data warehouse so that reporting won’t impact live system or database.
That is why we have a seperate data warehouse and the data is stored in the data warehouse.
We can make a scheduled job running at night to centralize the data from operational databases to data warehouse.
- Easy access (one place of data and single source of truth).
It is easy for people to go to data warehouse to get the data, and they don’t need to worry other problems, e.g., we have so many data sources and where can I get the data?
We can trust the data warehouse where we can get the data.
- Build model: choose the best design model to get the best flexibility and performance, especially for those large datasets.
We usually use kimball methodology – star schema/snowflake schema (de-normalization).
For example, we use the star schema to improve the query performance.
There are also some other methodologies we can use in data warehouse, e.g., inmon methodology, datavault methodology.
If you are interested in or have any problems with data warehouse, feel free to contact me.
Or you can connect with me through my LinkedIn.