Data Warehousing Basic Interview Questions with Answers

A data warehouse can be considered as a storage area where interest specific or relevant data is stored irrespective of the source........
A virtual data warehouse provides a collective view of the completed data. A virtual data warehouse has no historic data.....
Stages of a data warehouse helps to find and understand how the data in the warehouse changes.
An active data warehouse represents a single state of the business. Active data warehousing considers the analytic perspectives of customers and suppliers.
Data Modeling is a technique used to define and analyze the requirements of data that supports organization’s business process.
The entity-relationship model is a method used to represent the logical flow of entities/objects graphically that in turn create a database...
Data warehousing relates to all aspects of data management starting from the development, implementation and operation of the data sets...
Dimensional modeling is a design concept which is used by designers of building data warehouses. The data is stored in two types of tables...
Snapshot refers to a complete visualization of data at the time of extraction. It occupies less space and can be used to back up and restore data quickly...
What is Data warehousing? - A data warehouse can be considered as a storage area where interest specific or relevant data is stored irrespective of the source...
Data warehousing fact & dimension tables - As mentioned, data in a warehouse comes from the transactions. Fact table in a data warehouse consists of facts..
What is ETL process in data warehousing? - ETL is Extract Transform Load. It is a process of fetching data from different sources, converting the data into a consistent and clean....
Difference between data mining and data warehousing - Data warehousing is merely extracting data from different sources, cleaning the data and storing it in the warehouse.........
What is an OLTP system and OLAP system?: Online Transaction and Processing helps and manages applications based on transactions involving high volume of data......
What are cubes? - A data cube stores data in a summarized version which helps in a faster analysis of data........
What is snow flake scheme design in database? - A snowflake Schema in its simplest form is an arrangement of fact tables and dimension tables.....
Data warehousing analysis service - Analysis service provides a combined view of the data used in OLAP or Data mining......
Data warehousing sequence clustering algorithm - Sequence clustering algorithm collects similar or related paths, sequences of data containing events....
Explain discrete and continuous data in data mining - Discreet data can be considered as defined or finite data. E.g. Mobile numbers, gender.....
Series algorithm in data mining - Time series algorithm can be used to predict continuous values of data. Once the algorithm is skilled.......
What is XMLA? - XMLA is XML for Analysis which can be considered as a standard for accessing data in OLAP, data mining or data sources.......
Data warehousing & Business Intelligence - Data Warehousing helps you store the data while business intelligence helps you to control the data for decision making.........
What is Dimensional Modeling? - Dimensional modeling is often used in Data warehousing. In simpler words it is a rational or consistent design technique.........
What is surrogate key? - ETL is Extract Transform Load. It is a process of fetching data from different sources, converting the data into a consistent and clean.........
What is the purpose of Fact less Fact Table? - Fact less tables are so called because they simply contain keys which refer to the dimension tables..........
What is a level of Granularity of a fact table? - A fact table is usually designed at a low level of Granularity. This means that we need to find the lowest level of information.......
Data warehousing star & snowflake - A snow flake schema design is usually more complex than a star schema. In a star schema a fact table is surrounded......
What is the difference between view and materialized view? - A view is created by combining data from different tables. Hence, a view does not have data of itself....Materialized view usually used in data warehousing has data........
What is a Cube and Linked Cube - A data cube stores data in a summarized version which helps in a faster analysis of data. Where as linked cubes use the data cube.........
Data warehousing junk dimension - In scenarios where certain data may not be appropriate to store in the schema, this data (or attributes) can be stored in a junk dimension..........
What are fundamental stages of Data Warehousing? - Stages of a data warehouse helps to find and understand how the data in the warehouse changes..........
What is Virtual Data Warehousing? - The aggregate view of complete data inventory is provided by Virtual Warehousing.........
What is active data warehousing? - The transactional data captured and reposited in the Active Data Warehouse........
Difference between dependent and independent data warehouse - Dependent data ware house are build ODS........
What is data modeling and data mining? What is this used for? - Designing a model for data or database is called data modelling.........
Difference between ER Modeling and Dimensional Modeling - Dimensional modelling is very flexible for the user perspective.........
What is snapshot with reference to data warehouse? - A snapshot of data warehouse is a persisted report from the catalogue........
What is degenerate dimension table? - The dimensions that are persisted in the fact table is called dimension table........
What is Data Mart? - Data Mart is a data repository which is served to a community of people......
Difference between metadata and data dictionary - Metadata describes about data. It is ‘data about data’. It has information about how and when......
Various methods of loading Dimension tables - The following are the methods of loading dimension tables........
Difference between OLAP and data warehouse - The following are the differences between OLAP and data warehousing.......
Foreign key columns in fact table and dimension table - The primary keys of entity tables are the foreign keys.......
What is cube grouping? - A transformer built set of similar cubes is known as cube grouping.......
Define the term slowly changing dimensions (SCD) - Slowly changing dimension target operator is one of the SQL warehousing operators......
"What is a Star Schema? - The simplest data warehousing schema is star schema........
Differences between star and snowflake schema - Star Schema: A de-normalized technique in which one fact table is associated with several dimension tables.......
se of lookup tables and Aggregate tables - At the time of updating the data warehouse, a lookup table is used.......
What is real time data-warehousing? - The combination of real-time activity and data warehousing is called real time warehousing.......
What is conformed fact? What is conformed dimensions use for? - Allowing having same names in different tables is allowed by Conformed facts......
Define non-additive facts - The facts that can not be summed up for the dimensions present in the fact table are called non-additive facts......
Define BUS Schema - A BUS schema is to identify the common dimensions across business processes......
Difference between SAS tool and other tools - The differences between SAS and other tools are......
Why is SAS so popular? - Statistical Analysis System is an integration of various software products which allows the developers to perform......
What is data cleaning? How can we do that? - Data cleaning is also known as data scrubbing. Data cleaning is a process......
Explain in brief about critical column - A column (usually granular) is called as critical column which changes the values over a period of time......
What is data cube technology used for? - Data cube is a multi-dimensional structure. Data cube is a data abstraction to view aggregated data from a number of perspectives.......
Data Scheme is a diagrammatic representation that illustrates data structures and data relationships to each other in the relational database within the data warehouse.....
Bitmap indexes make use of bit arrays (bitmaps) to answer queries by performing bitwise logical operations......
In hierarchical, networked or relational databases, the data can be extracted, cleansed and transferred in two directions. The ability of a system to do this is refered to as bidirectional extracts.....
Data collection frequency is the rate at which data is collected. However, the data is not just collected and stored......
Cardinality is the term used in database relations to denote the occurrences of data on either side of the relation.....
In Chain Data Replication, the non-official data set distributed among many disks provides for load balancing among the servers within the data warehouse.....
Key areas of activity in which favorable results are necessary for a company to reach its goal.....