Data Warehousing Questions

Q:

What are the different models used in cluster analysis?

Answer

There are many algorithms that can be used to analyze the database to check the maintenance of all the data sets that are already present. The different types of cluster models include as follows:


- Connectivity models: these are the models that connect one cluster to another cluster. This includes the example of hierarchical clustering that is based on the distance connectivity of one model to another model. 


- Centroid models: these are the models that are used to find the clusters using the single mean vector. It includes the example of k-means algorithm.


- Distribution models: it includes the specification of the models that are statistically distributed for example multivariate normal distribution model.


- Density models: deals with the clusters that are densely connected with one another in the regions having the data space. 


- Group models: specifies the model that doesn’t provide the refined model for the output and just gives the grouping information

Report Error

View answer Workspace Report Error Discuss

0 1238
Q:

Define what is a Fact, Dimension and Measure ?

Answer

Fact is key performance indicator to analyze the business.From the additive values, which are used to analyse the business by using report generation. Dimension is used to analyze the fact, are the non-additive ( text format) values, Which are used to drill down reports.Without dimension there is no meaning for fact. Measure is the quantity of data we create to process.

Report Error

View answer Workspace Report Error Discuss

5 1181
Q:

What are fact tables and dimension tables?

Answer

Business facts or measures and foreign keys are persisted in fact tables which are referred as candidate keys in dimension tables. Additive values are usually provided by the fact tables which acts as independent variables by which dimensional attributes are analyzed.


Attributes that are used to constrain and group data for performing data warehousing queries are persisted in the dimension tables.


 

Report Error

View answer Workspace Report Error Discuss

0 1153
Q:

What are the prime responsibilities of Data Integration Administrator?

Answer

- Scheduling and executing the batch jobs.


- Configuring, starting and stopping the real-time services


- Adapters configuration and managing them.


- Repository usage, Job Server configuration.


- Access Server configuration.


- Batch job publishing.


- Real-time services publishing through web services.

Report Error

View answer Workspace Report Error Discuss

0 1127
Q:

Explain about various caches available in Data Integrator

Answer

  •  NO_CACHE – It is used for not caching values.

  •  PRE_LOAD_CACHE – Result column preloads and compares the column into the memory, prior to executing the lookup.

  •  PRE_LOAD_CACHE is used when the table can exactly fit in the memory space.

  •  DEMAND_LOAD_CACHE – Result column loads and compares the column into the memory when a function performs the execution.

  •  DEMAND_LOAD_CACHE is suitable while looking up the highly repetitive values with small subset of data.

Report Error

View answer Workspace Report Error Discuss

0 1097
Q:

How do we measure progress in Data Integration?

Answer

Look for the existence of the following items:


- Generic Data Models


- An Enterprise Data Platform


- Identify the Data Sources


- Selection of a MDM Product


- Implementation of a Customer Master Index or appropriate alternative

Report Error

View answer Workspace Report Error Discuss

0 1030
Q:

What are the benefits of data integration?

Answer

Following are the benefits of data integration:


- Makes reporting, monitoring, placing customer information across the enterprise flexible and convenient.


- Data usage is efficient.


- Cost Effective.


- Risk adjusted profitability management as it allows accurate data extraction.


- Allows timely and reliable reporting, as data quality is the prime technology for business challenges.

Report Error

View answer Workspace Report Error Discuss

0 1023
Q:

What does BUS Schema means ?

Answer

BUS Schema is composed of a master suite of confirmed dimension and standardized definition if facts.

Report Error

View answer Workspace Report Error Discuss

6 975