Data Warehousing Questions

Q:

How to perform incremental load in DataStage?

Answer

-Daily loading is known as incremental load.


-When data is selected from source, selected records are loaded between timestamp of last load and the current time 


-The parameter that are passed to perform are last loaded date and current date


-The first parameter is the stored last run date is read through job parameters


-The second parameter is the current date

Report Error

View answer Workspace Report Error Discuss

1 2282
Q:

What is the difference between agglomerative and divisive Hierarchical Clustering?

Answer

- Agglomerative Hierarchical clustering method allows the clusters to be read from bottom to top and it follows this approach so that the program always reads from the sub-component first then moves to the parent. Whereas, divisive uses top-bottom approach in which the parent is visited first then the child. 


- Agglomerative hierarchical method consists of objects in which each object creates its own clusters and these clusters are grouped together to create a large cluster. It defines a process of merging that carries on till all the single clusters are merged together into a complete big cluster that will consists of all the objects of child clusters. Whereas, in divisive the parent cluster is divided into smaller cluster and it keeps on dividing till each cluster has a single object to represent.

Report Error

View answer Workspace Report Error Discuss

1 1766
Q:

Difference between ER Modeling and Dimensional Modeling.

Answer

Dimensional modelling is very flexible for the user perspective. Dimensional data model is mapped for creating schemas. Where as ER Model is not mapped for creating shemas and does not use in conversion of normalization of data into denormalized form.


ER Model is utilized for OLTP databases that uses any of the 1st or 2nd or 3rd normal forms, where as dimensional data model is used for data warehousing and uses 3rd normal form.


ER model contains normalized data where as Dimensional model contains denormalized data.

Report Error

View answer Workspace Report Error Discuss

0 1639
Q:

How do you generate Sequence number in Datastage?

Answer

Sequence numbers can be generated in Datastage using certain routines. They are


-KeyMgtGetNextVal


-KeyMgtGetNextValConn

Report Error

View answer Workspace Report Error Discuss

1 1307
Q:

What are the components of DataStage?

Answer

Datastage has two components


-Client Components – Designer, Director, Manager and Administrator


-Server Components – Server, Repository and Plug-ins.

Report Error

View answer Workspace Report Error Discuss

1 1135