· Types of Stages in DS? Explain with Examples
· What are active stages and passive stages?
· Can you filter data in hashed file? (No)
· Difference between sequential and hashed file?
· How do you populate time dimension?
· Can we use target hashed file as lookup? (Yes)
· What is Merge Stage?
· What is Job Sequencer?
· What are stages in sequences?
· How do you pass parameters?
· What parameters did you use in your project?
· What are log tables?
· What is job controlling?
· Facts and dimension tables?
· Conformed dimensions?
· Difference between OLTP and OLAP?
· Difference between star schema and snowflake schema?
· What are hierarchies? Examples?
· What are materialized views?
· What is aggregation?
· What is surrogate key? Is it used for both fact and dimension tables?
· Why do you go for an Oracle sequence generator rather than a DataStage routine?
· Flow of data in datastage?
· Initial loading and incremental loading?
· What is SCD? Types?
· How do you develop SCD type2 in your project?
· How do you load dimension data and fact data? Which is first?
· Difference between oracle function and procedure?
· Difference between unique and primary key?
· Difference between union and union all?
· What is minus operator?
· What is audit table?
· If there is a large hashed file and a smaller Oracle table, and you are looking up from a Transformer in different jobs, which will be faster?
· Tell me about SCDs?
· How did you implement SCD in your project?
· What are derivations in transformer?
· How do you use surrogate key in reporting?
· Log view in DataStage versus logs in Informatica: which is clearer?
· How does pivot stage work?
· What is a surrogate key? What is its importance? How did you implement it in your project?
· How many jobs did you develop in total, and how many lookups did you use in total?
· How do constraints in the Transformer work?
· How will you declare a constraint in datastage?
· How will you handle rejected data?
· Give me some performance tips in datastage?
· Can we use sequential file as a lookup?
· How does hash file stage lookup?
· Why can’t we use sequential file as a lookup?
· What is data warehouse?
· What is ‘Star-Schema’?
· What is ‘Snowflake-Schema’?
· What is the difference between Star-Schema and Snowflake-Schema?
· What is meant by a surrogate key?
· What is ‘Conformed Dimension’?
· What is Factless Fact Table?
· When will we use connected and unconnected lookup?
· Which cache supports connected and unconnected lookup?
· What is the difference between SCD Type 2 and SCD Type 3?
· What is the difference between a data mart and a data warehouse?
· What is a composite key?
· What is a surrogate key? When will you go for it?
· What is dimensional modeling?
· What are SCD and SGT? Difference between them? Example of SGT from your project.
· How do you import your source and targets? What are the types of sources and targets?
· What do Active Stages and Passive Stages mean in DataStage?
· What is difference between Informatica and DataStage? Which do you think is best?
· What are the stages you used in your project?
· What do you mean by parallel processing?
· What is the difference between Merge Stage and Join Stage?
· What is the difference between Copy Stage and Transformer Stage?
· What is the difference between ODBC Stage and OCI Stage?
· What is the difference between Lookup Stage and Join Stage?
· What is the difference between Change Capture Stage and Difference Stage?
· What is the difference between a hashed file and a sequential file?
· What are different Joins used in Join Stage?
· How do you decide when to go for the Join stage and when for the Lookup stage?
· What is partition key? Which key is used in round robin partition?
· How do you handle SCD in datastage?
· What are Change Capture Stage and Change Apply Stages?
· How many input streams can you give to the Transformer?
· What is primary link and reference link?
· What is a routine? What are before and after subroutines? Do they run before/after the job or the stage?
· What is the config file? Does each job have its own config file, or is one enough?
· What is Node?
· What is IPC Stage? How does it increase performance?
· What is Sequential buffer?
· What are Link Partitioner and Link Collector?
· What performance tuning have you done in your project?
· Have you done scheduling? How? Can you schedule a job on the last day of every month? How?
· What is a job sequence? Have you run any jobs?
· What is status view? Why do you clear it? What happens internally when you clear the status view?
· What is a hashed file? What are the types of hashed file? Which do you use? What is the default?
· What is the main advantage of a hashed file? What is the difference between the types (static and dynamic)?
· What are containers? Give example from your project.
· What are parameters and parameter file?
· How do you convert columns to rows and rows to columns in DataStage? (Using Pivot Stage).
· What is Pivot Stage?
· What is the execution flow of constraints, derivations and variables in the Transformer stage? What are these?
· How do you eliminate duplicates in datastage? Can you use hash file for it?
· If the 1st and 8th records are duplicates, which will be skipped? Can you configure it?
· How do you import and export DataStage jobs? What is the file extension? (See each component while importing and exporting).
· How do you rate yourself in DataStage?
· Explain DataStage Architecture?
· What is repository? What are the repository items?
· What is the difference between a routine and a transform?
· When do you write routines?
· What is the complex situation you faced in DataStage?
· What are system variables? Which system variables did you use in your project?
· What are the different datastage functions used in your project?
· Difference between star schema and snowflake schema?
· What are conformed, degenerate, and junk dimensions?
· What are conformed facts?
· Different type of facts and their examples?
· What are approaches in developing data warehouse?
· Different types of hashed files?
· What are routines and transforms? How did you use them in your project?
· Difference between Data Mart and Data Warehouse?
· What is surrogate key? How do you generate it?
· What are environment variables and global variables?
· How do you improve the performance of the job?
· What is SCD? How did you develop SCD type 1 and SCD type 2?
· How do you generate surrogate key in datastage?
· What is job sequence?
· What are plug-ins?
· How much data do you get every day?
· What is the biggest table and size in your schema or in your project?
· What is the size of data warehouse (by loading data)?
· How do you improve the performance of the hashed file?
· What is IPC Stage?
· What are the different types of stages, and which did you use in your project?
· What are the operations you can do in IPC Stage and transformer stage?
· What is merge stage? How do you merge two flat files?
· What is the difference between the ODBC and Oracle OCI stages?
· What is the difference between a sequential file and a hashed file?
· Can you use a sequential file as a source to a hashed file? Have you done it? What error will it give?
· Why does a hashed file improve performance?
· Can the Aggregator and Transformer stages be used for sorting data? How?
· How many input links can you give to the Transformer?
· Definition of Slowly Changing Dimensions? Types?
· What are the Iconv and Oconv functions?
· What is the advantage of using the OCI stage compared to the ODBC stage?
· What is the difference between inter-process and in-process? Which one is best?