bala krishna
Jan 30, 2011, 11:27:28 PM
to INFOSHARE DATASTAGE(NOV2010-FEB2011)
COVANSIS (CHENNAI)
1. Tell me about your project?
2. Data scrubbing?
3. How many jobs are there in your project?
4. Take one job and describe each stage and the overall process?
5. Of all your jobs, which one is critical?
6. Are you involved in job design?
7. Are you involved in unit testing?
CS SOFTWARE SOLUTIONS
1. Tell me about yourself?
2. Tell me about your project?
3. What is your daily work in your office?
4. Tell me about your role in your project?
5. What are the fact tables & dimension tables in your project?
6. Definition of DWH?
7. What does "particular point in time" mean? (The phrase came up in my DWH definition.)
8. Difference between Datamart & data warehouse?
9. Difference between OLTP & OLAP?
10. Types of schemas?
11. Difference between star schema & snowflake schema?
12. Difference between top-down & bottom-up approaches?
13. Which approach is used in your project?
14. Normalization?
15. In any schema, the fact table is normalized. Is that always true?
16. What are the attributes in your dimension tables?
17. What is the aim of your project?
18. What is staging area?
19. What is the sequence of jobs in your project?
20. Types of hash files?
21. Difference between static & dynamic hash files?
22. Which performance tuning techniques do you use in your project?
23. Have you heard of the Integrity stage?
IBM (CALCUTTA)
1. Tell me about yourself & your project?
2. Do you use batches or job sequences to run multiple jobs in sequence?
3. Which stages have you used in your project?
4. Which type of hash file are you using?
5. Star schema?
6. Types of star schemas?
7. Difference between OLTP & OLAP?
8. Main difference between OLTP & OLAP? About dimensional?
9. How do you trigger the job?
10. Surrogate key?
11. We already have a business ID key, so why do we use a surrogate key?
12. If we have to maintain a concatenated bizkey_timestamp key, which one is better: that key or a surrogate key?
13. Tell me the structure of the hash file in your project?
14. Why do we use the pre-load file option for a hash file?
15. When you store the hash file in a directory, can you open that file? In which format does it open (for example Word or Notepad)?
16. Conformed dimension?
17. What is the order of execution of stage variables, constraints, and derivations?
18. How can you compare rows using the Transformer stage?
19. Which types of dimensions are you using in your project?
20. Are you involved in job design?
21. Is your project a development project or a support project?
22. Do you get data from the client every day?
23. Which one is used in your project: job sequence or batch?
24. Using a hash file as a lookup, how many rows do you compare with the source data?
IBM (CALCUTTA)
1. Tell me about yourself?
2. Tell me about your current project?
3. Which stages are you using in your project?
4. How do you get the flat files, and where do you place them?
5. What exactly is your daily work from morning to evening?
6. Can you explain the project architecture?
7. What do you do in the Transformer?
8. How do you check dependencies?
9. How do you do currency conversions, for example $ to Rs.?
10. What exactly is a hash file?
11. How will you load the initial load and the incremental load?
12. What is the difference between local container & shared container?
13. Can we use local container in multiple jobs?
14. What about DataStage functions?
15. Where can we use those functions?
16. What is an expression?
17. Where can we use stage variables?
18. What is an environment variable?
19. Difference between environment variables & job parameters?
20. Difference between job parameters & stage variables?
21. What is semi-additive fact?
22. Factless fact table?
23. Default size of hash file?
24. If the hash file grows beyond 2 GB, what will you do? Will the job abort?
25. Different types of links?
26. for hash file,
27. What is the maximum number of columns that can be retrieved from a hash file?
28. Difference between procedures & functions?
29. Example of a plug-in stage?
30. Difference between a built-in stage & a plug-in stage?
31. Active stage & passive stage?
32. Link collector & link partitioner?
IBM (PUNE)
1. Difference between job sequence & batch?
2. How can we import the DS jobs individually?
3. Degenerate dimension?
4. Where is the hash file stored: on the server or on the client?
5. Types of hash file?
6. We use a hash file as a lookup in DataStage. How would the same thing work in Oracle? As a + operation, a - operation, a manipulation operation, or some other operation?
7. Link collector?
8. How would this link collector work in Oracle?
9. How can you call the routines in DataStage?
10. We have 2 jobs, job A & job B, in DS. How can we run job B before job A without using a job sequence or batch?
11. Use of job control?
12. Does DS 7.1 run on Windows or UNIX?
13. From my input, I got the dates 31-09-1981 & 31-02-1982. 31-02-1982 is an invalid date. I want only the valid dates, without the invalid ones. How can this be done in DS?
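
For the date question above, the underlying check can be sketched outside DataStage. This is only a Python illustration of the logic, not DataStage code; the function name is_valid_date and the third sample date are made up for the example. Note that a strict calendar check rejects both dates quoted in the question, because September has only 30 days.

```python
from datetime import datetime

def is_valid_date(text, fmt="%d-%m-%Y"):
    """Return True if text is a real calendar date in the given format."""
    try:
        datetime.strptime(text, fmt)
        return True
    except ValueError:
        # strptime raises ValueError for impossible dates such as 31 February
        return False

# The first two dates come from the question; the third is an invented valid one.
dates = ["31-09-1981", "31-02-1982", "30-09-1981"]
valid_dates = [d for d in dates if is_valid_date(d)]
```

In a job, the same idea would be to test each incoming date and pass only the rows that pass the test to the target, sending the rest to a reject link.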
INFOSYS INTERVIEW ON DATASTAGE
1. Tell me about yourself?
2. Tell me about your project?
3. Explain your ETL architecture?
4. Are you involved in the design of jobs?
5. Are you involved in any testing?
6. Various levels (types) in unit testing?
7. What is staging area?
8. Is it possible to load data without a staging area?
9. Do you ever use a staging area?
10. Have you done any calculations in your projects? How?
11. How do you debug when a particular job fails?
12. When a job shows as failed, how do you identify the exact error?
13. 1000 flat files are loaded into an Oracle database, and the load takes 1 hour. How would you speed it up?
14. For the above problem, where would you look for the cause (in the source, in the target, or in the transformer)?
15. How do you do error handling?
ORACLE:
16. Data partitioning in Oracle 9i?
17. Difference between procedure and function?
18. Use of procedures?
19. Can procedures return values?
20. One kind of function cannot return values. Which function is that?
21. What is a package?
22. Use of package?
23. Where can I use a package?
24. What is a synonym?
25. Use of view?
26. Example of view for table?
27. Types of views?
28. What is a materialized view?
29. When do we use a materialized view?
30. How do you solve a complex query?
31. How can we improve query performance?
32. You and I each wrote a query, and both give the same output. How can you find out which one is the better query?
DATASTAGE:
33. How do we update a single file?
34. Error handling in a DataStage ETL job?
35. Version control? How does it work?
36. Did you use any version control?
37. Why do we use version control?
38. Did you use any scheduler tools?
39. Any job schedulers in your projects?
40. What is incremental loading?
41. How do we design jobs for incremental loading?
42. Types of DWH?
43. Types of approaches?
44. We need a fully dynamic design that can be moved to another environment without changing any paths or anything else in the project or job, and the job must still run without any errors. What is the process?
45. How do you run jobs in your projects?
46. How do you trigger your jobs?
47. How do the jobs run in your projects?
48. Difference between Datamart and data warehouse?
49. What are the dimensions in your projects?
50. How are errors reported (by email or SMS)?
MATRIX INTERVIEW
1. What exactly is the repository? Is it a file or some other DB? ANS: universal db.
2. What is metadata?
3. What are the stages you are using in your project?
4. Are you creating any routines?
5. How can you export the job design to the repository?
6. In which format do you export? ANS: .dsx.
7. What is an environment variable?
8. Why do we use environment variables?
9. What exactly is a hash file?
10. When we enable the runtime column propagation option, what happens?
11. Difference between batch & job sequence?
12. How to schedule the jobs?
13. Which scheduler tools have you used?
14. Using aswin, how and where can you schedule jobs?
15. Fact load? (Which one is loaded first: dimension or fact?)
16. Ralph Kimball’s approaches?
17. Difference between DWH & DM?
18. ICONV & OCONV?
19. What is the difference between procedures & functions?
20. Tell me about yourself?
21. How can you run a job from the command line?
22. How can you run a batch from the command line?
23. Referential integrity?
WIPRO (BANGALORE)
1. Tell me about yourself?
2. A table has emp id, sal, ename, dname, and deptno. Find the dname of the department in which the maximum salary is paid?
3. Difference between function & procedures?
4. A file has empno, ename, sal, deptno, and dname. Find the 3 employees who receive the highest salaries in each dept? In DS?
5. Pseudo column in Oracle?
6. Difference between snowflake & starflake?
7. Degenerate dimension?
8. Job parameters?
9. Environment variables?
10. In the ODBC stage, what is the isolation level?
11. In the ODBC stage, what are transaction size & array size?
12. Scenario for link collector? [For which purpose are you using the link collector in your project?]
13. How to improve the performance of jobs?
14. Process of SCD Type 2?
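
For the two salary questions in this list (the department with the highest salary, and the top 3 salaries in each department), the logic can be sketched in plain Python. This is an illustration of the grouping idea only, not a DataStage job; the function name top_n_per_group and the sample rows are invented, and ties are not given any special treatment.

```python
from collections import defaultdict

def top_n_per_group(rows, group_key, value_key, n=3):
    """Return the n rows with the largest value_key for each group_key."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[group_key]].append(row)
    return {
        group: sorted(members, key=lambda r: r[value_key], reverse=True)[:n]
        for group, members in groups.items()
    }

# Hypothetical sample rows; the values are invented for illustration.
employees = [
    {"empno": 1, "ename": "A", "sal": 5000, "deptno": 10, "dname": "SALES"},
    {"empno": 2, "ename": "B", "sal": 7000, "deptno": 10, "dname": "SALES"},
    {"empno": 3, "ename": "C", "sal": 6000, "deptno": 10, "dname": "SALES"},
    {"empno": 4, "ename": "D", "sal": 4000, "deptno": 10, "dname": "SALES"},
    {"empno": 5, "ename": "E", "sal": 3000, "deptno": 20, "dname": "HR"},
    {"empno": 6, "ename": "F", "sal": 3500, "deptno": 20, "dname": "HR"},
]

# Top 3 earners in each department.
top3 = top_n_per_group(employees, "dname", "sal")

# Department of the single highest-paid employee.
max_sal_dname = max(employees, key=lambda r: r["sal"])["dname"]
```

The same shape (sort by department and salary descending, then keep the first 3 rows of each department) is what a job design for this question has to reproduce.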
WIPRO (BANGALORE-2)
1. Tell me about yourself?
2. Tell me about your project?
3. Types of triggers in Oracle?
4. Isolation level?
5. Array size & transaction size?
6. A table has student name, student id, branch, and total marks. We want the top 3 scorers in each branch. How can this be done in DS?
7. A table has student name, student id, branch, and total marks. We want the top 3 scorers overall. How can this be done in DS?
8. We have student names, total marks & 5 department names in one table. We want to store those 5 departments in 5 different tables. How can this be done?
9. Difference between snowflake & starflake?
10. Degenerate dimension?
11. SCD Type 2 process?
12. Have you used any environment variables in your project?
13. What is an environment variable?
14. How to improve the performance of a job?
15. In how many ways can you remove duplicates?
16. Have you written any routines in your project?
17. What is SCD?
18. Surrogate key?
19. Example of a degenerate dimension?
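
The question about storing 5 departments in 5 different tables comes down to routing each row by its department value, comparable to giving each output link of a Transformer its own constraint. A minimal Python sketch of that routing follows; it is not DataStage code, and the function name, column names, and sample rows are assumptions made for the example.

```python
from collections import defaultdict

def split_by_department(rows, dept_key="dept"):
    """Route each row to the list (target table) for its department."""
    targets = defaultdict(list)
    for row in rows:
        targets[row[dept_key]].append(row)
    return dict(targets)

# Hypothetical sample rows with invented names, marks, and departments.
students = [
    {"name": "S1", "marks": 450, "dept": "CSE"},
    {"name": "S2", "marks": 410, "dept": "ECE"},
    {"name": "S3", "marks": 390, "dept": "CSE"},
    {"name": "S4", "marks": 470, "dept": "MECH"},
]

tables = split_by_department(students)
```

Each key in tables stands for one target table, so five distinct department values in the input would give five separate outputs.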
WIPRO (CHENNAI CONSULTANT)
1. In job sequence, what is the use of sequencer stage?
2. Tell me about the fact table & dimension tables in your project?
3. What exactly is a hash file?
4. Types of hash file?
5. How do you load the target? (Direct load or bulk load)
6. What about aggregator stage?
7. Have you written any routines? [We have 3 columns. Columns A & B are given values. Write a routine that puts the sum of columns A & B in column C.]
8. We have many rows, say 3000, in the source, and we have to filter the data at extraction time. How can you filter?
WIPRO (DIRECT)
1. Tell me about yourself?
2. Tell me about your project?
3. Tell me about your role in your project?
4. Types of dimension tables?
5. Types of fact tables?
6. Types of schemas?
7. Surrogate key?
8. Difference between surrogate key & primary key?
9. Use of surrogate key?
10. What is slowly changing dimension?
11. Types of SCDs?
12. Which dimensions are used in your project?
13. Tell me the SCD Type 2 process?
14. Have you faced any critical problems in your project?
15. What is the use of your project to your client?
16. Difference between server jobs & parallel jobs?
17. Why did your client choose DS Server 7.5 rather than parallel?
18. The hash file (lookup) has 1000 rows, and the source has a million rows. How do you join these two tables?
19. The hash file (lookup) has 1000 rows, and the source has a million rows. The hash file holds the dept table, and the source is the EMP table. The EMP table also has the dept name. If a row in EMP has a dept name that is not in the dept table, reject that row. How can this be done? How can you find such rows?
20. In the above situation, every row of the EMP table is compared with those 1000 rows, so performance suffers. What do you do to improve the performance?
21. How do you join 2 tables?
22. How to improve the performance of a job?
23. Have you used any data cleansing tools in your project?
24. Difference between Datamart & DWH?
25. What is data cleansing?
26. What is star schema?
27. What are the attributes in your fact table?
28. How do you do error handling? Have you done any error handling in your project?
29. We have to load 100 rows into a target, but after 10 rows are committed, the transaction stops working for some reason. We have to load the remaining 90 rows without duplicates. How can this be done?
30. If the system restarts due to an internal problem while a transaction is in progress, what happens?
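
The dept lookup questions in this list (a 1000-row lookup against a million-row EMP source, rejecting rows whose dept name is missing from the lookup) can be illustrated with a keyed lookup in Python. This is a sketch of the idea, not DataStage code; the function name, column names, and sample rows are invented. The point it shows is that a lookup keyed on dept name is built once, so each source row costs one key probe instead of a comparison against all 1000 rows.

```python
def lookup_with_reject(emp_rows, dept_rows):
    """Split emp_rows into matched and rejected rows using a keyed dept lookup."""
    # Build the lookup once, keyed on department name (like a hashed file key).
    dept_by_name = {dept["dname"]: dept for dept in dept_rows}
    matched, rejected = [], []
    for emp in emp_rows:
        dept = dept_by_name.get(emp["dname"])
        if dept is None:
            rejected.append(emp)  # dept name not present in the lookup
        else:
            matched.append({**emp, "deptno": dept["deptno"]})
    return matched, rejected

# Hypothetical sample data with invented values.
dept_rows = [
    {"deptno": 10, "dname": "SALES"},
    {"deptno": 20, "dname": "HR"},
]
emp_rows = [
    {"empno": 1, "ename": "A", "dname": "SALES"},
    {"empno": 2, "ename": "B", "dname": "FINANCE"},
    {"empno": 3, "ename": "C", "dname": "HR"},
]

matched, rejected = lookup_with_reject(emp_rows, dept_rows)
```

Here the FINANCE row lands in rejected, which corresponds to sending unmatched rows down a reject link.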