Go Back   2018-2019 StudyChaCha > >




  #2  
Old April 21st, 2014, 06:13 PM
Vinodt
Super Moderator
 
Join Date: Jun 2013
Posts: 42,143
Default Re: ICFAI MBA Data Warehousing and Data Mining (MB3G1IT) Paper

As per your request here I am sharing the previous year ICFAI University question paper of MBA Data Warehousing and Data Mining (MB3G1IT)


1. Which of the following data warehouse process managers transform and manages the data?
(a) Query manager
(b) Database manager
(c) Event manager
(d) Warehouse manager
(e) Load manager.

2. Which of the following task is performed by the copy management tool in a load manager?
(a) Job control
(b) Complex checking
(c) Backup and archive
(d) Fast load
(e) Simple transformation.

3. Which of the following statements is/are true about Massively Parallel-Processing (MPP) machine?
I. MPP machines require the use of a distributed lock manager to maintain the integrity of the distributed
resources across the system as a whole.
II. Design and management of a data warehouse on an MPP machine is considerably more difficult than on
Symmetric Multi-Processing (SMP) system.
III. The amount of CPU power that can be made available in an MPP machine is independent.
(a) Only (I) above
(b) Only (II) above
(c) Only (III) above
(d) Both (I) and (II) above
(e) Both (II) and (III) above.

4. It is certain that large data sorts are going to be required within the data warehouse. If we can gauge the size of
the largest transaction that will realistically be run, we can use this to size the temporary requirements. If not, the
best we can do is tie it to the size of a partition. If the number of concurrent queries allowed are 4 and the size of the
partition is 3GB, then we need to set the temporary space (T) to
(a) 7 GB
(b) 12 GB
(c) 24 GB
(d) 27 GB
(e) 36 GB.

5. Which of the following backup software packages is produced by ‘HP’?
(a) OmnibackII
(b) ADSM
(c) Alexandria
(d) Epoch
(e) Networker.

6. Which of the following statements is/are true about various types of partitioning?
I. Vertical partitioning can take two forms: normalization and row splitting.
II. Before using a vertical partitioning we need to be very sure that there will be no requirements to perform
major join operations between the two partitions.
III. Horizontal hardware partitioning technique spreads the processing load by horizontally partitioning the fact
table into small segments and then physically storing each segment in a different node.
(a) Only (I) above
(b) Only (II) above
(c) Only (III) above
(d) Both (I) and (II) above
(e) All (I), (II) and (III) above.

7. Which of the following statements is/are true about the phases in data warehouse delivery process?
I. Technical blueprint phase is the stage where the first production deliverable is produced.
II. Build the vision phase must deliver an overall architecture that satisfies the long-term requirements and a
definition of the components that must be implemented in the short term in order to derive any business
benefit.
III. The purpose of the business case is to identify the projected business benefits that should be derived from
using the data warehouse.
(a) Only (I) above
(b) Only (II) above
(c) Only (III) above
(d) Both (I) and (II) above
(e) Both (II) and (III) above.

8. Which of the following is/are not produced in the technical blueprint stage of data warehouse delivery process?
I. Detailed design of database.
II. Essential components of database design.
III. Server and data mart architecture.
IV. Backup and recovery strategy.
(a) Only (I) above
(b) Only (III) above
(c) Both (I) and (IV) above
(d) (I), (III) and (IV) above
(e) (II), (III) and (IV) above.

9. Which Redundant Array of Inexpensive Disks (RAID) levels use byte wise striping of data with parity disk?
(a) Level 1
(b) Level 2
(c) Level 3
(d) Level 4
(e) Level 5.

10.Which of the following statements is/are false about query management process?
I. Query management process is the system process that manages the queries and speeds them up by directing
queries to the most effective data source.
II. Like other system processes, query management process generally operates during the regular load of
information into the data warehouse.
III. Query management process must ensure that no single query can affect the overall system performance.
(a) Only (I) above
(b) Only (II) above
(c) Only (III) above
(d) Both (I) and (II) above
(e) Both (II) and (III) above.

Page 3 of 20
11.Which of the following tasks is performed by system management tool in a warehouse manager?
(a) Job control
(b) Create indexes
(c) Backup and archive
(d) Generate star schemas
(e) Query profile analysis.

12.Which of the following statements is/are true about fact data and dimension data?
I. Fact data represents a physical transaction that has occurred at a point in time and as such is unlikely to
change on an ongoing basis during the life of the data warehouse.
II. In general, dimension data in a star schema or snowflake schema is designed to minimize the cost of
change and is typically very low volume data (i.e, under 5GB).
III. Fact data will have only one foreign key whereas reference data will have one primary key.
(a) Only (I) above
(b) Only (II) above
(c) Only (III) above
(d) Both (I) and (II) above
(e) Both (II) and (III) above.

13.The reason(s) for partitioning the fact table is/are
I. To increase the performance.
II. To assist the management of the data.
III. To assist backup/recovery.
(a) Only (I) above
(b) Only (II) above
(c) Only (III) above
(d) Both (I) and (II) above
(e) All (I), (II) and (III) above.

14.Which of the following statements is/are true about metadata?
I. As a part of extraction and load process, metadata is used to map data sources to the common view of
information within the data warehouse.
II. As a part of the warehouse management process, metadata is used to direct a query to the most appropriate
data source.
III. As a part of query management process, metadata is used to automate the production of summary tables.
(a) Only (I) above
(b) Only (II) above
(c) Only (III) above
(d) Both (I) and (II) above
(e) Both (II) and (III) above.

15.Fact table identification process requires four steps. Arrange the following four steps in correct sequence.
I. Look for elemental transactions.
II. Check if fact is a dimension.
III. Check if dimension is a fact.
IV. Determine key dimensions.
(a) I-II-III-IV
(b) I-IV-II-III
(c) I-III-II-IV
(d) IV-I-III-II
(e) IV-II-III-I.

16.As with any relational system, foreign keys within a fact table can be structured in two ways: using intelligent
keys and using non-intelligent keys. Which of the following statements are true about intelligent and nonintelligent
keys?
I. In intelligent keys, each key represents the unique identifier for the item in the real world.
II. In non-intelligent keys, each unique key is generated automatically and refers to the unique identifier for
the item in the real world.
III. Usage of intelligent key can be costly and time consuming.
IV. Unless it is certain that identifiers will not change, it is safer to use intelligent keys.
(a) Both (I) and (II) above
(b) Both (II) and (III) above
(c) (I), (II) and (III) above
(d) (II), (III) and (IV) above
(e) All (I), (II), (III) and (IV) above.

17.Which of the following statements is/are false about hardware architectures used in data warehouse solutions?
I. A Symmetric Multi-Processing (SMP) machine is a set of loosely coupled CPUs, each of which has its own
memory and disk.
II. A Massively Parallel-Processing (MPP) machine is a set of tightly coupled CPUs, that share memory and
disk.
III. A Non Uniform Memory Architecture (NUMA) machine is basically a tightly coupled cluster of
Symmetric Multi-Processing (SMP) nodes.
(a) Only (III) above
(b) Both (I) and (II) above
(c) Both (I) and (III) above
(d) Both (II) and (III) above
(e) All (I), (II) and (III) above.

18.Which of the following statements is/are false about Online Analytical Processing (OLAP) tools?
I. OLAP tools do not learn.
II. OLAP tools create new knowledge.
III. OLAP tools are more powerful than data mining.
IV. OLAP tools cannot search for new solutions.
(a) Only (I) above
(b) Only (III) above
(c) Both (I) and (II) above
(d) Both (II) and (III) above
(e) (I), (II) and (IV) above.

19.NetSol, a reputed Garments company is maintaining the database of the customers’ office & home phone
numbers. But it is having a problem of wasting space when the customers are using either of them. So which of
the following processes can help the company in eliminating the redundant data?
(a) Analyzing
(b) Normalizing
(c) Structuring
(d) Randomizing
(e) Actualizing.

20.Which of the following backup software packages is produced by ‘Legato’?
(a) OmnibackII
(b) ADSM
(c) Alexandria
(d) Epoch
(e) Networker.


21.In an organization, what is the relationship between DEPARTMENT and EMPLOYEE?
(a) One-to-one relationship
(b) One-to-many relationship
(c) Many-to-many relationship
(d) Many-to-one relationship
(e) Symmetric relationship.

22.Which of the following are the data mining techniques?
I. Association rules.
II. Neural networks.
III. Normalization.
IV. Genetic algorithms.
(a) Both (I) and (II) above
(b) Both (I) and (III) above
(c) (I), (II) and (IV) above
(d) (II), (III) and (IV) above
(e) All (I), (II), (III) and (IV) above.

23.Consider the student table with attributes: sname, snum, totalmarks, semester. Write an SQL statement to display
the name, total marks and student number whose snum is CB14.
(a) select sname, totalmarks, snum from student where snum=‘CB14’;
(b) select sname, totalmarks, snum from student where snum=CB14;
(c) select sname, totalmarks from student where snum=CB14;
(d) select sname, totalmarks from student where snum=‘CB14’;
(e) select sname, totalmarks where snum=‘CB14’;.

24.According to Freud’s theory of psychodynamics, the human brain was described as a
(a) Decision tree
(b) Neural network
(c) Learning
(d) Knowledge
(e) Visualization technique.

25.Which of the following are the stages in Knowledge Discovery Process?
I. Data encapsulation.
II. Data selection.
III. Enrichment.
IV. Reporting.
(a) Both (I) and (II) above
(b) Both (I) and (III) above
(c) (I), (II) and (III) above
(d) (II), (III) and (IV) above
(e) All (I), (II), (III) and (IV) above.

26.In Knowledge Discovery Process, which of the following is a coding operation in which an attribute with
cardinality ‘n’ is replaced by ‘n’ binary attributes?
(a) Flattening
(b) Replication
(c) Redundancy
(d) Falsification
(e) Atomicity.

27.In general, data mining algorithms should not have a complexity higher than
(a) n(log n)
(b) (log n)
(c) (n+1)(log n)
(d) n(log (n+1))
(e) 2n(log n).


28.In data mining, which of the following statements are true about the various types of knowledge?
I. Shallow knowledge is the information that can be analyzed using Online Analytical Processing (OLAP)
tools.
II. Multi-dimensional knowledge is the information that can be easily retrieved from databases using a query
tool such as Structured Query Language (SQL).
III. Hidden knowledge is the information that can be found easily by using pattern recognition or machinelearning
algorithms.
IV. Deep knowledge is the information that is stored in the database but can only be located if we have a clue
that tells us where to look.
(a) Both (I) and (II) above
(b) Both (III) and (IV) above
(c) (I), (II) and (III) above
(d) (II), (III) and (IV) above
(e) All (I), (II), (III) and (IV) above.

29.Which of the following statements is/are true about Genetic algorithms?
I. These algorithms can be viewed as a kind of meta-learning strategy.
II. Any programmer can write the basic structure of this algorithm easily.
III. Solutions found by these algorithms are coded symbolically and it is very hard to read as compared to
neural networks.
(a) Only (I) above
(b) Only (II) above
(c) Both (I) and (II) above
(d) Both (II) and (III) above
(e) All (I), (II) and (III) above.

30.Which of the following statements is/are true about the various forms of neural networks?
I. A perceptron consists of a simple three-layered network with input units called photo-receptors,
intermediate units called associators and output units called responders.
II. A back propagation network not only has input and output nodes, but also a set of intermediate layers with
hidden nodes.
III. A Kohenen self-organizing map is a collection of neurons or units, each of which is connected to a small
number of other units called its neighbors.
(a) Only (I) above
(b) Only (II) above
(c) Both (I) and (II) above
(d) Both (II) and (III) above
(e) All (I), (II) and (III) above.


Rest of the Questions are attached in below file which is free of cost
__________________
Answered By StudyChaCha Member
Reply With Quote Quick reply to this message
Sponsored Links






















  #3  
Old October 30th, 2015, 11:00 AM
Unregistered
Guest
 
Posts: n/a
Default Re: ICFAI MBA Data Warehousing and Data Mining (MB3G1IT) Paper

I need a sample paper of ICFAI MBA Data Warehousing and Data Mining (MB3G1IT) exam. So here can you provide me a sample paper of this exam?
Reply With Quote Quick reply to this message
  #4  
Old October 30th, 2015, 11:03 AM
shabnams
Super Moderator
 
Join Date: Dec 2012
Posts: 19,799
Default Re: ICFAI MBA Data Warehousing and Data Mining (MB3G1IT) Paper

I have a complete sample paper of ICFAI MBA Data Warehousing and Data Mining (MB3G1IT) exam. So here I am providing you as you want.


1. Which of the following data warehouse process managers transform and manages the data?
(a) Query manager
(b) Database manager
(c) Event manager
(d) Warehouse manager
(e) Load manager.

2. Which of the following task is performed by the copy management tool in a load manager?
(a) Job control
(b) Complex checking
(c) Backup and archive
(d) Fast load
(e) Simple transformation.

3. Which of the following statements is/are true about Massively Parallel-Processing (MPP) machine?
I. MPP machines require the use of a distributed lock manager to maintain the integrity of the distributed
resources across the system as a whole.
II. Design and management of a data warehouse on an MPP machine is considerably more difficult than on
Symmetric Multi-Processing (SMP) system.
III. The amount of CPU power that can be made available in an MPP machine is independent.
(a) Only (I) above
(b) Only (II) above
(c) Only (III) above
(d) Both (I) and (II) above
(e) Both (II) and (III) above.

4. It is certain that large data sorts are going to be required within the data warehouse. If we can gauge the size of
the largest transaction that will realistically be run, we can use this to size the temporary requirements. If not, the
best we can do is tie it to the size of a partition. If the number of concurrent queries allowed are 4 and the size of the
partition is 3GB, then we need to set the temporary space (T) to
(a) 7 GB
(b) 12 GB
(c) 24 GB
(d) 27 GB
(e) 36 GB.

5. Which of the following backup software packages is produced by ‘HP’?
(a) OmnibackII
(b) ADSM
(c) Alexandria
(d) Epoch
(e) Networker.

i. e x e
Page 2 of 20
6. Which of the following statements is/are true about various types of partitioning?
I. Vertical partitioning can take two forms: normalization and row splitting.
II. Before using a vertical partitioning we need to be very sure that there will be no requirements to perform
major join operations between the two partitions.
III. Horizontal hardware partitioning technique spreads the processing load by horizontally partitioning the fact
table into small segments and then physically storing each segment in a different node.
(a) Only (I) above
(b) Only (II) above
(c) Only (III) above
(d) Both (I) and (II) above
(e) All (I), (II) and (III) above.

7. Which of the following statements is/are true about the phases in data warehouse delivery process?
I. Technical blueprint phase is the stage where the first production deliverable is produced.
II. Build the vision phase must deliver an overall architecture that satisfies the long-term requirements and a
definition of the components that must be implemented in the short term in order to derive any business
benefit.
III. The purpose of the business case is to identify the projected business benefits that should be derived from
using the data warehouse.
(a) Only (I) above
(b) Only (II) above
(c) Only (III) above
(d) Both (I) and (II) above
(e) Both (II) and (III) above.

8. Which of the following is/are not produced in the technical blueprint stage of data warehouse delivery process?
I. Detailed design of database.
II. Essential components of database design.
III. Server and data mart architecture.
IV. Backup and recovery strategy.
(a) Only (I) above
(b) Only (III) above
(c) Both (I) and (IV) above
(d) (I), (III) and (IV) above
(e) (II), (III) and (IV) above.

9. Which Redundant Array of Inexpensive Disks (RAID) levels use byte wise striping of data with parity disk?
(a) Level 1
(b) Level 2
(c) Level 3
(d) Level 4
(e) Level 5.

10.Which of the following statements is/are false about query management process?
I. Query management process is the system process that manages the queries and speeds them up by directing
queries to the most effective data source.
II. Like other system processes, query management process generally operates during the regular load of
information into the data warehouse.
III. Query management process must ensure that no single query can affect the overall system performance.
(a) Only (I) above
(b) Only (II) above
(c) Only (III) above
(d) Both (I) and (II) above
(e) Both (II) and (III) above.

Page 3 of 20
11.Which of the following tasks is performed by system management tool in a warehouse manager?
(a) Job control
(b) Create indexes
(c) Backup and archive
(d) Generate star schemas
(e) Query profile analysis.

12.Which of the following statements is/are true about fact data and dimension data?
I. Fact data represents a physical transaction that has occurred at a point in time and as such is unlikely to
change on an ongoing basis during the life of the data warehouse.
II. In general, dimension data in a star schema or snowflake schema is designed to minimize the cost of
change and is typically very low volume data (i.e, under 5GB).
III. Fact data will have only one foreign key whereas reference data will have one primary key.
(a) Only (I) above
(b) Only (II) above
(c) Only (III) above
(d) Both (I) and (II) above
(e) Both (II) and (III) above.

13.The reason(s) for partitioning the fact table is/are
I. To increase the performance.
II. To assist the management of the data.
III. To assist backup/recovery.
(a) Only (I) above
(b) Only (II) above
(c) Only (III) above
(d) Both (I) and (II) above
(e) All (I), (II) and (III) above.

14.Which of the following statements is/are true about metadata?
I. As a part of extraction and load process, metadata is used to map data sources to the common view of
information within the data warehouse.
II. As a part of the warehouse management process, metadata is used to direct a query to the most appropriate
data source.
III. As a part of query management process, metadata is used to automate the production of summary tables.
(a) Only (I) above
(b) Only (II) above
(c) Only (III) above
(d) Both (I) and (II) above
(e) Both (II) and (III) above.

15.Fact table identification process requires four steps. Arrange the following four steps in correct sequence.
I. Look for elemental transactions.
II. Check if fact is a dimension.
III. Check if dimension is a fact.
IV. Determine key dimensions.
(a) I-II-III-IV
(b) I-IV-II-III
(c) I-III-II-IV
(d) IV-I-III-II
(e) IV-II-III-I.

Page 4 of 20
16.As with any relational system, foreign keys within a fact table can be structured in two ways: using intelligent
keys and using non-intelligent keys. Which of the following statements are true about intelligent and nonintelligent
keys?
I. In intelligent keys, each key represents the unique identifier for the item in the real world.
II. In non-intelligent keys, each unique key is generated automatically and refers to the unique identifier for
the item in the real world.
III. Usage of intelligent key can be costly and time consuming.
IV. Unless it is certain that identifiers will not change, it is safer to use intelligent keys.
(a) Both (I) and (II) above
(b) Both (II) and (III) above
(c) (I), (II) and (III) above
(d) (II), (III) and (IV) above
(e) All (I), (II), (III) and (IV) above.

17.Which of the following statements is/are false about hardware architectures used in data warehouse solutions?
I. A Symmetric Multi-Processing (SMP) machine is a set of loosely coupled CPUs, each of which has its own
memory and disk.
II. A Massively Parallel-Processing (MPP) machine is a set of tightly coupled CPUs, that share memory and
disk.
III. A Non Uniform Memory Architecture (NUMA) machine is basically a tightly coupled cluster of
Symmetric Multi-Processing (SMP) nodes.
(a) Only (III) above
(b) Both (I) and (II) above
(c) Both (I) and (III) above
(d) Both (II) and (III) above
(e) All (I), (II) and (III) above.

18.Which of the following statements is/are false about Online Analytical Processing (OLAP) tools?
I. OLAP tools do not learn.
II. OLAP tools create new knowledge.
III. OLAP tools are more powerful than data mining.
IV. OLAP tools cannot search for new solutions.
(a) Only (I) above
(b) Only (III) above
(c) Both (I) and (II) above
(d) Both (II) and (III) above
(e) (I), (II) and (IV) above.

19.NetSol, a reputed Garments company is maintaining the database of the customers’ office & home phone
numbers. But it is having a problem of wasting space when the customers are using either of them. So which of
the following processes can help the company in eliminating the redundant data?
(a) Analyzing
(b) Normalizing
(c) Structuring
(d) Randomizing
(e) Actualizing.

20.Which of the following backup software packages is produced by ‘Legato’?
(a) OmnibackII
(b) ADSM
(c) Alexandria
(d) Epoch
(e) Networker.

Page 5 of 20
21.In an organization, what is the relationship between DEPARTMENT and EMPLOYEE?
(a) One-to-one relationship
(b) One-to-many relationship
(c) Many-to-many relationship
(d) Many-to-one relationship
(e) Symmetric relationship.

22.Which of the following are the data mining techniques?
I. Association rules.
II. Neural networks.
III. Normalization.
IV. Genetic algorithms.
(a) Both (I) and (II) above
(b) Both (I) and (III) above
(c) (I), (II) and (IV) above
(d) (II), (III) and (IV) above
(e) All (I), (II), (III) and (IV) above.

23.Consider the student table with attributes: sname, snum, totalmarks, semester. Write an SQL statement to display
the name, total marks and student number whose snum is CB14.
(a) select sname, totalmarks, snum from student where snum=‘CB14’;
(b) select sname, totalmarks, snum from student where snum=CB14;
(c) select sname, totalmarks from student where snum=CB14;
(d) select sname, totalmarks from student where snum=‘CB14’;
(e) select sname, totalmarks where snum=‘CB14’;.

24.According to Freud’s theory of psychodynamics, the human brain was described as a
(a) Decision tree
(b) Neural network
(c) Learning
(d) Knowledge
(e) Visualization technique.

25.Which of the following are the stages in Knowledge Discovery Process?
I. Data encapsulation.
II. Data selection.
III. Enrichment.
IV. Reporting.
(a) Both (I) and (II) above
(b) Both (I) and (III) above
(c) (I), (II) and (III) above
(d) (II), (III) and (IV) above
(e) All (I), (II), (III) and (IV) above.

26.In Knowledge Discovery Process, which of the following is a coding operation in which an attribute with
cardinality ‘n’ is replaced by ‘n’ binary attributes?
(a) Flattening
(b) Replication
(c) Redundancy
(d) Falsification
(e) Atomicity.

27.In general, data mining algorithms should not have a complexity higher than
(a) n(log n)
(b) (log n)
(c) (n+1)(log n)
(d) n(log (n+1))
(e) 2n(log n).

Page 6 of 20
28.In data mining, which of the following statements are true about the various types of knowledge?
I. Shallow knowledge is the information that can be analyzed using Online Analytical Processing (OLAP)
tools.
II. Multi-dimensional knowledge is the information that can be easily retrieved from databases using a query
tool such as Structured Query Language (SQL).
III. Hidden knowledge is the information that can be found easily by using pattern recognition or machinelearning
algorithms.
IV. Deep knowledge is the information that is stored in the database but can only be located if we have a clue
that tells us where to look.
(a) Both (I) and (II) above
(b) Both (III) and (IV) above
(c) (I), (II) and (III) above
(d) (II), (III) and (IV) above
(e) All (I), (II), (III) and (IV) above.

29.Which of the following statements is/are true about Genetic algorithms?
I. These algorithms can be viewed as a kind of meta-learning strategy.
II. Any programmer can write the basic structure of this algorithm easily.
III. Solutions found by these algorithms are coded symbolically and it is very hard to read as compared to
neural networks.
(a) Only (I) above
(b) Only (II) above
(c) Both (I) and (II) above
(d) Both (II) and (III) above
(e) All (I), (II) and (III) above.

30.Which of the following statements is/are true about the various forms of neural networks?
I. A perceptron consists of a simple three-layered network with input units called photo-receptors,
intermediate units called associators and output units called responders.
II. A back propagation network not only has input and output nodes, but also a set of intermediate layers with
hidden nodes.
III. A Kohenen self-organizing map is a collection of neurons or units, each of which is connected to a small
number of other units called its neighbors.
(a) Only (I) above
(b) Only (II) above
(c) Both (I) and (II) above
(d) Both (II) and (III) above
(e) All (I), (II) and (III) above.
__________________
Answered By StudyChaCha Member
Reply With Quote Quick reply to this message
Reply


Reply to this Question / Ask Another Question
Your Username: Click here to log in

Message:
Options

Forum Jump


All times are GMT +6.5. The time now is 04:02 PM.


Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2019, vBulletin Solutions Inc.
Search Engine Optimisation provided by DragonByte SEO v2.0.42 (Pro) - vBulletin Mods & Addons Copyright © 2019 DragonByte Technologies Ltd.