Data Mining
Ans- b
2. You are given data about seismic activity in Japan, and you want to predict the magnitude of the next
earthquake. This is an example of
(c) Serration
Ans- a
Ans- d
4. ____ is a comparison of the general features of the target class data objects against the general
features of objects from one or multiple contrasting classes
(d) Data selection
Ans- c
5. A Bayesian classifier is
(a) a class of learning algorithm that tries to find an optimum classification of a set of examples using
the probabilistic theory
(b) any mechanism employed by a learning system to constrain the search space of a hypothesis
(c) an approach to the design of learning algorithms that is inspired by the fact that when people
encounter new situations, they often explain them by reference to familiar experiences, adapting
the explanations to fit the new situation
Ans- a
(a) Data
(b) Information
(c) Query
Ans-d
7. A cluster is
(a) Group of similar objects that differ significantly from other objects
(b) Operations on a database to transform or simplify data in order to prepare it for a machine
learning algorithm
(c) Symbolic representation of facts or ideas from which information can potentially be extracted
Ans- a
(a) Additional acquaintance used by a learning algorithm to facilitate the learning process
Ans- a
9. Case-based learning is
(a) A class of learning algorithm that tries to find an optimum classification of a set of examples using
the probabilistic theory
(b) Any mechanism employed by a learning system to constrain the search space of a hypothesis
(c) An approach to the design of learning algorithms that is inspired by the fact that when people
encounter new situations, they often explain them by reference to familiar experiences,
adapting the explanations to fit the new situation
Ans- c
10. Some telecommunication companies want to segment their customers into distinct groups in
order to send appropriate subscription offers. This is an example of
(c) Serration
Ans- d
2017
11. An ……………… system is market-oriented and is used for data analysis by knowledge workers, including
managers, executives, and analysts.
(a) OLAP
(b) OLTP
(c) Both of the above
(d) None of the above
Ans- a
Ans- d
(c) Operational
(d) Informational
Ans- c
(a) Metadata
(b) Microdata
(c) Minidata
(d) Multidata
Ans- a
Ans- d
(a) Regression
(b) Logistic
(c) Probability
(d) Neural
Ans- b
(a) One
(b) Two
(c) Three
(d) Five
Ans- b
(a) Partitioning
(b) Hierarchical
Ans- c
Ans- a
Ans- b
2016
21. An operational system is which of the following?
(a) A system that is used to run the business in real time and is based on historical data
(b) A system that is used to run the business in real time and is based on current data
(c) A system that is used to support decision making and is based on current data
(d) A system that is used to support decision making and is based on historical data
Ans- b
22. A star schema has what type of relationship between a dimension and fact table?
Ans- c
Ans- c
Ans- c
25. The ……. operation performs a selection on one dimension of the given cube, resulting in a subcube.
(a) pivot
(b) slice
(c) roll-up
Ans- b
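The slice operation can be illustrated with a minimal sketch. The cube contents, the dimension order (time, item, location), and the helper name slice_cube are assumptions made up for illustration; they are not part of the question.

```python
# A tiny 3-D cube keyed by (time, item, location); all values are invented.
cube = {
    ("Q1", "phone", "Delhi"): 10,
    ("Q1", "laptop", "Delhi"): 4,
    ("Q2", "phone", "Delhi"): 12,
    ("Q2", "phone", "Mumbai"): 7,
}

def slice_cube(cube, axis, value):
    """Fix one value on one dimension and drop that dimension (hypothetical helper)."""
    return {
        key[:axis] + key[axis + 1:]: measure
        for key, measure in cube.items()
        if key[axis] == value
    }

# Selecting time = "Q1" leaves a 2-D subcube over (item, location).
q1 = slice_cube(cube, axis=0, value="Q1")
print(q1)  # {('phone', 'Delhi'): 10, ('laptop', 'Delhi'): 4}
```

A selection on two or more dimensions would instead be a dice, which this sketch does not cover.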
26. The process of partitioning the ranges of quantitative attributes into intervals is called
(a) Splitting
(b) grouping
(c) binning
Ans- c
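As a rough illustration of binning, the sketch below uses equal-width intervals, which is only one of several possible binning schemes. The function name equal_width_bins and the sample values are assumptions, not taken from the question.

```python
def equal_width_bins(values, k):
    """Assign each value to one of k equal-width intervals (hypothetical helper).

    Returns a list of bin indices in the range 0..k-1.
    """
    lo, hi = min(values), max(values)
    width = (hi - lo) / k
    if width == 0:
        # All values are identical, so everything falls into the first bin.
        return [0] * len(values)
    # The maximum value would land at index k, so clamp it into the last bin.
    return [min(int((v - lo) / width), k - 1) for v in values]

ages = [4, 8, 15, 21, 24, 34]
print(equal_width_bins(ages, 3))  # [0, 0, 1, 1, 2, 2]
```

Here the range 4 to 34 is split into three intervals of width 10, so each quantitative value is replaced by the index of the interval it falls in.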
Ans- d
Ans- d
(c) clustering
Ans- b
2015
31. Data mining is also referred to as
Ans- a
(a) table
(b) database
(c) metadata
(d) integration
Ans- c
33. To represent any n-dimensional data, we need a series of ………. dimensional cubes.
(a) (n-1)
(b) n
(c) n+1
(d) n+2
Ans- a
34. The ……... operation performs a selection on one dimension of the given cube, resulting in a subcube.
(a) pivot
(b) slice
(c) roll-up
(d) drill-down
Ans- b
35. ________ servers support multidimensional views of data through array-based multidimensional storage
engines.
(a) ROLAP
(b) MOLAP
(d) database
Ans- b
36. The ______ software gives the user the opportunity to look at the data from a variety of different
dimensions.
Ans- b
37. _____ techniques can be used to reduce the number of values for a given continuous attribute by
dividing the range of the attribute into intervals.
(a) Discretization
(b) Transformation
(c) Smoothing
(d) Generalization
Ans- a
Ans- a
39. Consider a scenario where a bin contains the values 4, 8 and 15. If the smoothing by bin means method is
applied to clean the data, then each of the original values in the bin will be replaced by
(a) 8
(b) 9
(c) 15
(d) 4
Ans- b
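The arithmetic behind this answer can be checked with a short sketch: smoothing by bin means replaces every value in a bin with the bin's mean, here (4 + 8 + 15) / 3 = 9. The function name smooth_by_bin_means is an assumption chosen for illustration.

```python
def smooth_by_bin_means(bin_values):
    """Replace every value in a bin with the bin's arithmetic mean (hypothetical helper)."""
    mean = sum(bin_values) / len(bin_values)
    return [mean] * len(bin_values)

smoothed = smooth_by_bin_means([4, 8, 15])
print(smoothed)  # [9.0, 9.0, 9.0]
```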
40. ____ are simple text files that are automatically generated every time someone accesses a web site.
Ans- b