ML Theory Questions
ML Theory Questions
ML Theory Questions
Unit 2
1. Define Data Mining. Explain the Knowledge Discovery Process with a neat diagram.
2. Describe the major tasks in data preprocessing.
3. Explain different data normalization techniques.
4. Explain different data transformation strategies.
5. How is Principal component analysis carried out to reduce the data?
Unit 3
1. What is an association rule? What metrics are used to measure the strength of an
association rule?
2. Define two-step process of Association rule mining.
3. Explain the process of generating the Association rules from the frequent item sets
with an example.
4. Explain Hash-based technique to improve efficiency of Apriori Algorithm.
5. Explain mining frequent item-sets using Vertical Data Format
6. Explain Correlation analysis using lift with an example.
7. Explain the following pattern evaluation measures - all confidence, max confidence,
Kulczynski(Kulc) , and cosine measures.
Unit 4
1. What is the general approach to classification?
2. What are the three popular attribute selection measures?
3. Explain K-nearest Neighbour classification.
4. Explain Bagging approach to improve classification accuracy for class-imbalanced
data
5. Explain different classifier evaluation measures with examples
6. Explain cross validation and bootstrap accuracy estimates
7. Explain the following ensemble methods:
i. Bagging ii. Random forests
Unit 5
1. What are the desired features of Cluster Analysis?
2. What is a cluster? Explain different clustering methods
3. Explain k-means method of partitioning with pseudo code.
4. Explain different types of outliers with examples
5. Explain agglomerative and divisive hierarchical clustering approaches using an
example
6. Explain Density based clustering with an example
7. Illustrate DBSCAN approach to clustering