Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

ML Theory Questions

Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 2

Unit 1

1. Explain the steps involved in developing a machine learning application.


2. Explain different data smoothing techniques.
3. What is an Attribute? Explain different types of attributes with examples.
4. In real-world data, tuples with missing values for some attributes are a common
occurrence. Describe various methods for handling this problem.
5. Explain pixel-oriented visualization technique with an example.
6. Explain with an example, Geometric Projection Visualization Techniques.
7. Briefly outline how to compute the dissimilarity between objects described by the
following: (a) Nominal attributes (b) Asymmetric binary attributes (c) Numeric
attributes
8. Differentiate between Data Matrix and Dissimilarity Matrix.

Unit 2
1. Define Data Mining. Explain the Knowledge Discovery Process with a neat diagram.
2. Describe the major tasks in data preprocessing.
3. Explain different data normalization techniques.
4. Explain different data transformation strategies.
5. How is Principal component analysis carried out to reduce the data?

Unit 3
1. What is an association rule? What metrics are used to measure the strength of an
association rule?
2. Define two-step process of Association rule mining.
3. Explain the process of generating the Association rules from the frequent item sets
with an example.
4. Explain Hash-based technique to improve efficiency of Apriori Algorithm.
5. Explain mining frequent item-sets using Vertical Data Format
6. Explain Correlation analysis using lift with an example.
7. Explain the following pattern evaluation measures - all confidence, max confidence,
Kulczynski(Kulc) , and cosine measures.
Unit 4
1. What is the general approach to classification?
2. What are the three popular attribute selection measures?
3. Explain K-nearest Neighbour classification.
4. Explain Bagging approach to improve classification accuracy for class-imbalanced
data
5. Explain different classifier evaluation measures with examples
6. Explain cross validation and bootstrap accuracy estimates
7. Explain the following ensemble methods:
i. Bagging ii. Random forests
Unit 5
1. What are the desired features of Cluster Analysis?
2. What is a cluster? Explain different clustering methods
3. Explain k-means method of partitioning with pseudo code.
4. Explain different types of outliers with examples
5. Explain agglomerative and divisive hierarchical clustering approaches using an
example
6. Explain Density based clustering with an example
7. Illustrate DBSCAN approach to clustering

You might also like