Data Analytics - Unit-1,2,3, & 4 questions - Assignment

The document outlines a comprehensive curriculum on data analytics, machine learning, data mining, and social media analytics, featuring long answer questions that cover definitions, frameworks, algorithms, and evaluation metrics. It addresses various concepts such as data science, machine learning types, data mining techniques, and text analytics, along with their advantages, disadvantages, and applications. Each unit includes detailed inquiries into specific topics, encouraging a deep understanding of the field.

Uploaded by prasad_mah

Unit 1 Introduction to Data Analytics

Long Answer Questions

1. Define data science. What is its purpose? Explain in detail.

2. What is data analytics? List its different roles. Also state its advantages and disadvantages.

3. With the help of a diagram, describe the lifecycle of data analytics.

4. Explain the four layers of the data analytics framework diagrammatically.

5. Differentiate between data analysis and data analytics.

6. What are the types of data analytics? Describe two of them in detail.

7. What is prescriptive analytics? Explain in detail.

8. What is exploratory analytics? What is its purpose? Explain with an example.

9. Write a short note on: Mechanistic analytics.

10. What is a mathematical model? List its types. Explain two of them in detail.

11. What is a linear and nonlinear model? Compare them.

12. What is a baseline model? Describe two of them in detail.

13. How is a model evaluated? Describe in detail.

14. Write a short note on: Metrics for evaluating classifiers.

15. What is a confusion matrix? How is it used in data analytics? Explain diagrammatically.

16. Define accuracy, precision, recall and F-score.

17. What is the ROC curve? How is it implemented? Explain with an example.

18. What is class imbalance? Describe in detail.

19. Write a short note on: Evaluating value prediction models.

Unit 2 Machine Learning Overview

Long Answer Questions

1. What is machine learning? State its advantages and disadvantages. Also list its various applications.

2. What is deep learning? How does it work? Explain diagrammatically.

3. What is AI? What is its purpose? State its advantages and disadvantages.

4. With the help of a diagram, describe the relationship between AI, ML and DL.

5. List the applications of machine learning in data science.

6. With the help of a suitable diagram, describe a machine learning model.

7. How is a model trained and validated? Describe in detail.

8. What are the types of machine learning? Compare them.

9. What is supervised learning? How does it work? State its advantages and disadvantages.

10. What is k-NN? How does it work? Explain diagrammatically. Also state its advantages and disadvantages.

11. What is a decision tree? How does it work? State its advantages and disadvantages.

12. Explain Support Vector Machine (SVM) with the help of a diagram.

13. Write a short note on: Naïve Bayes.

14. Describe unsupervised learning with a diagram, and state its advantages and disadvantages. With the help of an example, explain the k-means clustering algorithm.

15. What is association rule mining? Describe with an example.

16. Explain polynomial regression diagrammatically.

17. What is semi-supervised machine learning? With the help of a diagram, explain its basic idea. Also state its advantages and disadvantages.

18. What is a regression model? Explain linear regression with a diagram.

19. Define logistic regression and state its assumptions.

20. Write a short note on: Ensemble techniques.

21. With the help of an example, explain the concept of classification. Also list various classification techniques.

22. What is a random forest? Describe diagrammatically.

23. What is clustering? How does it work? Explain with an example.

24. Describe various clustering techniques. Describe two of them in short.

25. What is reinforcement learning? Explain diagrammatically. Also state its advantages and disadvantages.

26. Differentiate between supervised, unsupervised, semi-supervised and reinforcement machine learning.

Unit 3 Mining Frequent Patterns, Associations and Correlations

Long Answer Questions

1. What is data mining? Explain with a diagram. Also state its advantages and disadvantages.
2. Explain the use of Market Basket Analysis with an example.
3. Explain the Apriori algorithm in detail.
4. What are frequent item sets, closed item sets, and association rules? Describe in detail.
5. What is outlier analysis? Describe in detail.
6. What kinds of patterns can be mined? Explain in detail.
7. How are the following mined?
(i) Frequent Patterns.
(ii) Associations.
(iii) Correlations.
8. What are different types of data? Explain in detail with appropriate examples.
9. What are different sources of data in data science? Describe in detail.
10. Explain different data formats in brief.
11. What is meant by the following? Explain with reference to mining frequent patterns:
(i) Predictive analysis.
(ii) Cluster analysis.
(iii) Outlier analysis.
12. What is structured and unstructured data? Distinguish between them.
13. How to generate association rules from frequent item sets? Explain in detail.
14. With the help of an example, describe the FP-growth algorithm. Also state its advantages and disadvantages.
15. Which techniques are used for improving the efficiency of the Apriori algorithm? Describe two of them in detail.
16. Write a short note on: Class/concept description.
17. A database has five transactions. Let min_sup = 60% and min_conf = 80%.

TID     Items Bought
T100    {M, O, N, K, E, Y}
T200    {D, O, N, K, E, Y}
T300    {C, A, K, E}
T400    {D, U, C, K, Y}
T500    {C, O, O, K, I, E}

Find all frequent item sets using Apriori and FP-growth, respectively. Compare the efficiency of the two mining processes.

18. A database has six transactions. Let min_sup = 50% and min_conf = 75%. Find all frequent item sets using the Apriori algorithm. List all the strong association rules.

TID     List of Items
001     Pencil, Sharpener, Eraser, Chart papers, Sketch pen
002     Chart papers, Charts, Glue sticks
003     Pencil, Glue stick, Eraser, Pen, Sketch pen
004     Oil pastels, Poster colors, Correction tape
005     Whitener, Pen, Pencil, Charts, Glue stick
006     Color pencils, Crayons, Eraser, Pen

19. Consider the following transaction database:

transaction1 = (product1, product2, product7)

transaction2 = (product4, product5, product7)

transaction3 = (product6, product7, product8, product9)

transaction4 = (product1, product3, product4, product6, product7)

Find the frequent product sets using the FP-growth algorithm with a minimum support of 50%.
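
For checking hand-worked answers to the frequent item set problems above, a level-wise search in the style of Apriori can be sketched as follows. This is only a minimal illustration, not a reference solution: the function name `apriori`, the `min_count` parameter, and the encoding of each transaction as a string of single-letter items are assumptions made here. The data is the five-transaction database from question 17, with the 60% minimum support taken as an absolute count of 3 out of 5.

```python
from itertools import combinations


def apriori(transactions, min_count):
    """Return {itemset: support count} for every frequent item set.

    `transactions` is an iterable of item collections; `min_count` is the
    minimum number of transactions an item set must appear in.
    (Both names are illustrative assumptions, not from the assignment.)
    """
    transactions = [frozenset(t) for t in transactions]

    def support(candidate):
        # Number of transactions that contain every item of the candidate.
        return sum(1 for t in transactions if candidate <= t)

    # Level 1: frequent single items.
    items = {item for t in transactions for item in t}
    current = {}
    for item in items:
        single = frozenset([item])
        count = support(single)
        if count >= min_count:
            current[single] = count

    frequent = {}
    k = 1
    while current:
        frequent.update(current)
        k += 1
        previous = list(current)
        # Join step: merge frequent (k-1)-item sets into k-item candidates.
        candidates = {a | b for a in previous for b in previous if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(sub) in current for sub in combinations(c, k - 1))
        }
        # Count step: keep only candidates that meet the minimum support.
        counted = {c: support(c) for c in candidates}
        current = {c: n for c, n in counted.items() if n >= min_count}
    return frequent


# Question 17 data; each letter is one item (duplicates in T500 collapse).
transactions = ["MONKEY", "DONKEY", "CAKE", "DUCKY", "COOKIE"]
frequent_sets = apriori(transactions, min_count=3)

for itemset, count in sorted(
    frequent_sets.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))
):
    print(sorted(itemset), count)
```

The same function could be applied to the other databases by replacing `transactions` and `min_count`. FP-growth should yield the same frequent item sets while avoiding explicit candidate generation, which is the basis for the efficiency comparison the question asks for.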

Unit 4 Social Media and Text Analytics

Long Answer Questions

1. What is social media? State its advantages and disadvantages.

2. What is text analytics? What is its purpose? Also state its tasks.

3. What is social media analytics? What is its purpose? List its benefits.

4. Explain the process of social media analytics diagrammatically.

5. Describe the layers of social media analytics with the help of a diagram.

6. What is a social network? List any four examples of it. Explain two of them in short.

7. What is social media data? List its types. Also explain in detail how to access social media data.

8. Define social network analysis and describe it in detail.

9. With the help of a suitable diagram, describe the life cycle of social media analytics.

10. What is link prediction? Explain with an example.

11. What is community detection? What are its different methods? Explain four of them in short.

12. What is influence maximization? Explain its framework diagrammatically.

13. What is expert finding? How is an expert found? Describe with an example.

14. Write a short note on: Prediction of trust.

15. Explain the term distrust among individuals in detail.

16. What is NLP? What is its purpose? Describe its phases with the help of a diagram.

17. What is text analytics? Explain in detail.

18. What is tokenization? How is it used in text analytics?

19. What is a bag of words? How is it used in NLP? Explain in detail.

20. What is Word Weighting (TF-IDF)? Describe in detail.

21. Explain n-grams with an example.

22. What is stemming and lemmatization? How do they differ from each other?

23. Describe the term synonyms with respect to NLP.

24. Write a short note on: Parts of speech tagging.

25. What is sentiment analysis? Explain its classification.

26. What is text analytics? Explain its steps diagrammatically. Also state its advantages, disadvantages and applications.

27. What is text categorization? Describe diagrammatically. Also list its approaches.

28. What is text summarization? Explain its two types in detail.

29. What is trend analytics? Describe its methods in detail.

30. Write a short note on: Challenges to social media analytics.
