A H192009 Pages: 3: Answer All Questions, Each Carries 4 Marks

Uploaded by

This document is a 3 page exam for a Data Mining and Warehousing course. It contains questions that assess understanding of key concepts in data warehousing, data preprocessing, decision trees, association rule mining, clustering, and neural networks. Students are asked to define and differentiate terms, explain algorithms and procedures, draw schemas, and show calculations involving techniques like normalization, smoothing, attribute selection, pruning, and backpropagation.

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

A H192009 Pages: 3: Answer All Questions, Each Carries 4 Marks

Uploaded by

Srinivas R Pai

0% found this document useful (0 votes)

40 views3 pages

Original Title

04. PYQP - CS402-QP - OCT19

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Download as pdf or txt

0% found this document useful (0 votes)

40 views3 pages

A H192009 Pages: 3: Answer All Questions, Each Carries 4 Marks

Uploaded by

Srinivas R Pai

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Download as pdf or txt

Jump to Page

You are on page 1of 3

Search inside document

A H192009 Pages: 3

Reg No.:_ Name:____________

APJ ABDUL KALAM TECHNOLOGICAL UNIVERSITY
EIGHTH SEMESTER B.TECH DEGREE EXAMINATION(S), OCTOBER 2019
Course Code: CS402
Course Name: DATA MINING AND WAREHOUSING
Max. Marks: 100 Duration: 3 Hours

PART A
Answer all questions, each carries 4 marks. Marks

1 How is data warehouse different from a database? How are they similar? (4)
2 Compare star and snowflake schema dimension table. (4)
3 Use the two methods below to normalize the following group of data: (4)
100,200,300,500,900
i) min-max normalization by setting min=0 and max=1
ii) z-score normalization
4 Explain the attribute selection method in decision trees . (4)
5 Distinguish between hold out method and cross validation method. (4)
6 Explain prepruning and postpruning approaches in decision tree algorithm. (4)
7 Differentiate between support and confidence. (4)
8 How to compute the dissimilarity between objects described by binary variables? (4)
9 Differentiate between Agglomerative and Divisive hierarchical clustering (4)
method.
10 Explain web content mining? (4)
PART B
Answer any two full questions, each carries 9 marks.
11 The following data is given in increasing order for the attribute age:
13,15,16,16,19,20,20,21,22,22,25,25,25,25,30,33,33,35,35,35,36,40,45,46,52,70.
a) Use smoothing by bin boundaries to smooth these data, using bin depth of 3. (3)
b) How might you determine outliers in the data? (3)
c) What other methods are there for data smoothing? (3)
12) Explain the following procedures for attribute subset selection
a) Stepwise forward selection (3)
b) Stepwise backward elimination (3)
c) A combination of forward selection and backward elimination (3)

Page 1of 3
A H192009 Pages: 3

13 a) Suppose a datawarehouse consists of three measures customer, account and (4)

branch and two measures count (number of customers in the branch) and
balance. Draw the schema diagram using snowflake schema.
b) Real-world data tend to be incomplete, noisy, and inconsistent. What are the (5)
various approaches adopted to clean the data?
PART C
Answer any two full questions, each carries 9 marks.
14 Given the following data on a certain set of patients seen by a doctor, can the (9)
doctor conclude that a person having chills, fever, mild headache and without
running nose has the flu?(Use Naive Bayes algorithm for prediction)

15 The following figure shows a multilayer feed-forward neural network. Let the (9)
learning rate be 0.9. The initial weight and bias values of the network is given in
the table below.The activation function used is the sigmoid function.

x1 x2 x3 w14 w15 w24 w25 w34 w35 w46 w56 θ4 θ5 θ6

1 0 1 0.2 -0.3 0.4 0.1 -0.5 0.2 -0.3 -0.2 -0.4 0.2 0.1

Page 2of 3
A H192009 Pages: 3

Show weight and bias updation with the first training sample (1,0,1) with class
label 1, using backpropagation algorithm

16 a) Explain classification by C4.5 algorithm. (6)

b) What is meant by Maximum Marginal Hyperplane (MMH)? (3)
PART D
Answer any two full questions, each carries 12 marks.
17 Consider the transaction database given below. Set minimum support count as 2
and minimum confidence threshold as 70%
Transaction ID List of Item_Ids
T100 I1,I2,I5
T200 I2,I4
T300 I2,I3
T400 I1,I2,I4
T500 I1,I3
T600 I2,I3
T700 I1,I3
T800 I1,I2,I3,I5
T900 I1,I2,I3
a) Find the frequent itemset using Apriori Algorithm. (8)
b) Generate strong association rules . (4)
18 a) Explain DBSCAN algorithm . (8)
b) State the pros and cons of DBSCAN method. (4)
19 a) Explain clustering by k-medoid algorithm. (6)
b) Explain Apriori based frequent subgraph mining. (6)
****

Page 3of 3

CS8391-Data Structures-Anna University Question Papers
Document8 pages
CS8391-Data Structures-Anna University Question Papers
bhuvangates
75% (4)
School of Graduate Studies KNUST
Document32 pages
School of Graduate Studies KNUST
Kwajo Asante
100% (5)
B.Tech Degree S8 (S, FE) / S6 (PT) (S, FE) Examination June 2023 (2015 Scheme)
Document4 pages
B.Tech Degree S8 (S, FE) / S6 (PT) (S, FE) Examination June 2023 (2015 Scheme)
Venkitaraj K P
No ratings yet
2020-09-22SupplementaryCS467CS467-E - Ktu Qbank
Document3 pages
2020-09-22SupplementaryCS467CS467-E - Ktu Qbank
sabitha s
No ratings yet
Machine Learning PYQ 2023
Document8 pages
Machine Learning PYQ 2023
nitob90303
No ratings yet
CS467 A
Document3 pages
CS467 A
E3 Tech
No ratings yet
DWM (W2022)
Document2 pages
DWM (W2022)
Samay Patel
No ratings yet
Answer All Questions, Each Carries 4 Marks
Document3 pages
Answer All Questions, Each Carries 4 Marks
Karthika
No ratings yet
101905CS502H - Neural Networks and Deep Learning - Model Question Paper
Document4 pages
101905CS502H - Neural Networks and Deep Learning - Model Question Paper
R Kumar
No ratings yet
Be Summer 2022
Document2 pages
Be Summer 2022
Rahul Meghani
No ratings yet
Gujarat Technological University
Document2 pages
Gujarat Technological University
Breeje Anadkat
No ratings yet
883 Question Paper
Document2 pages
883 Question Paper
Saurabh Bodke
No ratings yet
CS467 Machine Learning, January 2023
Document3 pages
CS467 Machine Learning, January 2023
ഓൺലൈൻ ആങ്ങള
No ratings yet
Cst201 Data Structures, December 2021
Document2 pages
Cst201 Data Structures, December 2021
SHAHEEM TK
No ratings yet
Cst201 Data Structures, December 2021
Document2 pages
Cst201 Data Structures, December 2021
Arathy
No ratings yet
CST201 Data Structures, December 2021
Document2 pages
CST201 Data Structures, December 2021
Anas Ansar
No ratings yet
AMT305 INTRODUCTION TO MACHINE LEARNING, Pyq2
Document3 pages
AMT305 INTRODUCTION TO MACHINE LEARNING, Pyq2
romepop923
No ratings yet
CST466 DATA MINING, OCTOBER 2023.pdf - Crdownload
Document3 pages
CST466 DATA MINING, OCTOBER 2023.pdf - Crdownload
20b739
No ratings yet
Page 1 of 2
Document4 pages
Page 1 of 2
Vicky Ratnakar
No ratings yet
Gujarat Technological University
Document2 pages
Gujarat Technological University
feyayel990
No ratings yet
Thapar University, Patiala
Document2 pages
Thapar University, Patiala
Mridul Mahindra
No ratings yet
University QP Nov-Dec 2021
Document3 pages
University QP Nov-Dec 2021
27 Jagadesh.K
No ratings yet
2022 Dec. ITT401-A
Document2 pages
2022 Dec. ITT401-A
gracemann365
No ratings yet
Data Mining Merged
Document10 pages
Data Mining Merged
Rishi Bathija
No ratings yet
Dcs 7302
Document17 pages
Dcs 7302
K. Malathi Staff,COMPUTER SCIENCE AND ENGINEERING
No ratings yet
Dmbi
Document3 pages
Dmbi
Tarishi Talwaria
No ratings yet
CS402 Data Mining and Warehousing Question Bank
Document6 pages
CS402 Data Mining and Warehousing Question Bank
Junaid M Faisal
No ratings yet
Data Structures rcs305 2020
Document2 pages
Data Structures rcs305 2020
Shivanshu Kumar Upadhyay
No ratings yet
DWDM 19
Document2 pages
DWDM 19
cdukgjchd
No ratings yet
Gujarat Technological University
Document1 page
Gujarat Technological University
Vaishnavi Pansaniya
No ratings yet
Jntuworld: R07 Set No. 2
Document7 pages
Jntuworld: R07 Set No. 2
Bhargavramudu Jajjara
No ratings yet
MC5032 - DMDW
Document3 pages
MC5032 - DMDW
Msec Mca
No ratings yet
BCN1043 Computer Arc & Org S1 0119
Document6 pages
BCN1043 Computer Arc & Org S1 0119
m-868020
No ratings yet
Data Warehousing and Data Mining
Document4 pages
Data Warehousing and Data Mining
Ramesh Yadav
No ratings yet
Gujarat Technological University
Document3 pages
Gujarat Technological University
cokoka3983
No ratings yet
Model Cs 8 PDF
Document17 pages
Model Cs 8 PDF
Poomani Punitha
No ratings yet
BVM IP 2324 3papers
Document20 pages
BVM IP 2324 3papers
kousalya.kumar
No ratings yet
Bcacac 385
Document6 pages
Bcacac 385
blackhatgamingyt133
No ratings yet
2013 Main
Document1 page
2013 Main
Anjali Reddy
No ratings yet
Gujarat Technological University
Document2 pages
Gujarat Technological University
Shiv Patel
No ratings yet
ML Question
Document2 pages
ML Question
Dr. Jayanthi V.S.
No ratings yet
Ce 317
Document5 pages
Ce 317
all work
No ratings yet
Part-A: (Answer Any Two Questions)
Document10 pages
Part-A: (Answer Any Two Questions)
Sabit Islam Bhuiya
No ratings yet
Adobe Scan 30-May-2023
Document7 pages
Adobe Scan 30-May-2023
Manav Verma
No ratings yet
r05321204 Data Warehousing and Data Mining
Document5 pages
r05321204 Data Warehousing and Data Mining
SRINIVASA RAO GANTA
No ratings yet
Gujarat Technological University
Document2 pages
Gujarat Technological University
nikita gohel
No ratings yet
SCT 3160619 Nov-2021
Document2 pages
SCT 3160619 Nov-2021
Harsh Darji
No ratings yet
Design and Analysis of Alogarithm - Regular CW
Document4 pages
Design and Analysis of Alogarithm - Regular CW
owenfraser256
No ratings yet
CST201 DATA STRUCTURES, December 2020
Document2 pages
CST201 DATA STRUCTURES, December 2020
Anas Ansar
No ratings yet
III Yr B.Tech. - Computer Science & Engineering/Information Technology Data Mining
Document2 pages
III Yr B.Tech. - Computer Science & Engineering/Information Technology Data Mining
gamerzworld6200
No ratings yet
2022 Dsa
Document4 pages
2022 Dsa
Siddhi Pandya
No ratings yet
EC206 CO Modelqn2 Ktustudents - in
Document3 pages
EC206 CO Modelqn2 Ktustudents - in
gpuonline
No ratings yet
Gujarat Technological University
Document5 pages
Gujarat Technological University
patel
No ratings yet
Ce 317
Document4 pages
Ce 317
all work
No ratings yet
126VW122019
Document2 pages
126VW122019
Abhishek yadav
No ratings yet
Nepal College of Information Technology Assessment
Document1 page
Nepal College of Information Technology Assessment
Siddhant Pakhrin
No ratings yet
CNS 2101 - Data Structures and Algorithms - July 2023
Document4 pages
CNS 2101 - Data Structures and Algorithms - July 2023
lisa.sayi
No ratings yet
Machine Learning (Csen 3233)
Document4 pages
Machine Learning (Csen 3233)
cmmaity2017
No ratings yet
CS204 - Operating Systems (S) Dec 2019 - Ktu Qbank
Document3 pages
CS204 - Operating Systems (S) Dec 2019 - Ktu Qbank
Jessel Cherian
No ratings yet
Introduction to Modeling Cognitive Processes
From Everand
Introduction to Modeling Cognitive Processes
Tom Verguts
No ratings yet
Machine Learning in the AWS Cloud: Add Intelligence to Applications with Amazon SageMaker and Amazon Rekognition
From Everand
Machine Learning in the AWS Cloud: Add Intelligence to Applications with Amazon SageMaker and Amazon Rekognition
Abhishek Mishra
No ratings yet
Datasheet MRS Series
Document6 pages
Datasheet MRS Series
Santiago Ospina
No ratings yet
DELL SERVIÇOS - Prodeploy Enterprise Suite Customer
Document37 pages
DELL SERVIÇOS - Prodeploy Enterprise Suite Customer
Paulo Victor Silva
No ratings yet
Wordassociate2019 Studentstudyguide PDF
Document47 pages
Wordassociate2019 Studentstudyguide PDF
rihamo kawaii
No ratings yet
PID Control With Fuzzy Compensation For Hydroelectric Generating Unit - For Thanh
Document5 pages
PID Control With Fuzzy Compensation For Hydroelectric Generating Unit - For Thanh
Lê Trung Dũng
No ratings yet
How To Ram Clear Equinox DFDC V3
Document11 pages
How To Ram Clear Equinox DFDC V3
ray.renales
No ratings yet
National Annex To Eurocode
Document20 pages
National Annex To Eurocode
Yasela
No ratings yet
ADS Tutorial PDF
Document246 pages
ADS Tutorial PDF
Pavan T
100% (1)
DX Diag
Document39 pages
DX Diag
Julio Melgaço
No ratings yet
Component Based Technology Unit 1
Document13 pages
Component Based Technology Unit 1
Krish Nan
No ratings yet
OpenText Directory Services 16.4.2 Release Notes
Document34 pages
OpenText Directory Services 16.4.2 Release Notes
vijayks
No ratings yet
IT207 Network Essentials Project Frame Relay
Document50 pages
IT207 Network Essentials Project Frame Relay
Muhayar
No ratings yet
3 Essential Excel Skills For The Data Analyst - YouTube
Document4 pages
3 Essential Excel Skills For The Data Analyst - YouTube
Slavica Zivkovic
No ratings yet
Type DHM9B (Digital) Load Cell: Short Description
Document2 pages
Type DHM9B (Digital) Load Cell: Short Description
Pravin Nirukhe
No ratings yet
HP DeskJet F2280
Document225 pages
HP DeskJet F2280
polovne
No ratings yet
Study, Cura Procedure and 3 D Printing Excercise
Document3 pages
Study, Cura Procedure and 3 D Printing Excercise
kingsurya6091022
No ratings yet
SM14 Det
Document12 pages
SM14 Det
Sanjay Raj
No ratings yet
UAT Turn Over Memo - Infocast
Document2 pages
UAT Turn Over Memo - Infocast
Glutton Arch
No ratings yet
Enterprise System by SZ Khan Maneri
Document13 pages
Enterprise System by SZ Khan Maneri
SarzaminKhan
No ratings yet
Receiver: Service Manual
Document49 pages
Receiver: Service Manual
heladiomontesdeoca
No ratings yet
Humareader HS: Microtiter Plate Reader
Document2 pages
Humareader HS: Microtiter Plate Reader
walter Neves
No ratings yet
Ketan Autoclave Catalog
Document12 pages
Ketan Autoclave Catalog
naina ka madhav
No ratings yet
AMD Socket A VIA KT600 + VT8237 ATX Motherboard: Declaration of Conformity
Document14 pages
AMD Socket A VIA KT600 + VT8237 ATX Motherboard: Declaration of Conformity
Maja Čović
No ratings yet
dm00629855 Getting Started With Projects Based On Dualcore stm32h7 Microcontrollers in Stm32cubeide Stmicroelectronics
Document28 pages
dm00629855 Getting Started With Projects Based On Dualcore stm32h7 Microcontrollers in Stm32cubeide Stmicroelectronics
Juan C Gamboa
No ratings yet
NREB Registered EIA Consultant - 31oct2021
Document11 pages
NREB Registered EIA Consultant - 31oct2021
carol
No ratings yet
Checking The IHO S-52 Presentation Library Edition Number in The ECDIS
Document4 pages
Checking The IHO S-52 Presentation Library Edition Number in The ECDIS
gongax
No ratings yet
Waterproof - Getting Started
Document11 pages
Waterproof - Getting Started
Antonio Aviles
No ratings yet
Recycling Passport - Dismantling Instructions: Babylog8000
Document10 pages
Recycling Passport - Dismantling Instructions: Babylog8000
Jesus Duno
No ratings yet
Performance Based Rubrics
Document4 pages
Performance Based Rubrics
Juanito Bonito
100% (1)
Mansi Resume
Document1 page
Mansi Resume
Abhinav Srivastava
No ratings yet