Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

IS328 Tutorial Session 9-Project Proposal

Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 3

IS328 Data Mining

Semester 2, 2020

Week 10: Project Proposal


Tutorial/Lab Session – 9

This tutorial covers project proposal which needs to be completed in order to continue to
work on the Assignment 2- Project report.

Overview and Assessment Guidelines:


The goals of this tutorial/Assignment 2 are
 To apply the concepts learnt in the chosen scenario using appropriate data mining
tools.
 As a team of 2 members, propose a project and implement it as part of Assignment 2
 For project ideas and data sources, you may visit
www.kdnuggets,com
www.kaggle.com
 Make a proposal in the given format and submit it in the respective drop box.
 This tutorial must be completed by the same team of two members worked on
Assignment 1.
 This tutorial will be assessed out of 10 marks and carries over all weightage of 10%
grade towards all the graded tutorials. (Tutorial 5 and Tutorial 9)
 Please note that instead of Week 12 tutorial , Week 10 – Tutorial 9 will be assessed

Project Proposal (10 Marks)


Format: Please follow the below format to draft and submit your project proposal. Also,
content given within < > is only the guideline and it must be replaced appropriately
including the <>.

<Title of the Project Chosen>


<Member Names and IDs >
<The Title and Member information must be specified on a separate page>

Abstract

< The abstract conveys the most important messages regarding your project, such as: what
you set out to do? How did you do it? What results were obtained? Where it can be applied?
However, for your project proposal, just specify “What you set out to do?” in one paragraph
and the remaining part you can complete in the project report.>

Introduction
Course Coordinator: Dr. Vani Vasudevan 1
<Provide a brief overview of data mining. Describe what your proposal is about and the
organization of the rest of the proposal. Include whether you will be performing data mining
tasks, implementing a new algorithm in R or Weka or combination of both, or modifying
some other system to incorporate data mining features, etc. Basically, provide the nature of
your project. This section should be a page or less in length.>
Data Mining Task
<Provide the specific tasks you will perform on the data set. Include specific questions you
will investigate, and the goals for the tasks. This should be independent of the specific
techniques you will use to achieve your goals. This section should be a page or less>.
Data Set
<Describe the data set(s) you will be using in your project. Include the origin of the data set,
an overview of the data set organization, attributes of the data, and challenges of the data set
you've selected. Include any information you have about missing values in the data set. This
should be about one pages in length.>
Methods and Models
<Describe in detail the data mining methods and models you plan to employ to achieve the
goals you set in the Data Mining Task section of your document. Include some mention of
necessary data transformation. If you're implementing a technique, you should have some
idea of how it will be implemented and incorporated into Weka (or some other data mining
tool). If you are combining techniques, explain how you intend to use the output of one
technique as input into another technique. This section should be up to 5 pages in length.
Remember, be detailed, include how you will select the best model from the model space,
etc.>
Assessment
<Discuss the assessment methodology you will use to validate that you have found
meaningful patterns. Will you use n-fold cross-validation, confidence intervals for
accuracy, etc. How will you create your training and test sets? What baseline models will
you use? This section should be about a page or two in length.>
Presentation and Visualization
<Describe how your results will be presented and visualized in such a way to show
meaningful patterns in the data. >
Roles
<In this section, discuss the roles that each group member will have in the project. One
paragraph per group member is sufficient.>
Schedule
<The schedule is a table of dates and tasks that you plan to complete by those dates. Tasks to
be done by the progress report must be listed, as well as any other dates you want to set for
yourselves. Additional deadlines are highly recommended. Be sure to include when you will
have data transformation, modeling, assessment, visualization, etc. completed.
Date   Tasks to be Completed
??/??/2020   Tasks completed by chosen date
??/??/2020   Tasks to be completed by the final report date
??/??/2020   Tasks completed by the class presentation>
Bibliography
<This is where you list bibliographic information for any references you made throughout
the proposal. You should have 5 – 10 references.>

Course Coordinator: Dr. Vani Vasudevan 2


Submission Instructions:
1. This tutorial must be submitted in groups of 2 members. Assign a group leader and
submit the assignment through the group leader’s moodle account.
2. Project Proposal: You have to submit 2 files (1. Project proposal, 2.DataSets: original
as well as refined with data preprocessing tasks in separate spreadsheet) of your
project. The submission filenames should read A2_Proposal_Sxxx_Syyy.docx and
A2_DataSet_Sxxx_Syyy.xxxx where Sxxx, Syyy are student ids of the group
members. For example, A2_Proposal_S11003232_S01004488.docx.
3. Incorrect submission will result in high penalty.

Marking Rubrics for Project Proposal (10 Marks)

Marks
Unsatisfactory Satisfactory Good
CBOK Allocat
(0%-49%) (50% - 75%) (76% - 100%)
ed
Data and I.Do not identify I. Identified I. Identified
Information accurately any of the accurately some of accurately most of
Management data quality problems the data quality the data quality
problems problems
II. Do not perform all
required tasks correctly II. Performed most of II. Performed all the
and consistently the required tasks required tasks
6
correctly and correctly and
III. Provided inaccurate consistently consistently
and/or incomplete reports
III. Provided relatively III. Provided
accurate and complete accurate and
reports complete
reports
Teamwork I. Inappropriate task I. Appropriate I. Appropriate
concepts & distribution and/or task task
issues failure in distribution & distribution &
completion of tasks completion on completion on
in a given time time
4
timeframe. II. Submission of II. Submission of
II. Delay in assignment on assignment on
submission of time time
assignment

Sub Total &


comments

Course Coordinator: Dr. Vani Vasudevan 3

You might also like