Assignment Yassir 1

This document outlines two assignments for an ICT583 Data Science Applications course at Murdoch University. Assignment 1 is a group mid-term assignment involving a literature review on data-driven models for dementia risk analysis and prediction. Assignment 2 is an individual data science project applying the full data science pipeline to a real-world dementia dataset using R, with a report and code submission. The project involves data preprocessing, exploratory analysis, applying and comparing two prediction models, and discussing results.

Uploaded by

Love Dove

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (1 vote)

231 views

Assignment Yassir 1

Uploaded by

Love Dove

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 6

ICT583 Data Science Applications

Murdoch University
Mid-term assignment & Data science application project

In this unit, you will complete two consecutive assignments that focus on a specific topic in
real-world data science applications. These assignments are designed to help you develop a
good understanding of the latest data-driven modeling techniques used in real-world
applications, and to guide you through the implementation of the entire data science pipeline
on a real dataset using R. By completing these assignments, you will gain hands-on experience
and knowledge that will prepare you for new real-world data science projects.

Topic background: Dementia is a debilitating disease that affects millions of people worldwide.
Early detection and risk prediction are crucial for effective treatment and care. Data-driven
models are increasingly important in the field of dementia research, as they can identify
patterns and relationships in complex datasets that can be used to predict an individual's risk
of developing the disease.

Assignment 1 Mid-term assignment (group assignment)

For this assignment, your group will conduct a concise literature review on the latest data-
driven models for the dementia risk analysis and prediction. The purpose of this review is to
help you gain knowledge and ideas about the most up-to-date data-driven approaches used
for dementia risk analysis and prediction, so that you can develop your own models to analyze
the dementia data provided in Assignment 2.

Group assignment guidelines:

 You will be working on this assignment in a group of 3 to 4 students.
 Please note that you are only allowed to form a group with students who are enrolled in
the same tutorial as yours.
 Each group is required to submit one literature review in a Word document and one
signed group contribution sheet. Only one group member, designated as the liaison
person, should submit the required documents on behalf of the group.
 Collaborate with your group members to complete the assignment and submit it before
the deadline. Make sure to communicate effectively and contribute to the group's work
EQUALLY. A group contribution sheet is required to submit along with this assignment.
Each group member’s individual mark will be given based on the contribution to the group
work.
Literature review guidelines:
 You are required to review at least five computing JOURNAL articles published date after
2020 that focus on dementia risk analysis and prediction modeling using data-driven
models and analytics tools, such as statistical, machine learning and other data mining
techniques.
 Word limit: 1,500 words (can be within a +/- 10% range of this word limit), excluding
references.
 The document should be formatted in Times New Roman 12 font, single line space, with
“Normal” margins selected (from the Word 'Layout' menu, choose 'Normal').
 Your review should be well-structured, clearly written, and appropriately referenced. The
following outline should be followed:
1. Introduction:
The introduction is used to set the context of your review. In this opening paragraph, you
need to:
a. Define the topic of your study and provide any relevant background information that
helps your reader to understand the topic.
b. Explain your reason or perspective for reviewing the literature on this topic.
By doing so, you will give your readers an idea of what to expect in your review and what
and why data-driven models for dementia risk analysis and prediction is significant.
2. Body:
This section begins with an explanation of how you have organized your small-scale
literature review.
Before you begin this section, be sure that you have sorted your reviewed articles into
different themes which can be based on different analyzed data types, data-driven
techniques, or the purposes of data modelling. After you sort your articles, it is important
to give your sorted groups a descriptive name. The names of the sorted articles will
become your headings for each of the paragraphs that you write in the body of your
review.
To write the body of your small-scale literature review, it is important to include the
following:
a. Write an introduction paragraph for the body of your review. This paragraph tells the
reader specific information on how many articles you reviewed and how you sorted the
articles into common themes.
b. This will be separate paragraphs that describe each theme and a summary of each
article including the data resources used, adopted data-driven models, findings,
advantages, and weaknesses, etc. you can also compare, contrast and/or connect the
articles you've selected under each theme.
3. Summary
This is the last paragraph of your small-scale literature review. In this paragraph, it is
important to summarize the main findings and insights from the review. You should also
identify any gaps or limitations in the studies reviewed, as well as any opportunities for
further research and development in this field.
4. References
This is the last page of your review. It serves as a listing of all references that you
mentioned in your paper. Please use IEEE reference style when completing this list. Please
refer to the useful links below.

Useful links:
Where to find literature review
https://scholar.google.com.au/
https://librarysearch.murdoch.edu.au/discovery/search?vid=61MUN_INST:61MU&lang=en
Search for literature Guide
https://libguides.murdoch.edu.au/LitReview/search
IEEE Referencing Guide
https://libguides.murdoch.edu.au/IEEE
https://medium.com/academicianhelp/ieee-referencing-using-microsoft-word-66c855181d64
Assignment 2 Data science application project (individual assignment)

Students will work independently to perform the entire data science pipeline on a given real-
world dementia dataset using R. You will be required to describe the entire project in a
detailed report and submit the code.

The data set used in this study was obtained from a mobile health care service offered in
collaboration with non-governmental organizations that run elderly care centers. This service
was provided to elderly people residing in various districts of Hong Kong for free from 2008 to
2018. The data set consists of 2299 cases, each of which includes eleven variables. These
variables include age, body height, body weight, education level, financial support, geriatric
depression scale score, out-of-pocket financial source (whether they were independent or
dependent on family), marital status, Mini Nutritional Assessment part A score, Mini
Nutritional Assessment part B score. The outcome labels were based on the categories of the
Mini Mental State Exam.

Assignment guidelines:
 Each student is required to submit one project report in a Word document, and R files
which are reproducible to generate all the results in the report.
 R is the only accepted programming language for this assignment. You must use R to
complete all tasks and analyses.

Project report guidelines:

 Do not include any form of code snippets directly into the report. All code should be
included solely in the R files submitted.
 Word limit: 800 words (can be within a +/- 10% range of this word limit), excluding
references, figures, and tables. The report should be formatted in Times New Roman 12
font with normal margins selected (from the Word 'Layout' menu, choose 'Normal').
 Note that 800 words can be a relatively short length for a project report, so it's important
to focus on being clear and concise in your writing, and make the maximum use of well-
designed visualization to help convey information in a more efficient and impactful way.
The following outline should be followed:

Introduction: Introduce the topic of the data science project, including the problem
statement and the goals that the project aims to achieve.

Dataset description: Provide background information on the dataset used in the project,
including its source and any relevant characteristics. Include summary statistics to give
readers an overview of the data.
Data pre-processing: Explain any pre-processing steps that were necessary for the dataset
and justify why they were performed. This section should consider steps such as cleaning,
transforming or encoding the data.

Exploratory data analysis: Perform preliminary investigations on the dataset using

summary statistics and visualizations. This section should provide insights into the dataset
and help identify any potential patterns or trends.

Prediction modelling: Select two prediction models and applied them on the given dataset.
This section should also include some brief information on the selected models, explain
why the chosen models were appropriate for the dataset. Also evaluate the performance
of the two models and compare their results using the appropriate performance metrics.

Results and discussion: Analyze the results and discuss the findings in a clear and engaging
manner. This section should include visualizations and any insights gleaned from the data.

Conclusion: summarize the project to give a concise overview of the project and useful
insights and conclusions.

In addition to the project report, we also require the submission of an R file that includes the
complete code performed from data loading to prediction modeling. The code should be well-
organized, easy to follow, and produce the same outcomes as presented in the project report.

R file guidelines:
 In your submitted code file, include comments to explain the purpose and functionality of
each section of code.
 Organize the code into clear sections, such as data cleaning, exploratory data analysis and
prediction model implementation.
 Use white space and indentation to enhance readability.
 Avoid using overly complicated code, and instead focus on writing clear, concise code.

Bonus task:
Create an R Shiny app that allows users to interact with the data science pipeline you
developed in the project.

Note that
1) This task is a ‘bonus’, which means you will not lose any mark if it is not completed.
However, if you completed, you would earn extra marks (up to extra 15 points on the total
mark of the assignment, with the cap of reaching 100).
2) The bonus task will not be supervised by the teaching staff. Some useful online links are
provided to guide creating the R Shiny app. Therefore, students who are interested need to
rely on their self-learning and exploration to complete the task.

Specification: The R Shiny app should 1) be user-friendly, with clear instructions and intuitive
navigation. 2) Users should be able to upload the dataset, perform exploration data analysis
via generating different visualizations, select prediction models, and view performance
metrics. To develop the app, the student will need to integrate the code used in the previous
tasks into the Shiny framework. Additional features, such as interactive visualizations, can also
be added to enhance the user experience.

Submission for the bonus task requires the Shiny app R scripts and a separate simple user
guide Word document (1-2 pages) that explains the app's functionality and provides
instructions on how to use it. Students can include screenshots and code snippets to showcase
the app's features and functionality.

Useful links for Bonus task – R shiny task

How to Build a Data Analysis App in R Shiny
https://towardsdatascience.com/how-to-build-a-data-analysis-app-in-r-shiny-143bee9338f7
R shiny quick tutorial
https://shiny.rstudio.com/tutorial/written-tutorial/lesson7/

Domestic and Industral Installation-2 PDF
80% (5)
Domestic and Industral Installation-2 PDF
120 pages
Setup and Administration For SAP Cloud ALM
No ratings yet
Setup and Administration For SAP Cloud ALM
122 pages
Manual de Motores MS18 - Poclain
100% (2)
Manual de Motores MS18 - Poclain
10 pages
System Requirment Specifications For Online Revenue Recovery
67% (3)
System Requirment Specifications For Online Revenue Recovery
61 pages
Use Case Narrative Sample PDF
100% (1)
Use Case Narrative Sample PDF
3 pages
Lecture 06
No ratings yet
Lecture 06
31 pages
Assignment - I
No ratings yet
Assignment - I
5 pages
Smarter Work Management System
No ratings yet
Smarter Work Management System
3 pages
Chapter Two: 4.1 The Structured Paradigm Versus The Object-Oriented Paradigm
100% (1)
Chapter Two: 4.1 The Structured Paradigm Versus The Object-Oriented Paradigm
43 pages
Online Bakery Software Requirements Spec
No ratings yet
Online Bakery Software Requirements Spec
29 pages
Hotel Reservation System (Introduction)
No ratings yet
Hotel Reservation System (Introduction)
3 pages
A System To Filter Unwanted Messages From Osn User Walls
0% (1)
A System To Filter Unwanted Messages From Osn User Walls
19 pages
Tourism Srs
100% (1)
Tourism Srs
22 pages
Sample - Software Requirements Specification For Hospital Info Management System
No ratings yet
Sample - Software Requirements Specification For Hospital Info Management System
6 pages
Unit 7 Design-and-Implementation
No ratings yet
Unit 7 Design-and-Implementation
31 pages
Louw, Door Janne, (2006, May 10,2006) - Description With UML Hotel Reservation System. Developed A Hotel Management System That Can Be Used Online
No ratings yet
Louw, Door Janne, (2006, May 10,2006) - Description With UML Hotel Reservation System. Developed A Hotel Management System That Can Be Used Online
12 pages
Food Court Management System
100% (1)
Food Court Management System
6 pages
Software Engineering Fundamentals Tutorial
No ratings yet
Software Engineering Fundamentals Tutorial
10 pages
SRS For Library Management System
No ratings yet
SRS For Library Management System
19 pages
Purbanchal University: BCA274CO User Interface Design
No ratings yet
Purbanchal University: BCA274CO User Interface Design
1 page
MCA Example Case Study
No ratings yet
MCA Example Case Study
37 pages
Library Management System
No ratings yet
Library Management System
9 pages
Chapter Two Literature Review 2.1: (CITATION Lit17 /L 1033)
No ratings yet
Chapter Two Literature Review 2.1: (CITATION Lit17 /L 1033)
12 pages
Pulkit Hospital Management Report
No ratings yet
Pulkit Hospital Management Report
53 pages
ApartmentVisitor Django Report
No ratings yet
ApartmentVisitor Django Report
74 pages
Event Management Report
No ratings yet
Event Management Report
34 pages
Online Library Management System
No ratings yet
Online Library Management System
21 pages
BSC Final Year Project-Teachers CBT Testing Centre Application
100% (1)
BSC Final Year Project-Teachers CBT Testing Centre Application
128 pages
Front-End Developer Assessment
No ratings yet
Front-End Developer Assessment
2 pages
Software Requirements Specification For
100% (1)
Software Requirements Specification For
13 pages
DSD Project Titles
No ratings yet
DSD Project Titles
21 pages
Web Based Career Guidance
100% (2)
Web Based Career Guidance
13 pages
Internal Mark Assessment System: Purpose of The Project
No ratings yet
Internal Mark Assessment System: Purpose of The Project
3 pages
Software Project Management Unit 4
100% (1)
Software Project Management Unit 4
50 pages
Online Library Management System
100% (1)
Online Library Management System
25 pages
BBAI501 Unit 1 and 2 Notes by SP
No ratings yet
BBAI501 Unit 1 and 2 Notes by SP
61 pages
Student Grievance Management System Synopsis
No ratings yet
Student Grievance Management System Synopsis
6 pages
Design and Implementation of A Computerized Hotel Business Billing Systemfcdd9f1d 0f7d 4c2d b4c6 958ddf4a2678
No ratings yet
Design and Implementation of A Computerized Hotel Business Billing Systemfcdd9f1d 0f7d 4c2d b4c6 958ddf4a2678
61 pages
Hoel Mangement System
No ratings yet
Hoel Mangement System
7 pages
SRS Society Management
No ratings yet
SRS Society Management
6 pages
Hospital-Management-System - Silky
No ratings yet
Hospital-Management-System - Silky
14 pages
Project Online Mobile Recharge System
100% (2)
Project Online Mobile Recharge System
10 pages
Synopsis On Website Builder
No ratings yet
Synopsis On Website Builder
7 pages
Petrol Pump Management
0% (1)
Petrol Pump Management
10 pages
Project Oxygen - Seminar Report
50% (2)
Project Oxygen - Seminar Report
36 pages
Hotel Management System
No ratings yet
Hotel Management System
88 pages
Pharmacy Management System Project Report.
50% (2)
Pharmacy Management System Project Report.
26 pages
Newspaper Agency
100% (4)
Newspaper Agency
20 pages
Problem Statement
No ratings yet
Problem Statement
3 pages
Faculty of Graduate Studies and Research Master of Science in Information Technology
No ratings yet
Faculty of Graduate Studies and Research Master of Science in Information Technology
31 pages
Srs Online College Magazine
0% (2)
Srs Online College Magazine
20 pages
Bus Analysis Study
100% (3)
Bus Analysis Study
11 pages
Existing System
0% (1)
Existing System
3 pages
BANK MANAGEMENT SYSTEM-1
No ratings yet
BANK MANAGEMENT SYSTEM-1
22 pages
Synopsis On Hotel Management System Intr
No ratings yet
Synopsis On Hotel Management System Intr
21 pages
Literature Review On Hotel Management System Project
No ratings yet
Literature Review On Hotel Management System Project
8 pages
Atanubairagi: Rajkumardey Chandan Sharma Krishnendubera
100% (1)
Atanubairagi: Rajkumardey Chandan Sharma Krishnendubera
18 pages
SRS & Vision of Hotel Management System
No ratings yet
SRS & Vision of Hotel Management System
13 pages
INSE 6260 Software Quality Assurance: Prof. Rachida Dssouli
No ratings yet
INSE 6260 Software Quality Assurance: Prof. Rachida Dssouli
27 pages
Food Ordering System in C
No ratings yet
Food Ordering System in C
38 pages
Touchpad Plus Ver. 1.1 Class 7
From Everand
Touchpad Plus Ver. 1.1 Class 7
Nisha Batra
No ratings yet
Jump Start Web Performance
From Everand
Jump Start Web Performance
Craig Buckler
No ratings yet
JavaScript Everywhere Second Edition
From Everand
JavaScript Everywhere Second Edition
Gerardus Blokdyk
No ratings yet
005N9751 RevE BWRX 300 General Description
No ratings yet
005N9751 RevE BWRX 300 General Description
88 pages
Grade 6 Sketchup Project Room Design With Labels and Scenes
No ratings yet
Grade 6 Sketchup Project Room Design With Labels and Scenes
1 page
1 Introduction To Biostatistics Last
No ratings yet
1 Introduction To Biostatistics Last
19 pages
AI-BASED DATA CLEANING
No ratings yet
AI-BASED DATA CLEANING
11 pages
Field-Installation-Guide-Cisco-HCI-UCM
No ratings yet
Field-Installation-Guide-Cisco-HCI-UCM
31 pages
Advice 90 (12 2022)
No ratings yet
Advice 90 (12 2022)
1 page
CFX 9850 Gbplus
No ratings yet
CFX 9850 Gbplus
600 pages
Fireclass Prec En13 User - 0
No ratings yet
Fireclass Prec En13 User - 0
13 pages
Civil Service Exam - Additional Questions
No ratings yet
Civil Service Exam - Additional Questions
4 pages
Crutchfield
No ratings yet
Crutchfield
12 pages
Market Survey-Acoustics Materia
50% (2)
Market Survey-Acoustics Materia
23 pages
Single Phase Transformers
No ratings yet
Single Phase Transformers
48 pages
NBR Iec 60439-1-2-3 PDF
90% (20)
NBR Iec 60439-1-2-3 PDF
130 pages
Tuttnauer Manual Tecnico 5596 1R-EP SN18020808
No ratings yet
Tuttnauer Manual Tecnico 5596 1R-EP SN18020808
175 pages
MTCRE Presentation Material-English
No ratings yet
MTCRE Presentation Material-English
157 pages
KODAK PREPS Imposition Software 9.0-V7-20211123 - 205319
No ratings yet
KODAK PREPS Imposition Software 9.0-V7-20211123 - 205319
222 pages
Virginia Science Third Grade Plants 2
No ratings yet
Virginia Science Third Grade Plants 2
2 pages
Application Form 2021: Personal Details
No ratings yet
Application Form 2021: Personal Details
2 pages
SWOT Analysis
100% (1)
SWOT Analysis
6 pages
323-1851-102.7 (6500 R16.9 eMOTR CPS) Issue1
No ratings yet
323-1851-102.7 (6500 R16.9 eMOTR CPS) Issue1
88 pages
Alpha Company
No ratings yet
Alpha Company
20 pages
Comparative Study of Naive Bayes, Gaussian Naive Bayes Classifier and Decision Tree Algorithms For Prediction of Heart Diseases
No ratings yet
Comparative Study of Naive Bayes, Gaussian Naive Bayes Classifier and Decision Tree Algorithms For Prediction of Heart Diseases
14 pages
Geothermal Power Plant
No ratings yet
Geothermal Power Plant
22 pages
Session 03 Gathering The Information and Scanning The Environment
No ratings yet
Session 03 Gathering The Information and Scanning The Environment
31 pages
Autocad Certified User Study Guide: Autodesk
No ratings yet
Autocad Certified User Study Guide: Autodesk
27 pages
v50x (V35ax) en
No ratings yet
v50x (V35ax) en
37 pages
References
No ratings yet
References
15 pages