CC7182 - Programming For Data Analytics
CC7182 - Programming For Data Analytics
Coursework Weight: This coursework accounts for 100% of your total module
grades.
Coursework Assignment
Coursework Submission
The course work is due on Week 12 (12th May, 2023). You need to submit the
following items via Google Classroom:
Technical report as a single PDF file
Associated Python files as a ZIP file
Data Description
Filename: student.csv
The data set contains 395 student records. Each record consists of 33 variables,
which includes information about the students.
Variable 33, G3 – final grade (numeric: 0 - 20), is the target variable.
Silveira)
9. Mjob - mother's job ("teacher", "health" care related, civil "services" (e.g.
10. Fjob - father's job ("teacher", "health" care related, civil "services" (e.g.
1. Data Understanding
Understand what your data resources are and what the characteristics
of those resources are. Write down your findings including the
characteristics of the different columns in the dataset.
(4 marks)
2. Data Transformation
For this task, you need to further explore the given dataset on your
own by using different analysis and visualization techniques and then
present the insights that you have gained. Be creative and come up
with interesting insights and draw conclusions from the data.
(20 marks)
Part 2. Analysis of Livestock Data of Nepal
Data Description
The data given here is about livestock raised across Nepal according to different
districts/regions and the commodities produced by them. The overall data is
spread across multiple files. The list of files are as follows:
1. horseasses-population-in-nepal-by-district.csv
2. milk-animals-and-milk-production-in-nepal-by-district.csv
3. net-meat-production-in-nepal-by-district.csv
4. production-of-cotton-in-nepal-by-district.csv
5. production-of-egg-in-nepal-by-district.csv
6. rabbit-population-in-nepal-by-district.csv
7. wool-production-in-nepal-by-district.csv
8. yak-nak-chauri-population-in-nepal-by-district.csv
You are required to study the data and understand its structure and properties.
Then using python you should clean and merge all the data sources and perform
EDA (Exploratory Data Analysis) on it.
1. Data Understanding
Understand what your data resources are and what the characteristics
of those resources are. Write down your findings including the
characteristics of the different columns in the dataset.
(5 marks)
The technical report should have screen shots of the code. The results achieved
and the interpretations of the results should also be included in the technical
report. Python code should include adequate comments as well.
For task 5 in part 1 and task 3 in part 2, 20 marks are allocated according to the
following categories
creativity (5 marks)
quality of analysis performed/interpretations of the results/presentation
of insights (10 marks)
programming style/use of tools (5 marks)
END