Attachment 1
Attachment 1
Attachment 1
To build a solid model, you need to go through the following steps and finalize your models in the end.
Note: You need to install (add-ins) Analysis ToolPak to be able to do this project. It will appear as Data
Analysis in Data navigation bar. Also, add proper titles for each worksheet.
Project Tasks
Task 1. Prepare the dataset (10 points)
Firstly, you need to prepare the data for building the models. In classic data modeling tasks, you only use
a portion of the data to train your model – this portion of the data is called the training set; the rest of
the data is used to evaluate the performances of your models – this is called the test set.
Before modeling the data, you need to have a better understanding of the relationship among the
variables. The research team have specified a set of numerical variables that they care the most about.
These numerical variables are listed in the table below. In particular, the scientists are mostly interested
in the rings of the abalone since it tells the age of the abalones.
Hint: Use Descriptive Statistics in Data Analysis and check Labels in First Row to describe/analyze the
characteristics of each data. Remember to select data title in Input Range.
1|Page
Lengt Diamete Heigh Whole_weigh Shucked_weigh Viscera_weigh Shell_weigh Ring
h r t t t t t s
2|Page
Task 4. Create unstacked data (15 points)
It is very important to look at the 3 different genders separately and see if the relationships are different
for the 3 different genders.
3|Page
Next, you need to build regression models on the unstacked data and compare them with the models
with the stacked data.
The rings on the abalone indicate it’s age. The most interesting problem that the research team found is
the prediction of the abalone’s age using the other measurement variables in the data. You believe you
can build good regression models to do the prediction.
You need to find the best predictor for abalone’s age. This is a trail and error process.
Hints: Use the variable Rings as an instance and pair it with the rest of variables.
4|Page