Basic Statistics Assignment 1
1) A marketing research team is conducting a survey to gather data from customers about their
preferences and experiences with a new product. The collected data falls into various categories.
Identify the type of data for each of the following scenarios: nominal, ordinal, interval, or ratio.
a) Customers are asked to rate their satisfaction with the product on a scale of 1 to 5, with
1 being "Very Dissatisfied" and 5 being "Very Satisfied."
b) The survey collects data on the number of products purchased by each customer in the
last month.
c) Customers are asked to indicate their age groups: 18-24, 25-34, 35-44, 45-54, 55+.
d) Customers are asked to rank three product features in order of importance: price,
durability, and aesthetics.
e) The survey asks customers to choose their preferred payment method: credit card, cash,
online payment, or mobile payment.
For each scenario, explain your reasoning behind categorizing the data as nominal, ordinal, interval,
or ratio.
2) The given list provides the count of rooms in 50 houses located in Goa:
2 6 4 3 3 4 4 7 5 4
5 3 7 5 5 4 4 5 6 2
6 3 4 4 5 8 6 5 5 3
3 3 7 5 4 4 5 4 1 6
5 4 4 8 6 2 3 3 6 4
a) Create a frequency table and a bar graph to visually represent this data.
b) If a new real estate developer plans to construct an apartment complex with units
having an equal number of rooms, which count of rooms should they choose
according to this data? Provide an explanation.
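As a starting point for part (a), the tally can be sketched in Python. This is a minimal sketch, assuming each digit in the list above is the room count of one house (ten houses per row); the variable names are illustrative only.

```python
from collections import Counter

# Room counts as listed in the question, read as one digit per house
# (assumption: 5 rows x 10 single-digit counts = 50 houses).
rows = ["2643344754", "5375544562", "6344586553", "3375445416", "5448623364"]
rooms = [int(digit) for row in rows for digit in row]

# Tally how many houses have each room count.
freq = Counter(rooms)

print("Rooms  Frequency")
for count in sorted(freq):
    # A row of '#' characters serves as a simple text bar graph.
    print(f"{count:>5}  {freq[count]:>9}  {'#' * freq[count]}")

# The most frequent room count (the mode) is the natural candidate for part (b).
most_common_rooms, houses = freq.most_common(1)[0]
print(f"Most common room count: {most_common_rooms} ({houses} houses)")
```

The text bars stand in for a bar graph; a plotting library could be used instead if one is available.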
3) The following four data sets give the daytime temperature in Mumbai, Bangalore, Hyderabad,
and Chennai for each of the 28 days in February 2023.
Mumbai
31 30 30 30 30 29 31
30 31 29 29 30 31 29
29 30 28 29 29 29 29
28 29 27 29 28 29 29
Bangalore
20 29 22 25 20 19 28
24 22 23 25 26 21 22
20 16 23 26 22 18 21
20 19 24 24 22 18 20
Hyderabad
26 35 23 27 26 25 33
26 25 29 30 28 24 28
23 21 24 32 27 23 24
24 23 31 32 35 23 23
Chennai
28 28 29 29 30 27 30
25 24 25 24 29 26 28
29 31 23 26 29 31 26
27 26 27 25 29 37 25
Draw a box-plot for each data set and comment on the differences in the shape, spread, and
location of these box-plots.
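Each box-plot is built from a five-number summary (minimum, Q1, median, Q3, maximum). The sketch below computes one per city. It assumes the "inclusive" quartile convention of Python's `statistics.quantiles`; other conventions can give slightly different Q1 and Q3 values. The helper name `five_number_summary` is illustrative.

```python
import statistics

# Daytime temperatures for the 28 days, copied from the question.
temps = {
    "Mumbai": [31, 30, 30, 30, 30, 29, 31, 30, 31, 29, 29, 30, 31, 29,
               29, 30, 28, 29, 29, 29, 29, 28, 29, 27, 29, 28, 29, 29],
    "Bangalore": [20, 29, 22, 25, 20, 19, 28, 24, 22, 23, 25, 26, 21, 22,
                  20, 16, 23, 26, 22, 18, 21, 20, 19, 24, 24, 22, 18, 20],
    "Hyderabad": [26, 35, 23, 27, 26, 25, 33, 26, 25, 29, 30, 28, 24, 28,
                  23, 21, 24, 32, 27, 23, 24, 24, 23, 31, 32, 35, 23, 23],
    "Chennai": [28, 28, 29, 29, 30, 27, 30, 25, 24, 25, 24, 29, 26, 28,
                29, 31, 23, 26, 29, 31, 26, 27, 26, 27, 25, 29, 37, 25],
}

def five_number_summary(values):
    """Return (min, Q1, median, Q3, max) using the 'inclusive' quartile method."""
    q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return min(values), q1, q2, q3, max(values)

for city, values in temps.items():
    low, q1, med, q3, high = five_number_summary(values)
    print(f"{city:<10} min={low} Q1={q1} median={med} Q3={q3} max={high} IQR={q3 - q1}")
```

Comparing the medians gives the location, the IQR and range give the spread, and the position of the median inside the box hints at the shape.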
4) Ten patients at a doctor’s surgery wait for varying lengths of time to see their doctor. The
waiting times, in minutes, are as follows:
5 mins, 17 mins, 8 mins, 2 mins, 55 mins, 9 mins, 22 mins, 11 mins, 16 mins, 5 mins.
a) Calculate the mean, median, and mode for the given waiting times. For each
calculation, show your steps clearly.
b) Considering the nature of the data and its distribution, discuss which measure of central
tendency (mean, median, or mode) would be most appropriate to represent the typical
waiting time in this scenario. Explain your reasoning.
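The three measures in part (a) can be checked with Python's standard `statistics` module. This is only a sketch for verifying hand calculations; the comments outline the steps that the question asks to be shown.

```python
import statistics

# Waiting times in minutes, as listed in the question.
waits = [5, 17, 8, 2, 55, 9, 22, 11, 16, 5]

# Mean: sum of all values divided by how many there are.
mean = statistics.mean(waits)

# Median: sort the values, then average the two middle ones (n = 10 is even).
ordered = sorted(waits)
median = statistics.median(waits)

# Mode: the value that occurs most often.
mode = statistics.mode(waits)

print("sorted:", ordered)
print("mean:", mean, "median:", median, "mode:", mode)
```

Printing the sorted list makes it easy to see how far the largest value sits from the rest, which is relevant to part (b).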
5) For each dataset, calculate the first quartile (Q1), median (Q2), third quartile (Q3), and the
interquartile range (IQR). Provide clear steps for your calculations and show your final answers.
a) The data represents the time in minutes that seventeen employees took to commute to
work on a particular day:
18, 34, 68, 22, 10, 92, 46, 52, 38, 29, 45, 37, 10, 50, 30, 70, 90.
b) The data provides the number of people killed in road traffic accidents in Delhi from
2013 to 2021:
1820, 1671, 1622, 1591, 1584, 1690, 1463, 1196.
c) The following dataset presents the final marks of 40 students for the Basic Statistics
course:
61, 77, 51, 85, 55, 77, 70, 56, 41, 61, 28, 87, 23, 22, 86, 63, 99, 94, 38, 25,
90, 59, 87, 53, 29, 86, 33, 87, 75, 50, 59, 77, 77, 71, 99, 78, 70, 93, 78, 93.
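Quartile conventions differ between textbooks and software, so the following is a sketch under one stated assumption: Q1 and Q3 are the medians of the lower and upper halves of the sorted data, with the overall median left out of both halves when the number of values is odd. The function name `quartiles_median_of_halves` is hypothetical, and the data are the values exactly as listed in parts (a) to (c).

```python
import statistics

def quartiles_median_of_halves(values):
    """Return (Q1, Q2, Q3, IQR) using the median-of-halves convention."""
    data = sorted(values)
    n = len(data)
    half = n // 2
    lower = data[:half]
    # For odd n, skip the middle value so it belongs to neither half.
    upper = data[half + 1:] if n % 2 else data[half:]
    q1 = statistics.median(lower)
    q2 = statistics.median(data)
    q3 = statistics.median(upper)
    return q1, q2, q3, q3 - q1

commute = [18, 34, 68, 22, 10, 92, 46, 52, 38, 29, 45, 37, 10, 50, 30, 70, 90]
deaths = [1820, 1671, 1622, 1591, 1584, 1690, 1463, 1196]
marks = [61, 77, 51, 85, 55, 77, 70, 56, 41, 61, 28, 87, 23, 22, 86, 63, 99, 94, 38, 25,
         90, 59, 87, 53, 29, 86, 33, 87, 75, 50, 59, 77, 77, 71, 99, 78, 70, 93, 78, 93]

for label, data in [("a", commute), ("b", deaths), ("c", marks)]:
    q1, q2, q3, iqr = quartiles_median_of_halves(data)
    print(f"({label}) n={len(data)} Q1={q1} Q2={q2} Q3={q3} IQR={iqr}")
```

If the course uses a different rule (for example, interpolated percentiles), Q1 and Q3 may differ slightly from what this prints, while Q2 stays the same.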
6) The management team of a manufacturing company is examining the production output of two
assembly lines, A and B, over the past 7 days. The number of units produced per day is recorded
for each assembly line. The team aims to understand the average production and variability to
make informed decisions.
Assembly Line A:
Assembly Line B:
a) Calculate the mean production and the standard deviation for both Assembly Line A and
Assembly Line B over the 7-day period.
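The calculation in part (a) can be sketched as a small helper that takes one assembly line's seven daily figures as a list. The helper name `summarize_line` is hypothetical. Both the population and the sample standard deviation are returned, since the question does not say which is intended.

```python
import statistics

def summarize_line(units_per_day):
    """Return (mean, population SD, sample SD) for one line's daily output."""
    mean = statistics.mean(units_per_day)
    # pstdev divides by n: use it when the recorded days are the whole
    # population of interest.
    population_sd = statistics.pstdev(units_per_day)
    # stdev divides by n - 1: use it when the recorded days are treated as a
    # sample of ongoing production.
    sample_sd = statistics.stdev(units_per_day)
    return mean, population_sd, sample_sd
```

Calling the helper once per line, with the daily unit counts for Assembly Line A and then for Assembly Line B, gives the values needed to compare average output and variability.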