
Assignment for Computer Science Year II Students

This document is an assignment for Year II Computer Science students at Hawassa University, focusing on descriptive statistics and data interpretation. Students are required to collect a dataset, perform various statistical calculations, create visualizations, and reflect on the implications of their findings. The assignment includes specific questions related to data analysis, interpretation, and real-world applications of statistics, with a due date of December 17, 2024.

Uploaded by

Mintesnot Yigezu
Copyright
© All Rights Reserved

HAWASSA UNIVERSITY

DEPARTMENT OF STATISTICS
Name: _________________________________________ ID: _______________ Date: Dec 2024

INTRODUCTION TO STATISTICS
For Computer Science Year II Students
December 2024

Assignment: Descriptive Statistics and Data Interpretation


Due Date: 17/12/2024
Total Marks: 20

Questions:
1. Data Collection:
Collect a dataset of 30 observations on a variable of your choice (e.g., age, height, weight,
hours of study) in your class. List the raw data clearly.

A) Organize the Data (5 marks):
a) Create a frequency table for your dataset.
b) Group the data into intervals (if applicable).
B) Calculate the following for your dataset:
a) Mean
b) Median
c) Mode
C) Compute the following:
a) Range
b) Variance
c) Standard Deviation
D) Data Visualization (5 marks):
a) Construct a bar chart or histogram for the data.
b) Ensure proper labeling of axes and intervals.
E) Interpretation of Results (5 marks): Based on your calculations, answer the following:
a) What does the mean represent in this context?
b) How do the range, variance, and standard deviation help in understanding the spread
of the data?
F) Describe the distribution shape based on the graph. Is it symmetric, skewed, or
uniform? Explain your reasoning.
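
The calculations in parts A to C can be sketched in Python as below. This is only a minimal illustration, not a required method: the `hours` list is made-up data (30 hypothetical values of weekly study hours), `describe` and `frequency_table` are helper names invented for this sketch, and the sample variance (dividing by n - 1) is assumed because the assignment does not say whether the sample or population formula is expected.

```python
import statistics
from collections import Counter

# Hypothetical sample: weekly hours of study for 30 students (made-up values).
hours = [5, 7, 8, 6, 7, 9, 10, 7, 6, 8,
         7, 5, 9, 8, 7, 6, 10, 7, 8, 9,
         6, 7, 8, 7, 5, 9, 7, 8, 6, 7]

def describe(data):
    """Return the measures asked for in parts B and C."""
    return {
        "n": len(data),
        "mean": statistics.mean(data),
        "median": statistics.median(data),
        "modes": statistics.multimode(data),       # all most-frequent values
        "range": max(data) - min(data),
        "variance": statistics.variance(data),     # sample variance (n - 1)
        "std_dev": statistics.stdev(data),         # sample standard deviation
    }

def frequency_table(data):
    """Return (value, frequency) pairs sorted by value."""
    return sorted(Counter(data).items())

summary = describe(hours)
for name, value in summary.items():
    print(f"{name}: {value}")

# A text-only frequency table; each star stands for one observation.
for value, count in frequency_table(hours):
    print(f"{value:>3} | {'*' * count} ({count})")
```

Replacing `hours` with the 30 observations collected in class gives the corresponding figures for a real dataset. If the population formulas are wanted instead, `statistics.pvariance` and `statistics.pstdev` can be swapped in.
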
2. Reflect on how descriptive statistics can help in real-life decision-making. Provide
one example related to your field of study or interest.
3. Explain the significance of a small versus a large standard deviation in a dataset. Provide
an example for each case.
4. Choose a real-world example (e.g., income, exam scores, or sales data):
a) Describe how each measure of central tendency could be used.
b) Highlight potential drawbacks for each measure in your chosen context.
5. The following data shows the salaries (in $1,000) of employees in two departments:
Department A: 40, 45, 50, 55, 60, 40, 45, 50, 55, 60, 40, 45, 50, 55, 60
Department B: 30, 35, 40, 45, 150, 30, 35, 40, 45, 150, 30, 35, 40, 45, 150
a) Compute the mean, median, and mode for both departments.
b) Discuss which measure gives a better understanding of the salary distribution
in each department.
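
One possible way to check the arithmetic for part a) is sketched below, using Python's standard `statistics` module. The salary lists are copied from the question; the helper name `central_tendency` is an assumption made for this sketch.

```python
import statistics

# Salaries in $1,000, copied from question 5.
dept_a = [40, 45, 50, 55, 60, 40, 45, 50, 55, 60, 40, 45, 50, 55, 60]
dept_b = [30, 35, 40, 45, 150, 30, 35, 40, 45, 150, 30, 35, 40, 45, 150]

def central_tendency(salaries):
    """Return the mean, median, and list of modes for one department."""
    return {
        "mean": statistics.mean(salaries),
        "median": statistics.median(salaries),
        # multimode returns every value tied for the highest frequency
        "modes": statistics.multimode(salaries),
    }

result_a = central_tendency(dept_a)
result_b = central_tendency(dept_b)
print("Department A:", result_a)
print("Department B:", result_b)
```

In both departments every salary value occurs three times, so `multimode` returns all five values and there is no single mode. In Department B the three salaries of 150 pull the mean above the median, which is the contrast part b) asks about.
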
6. Explain why the mode might be less useful for continuous data. Provide an example of a
dataset where the mode is not meaningful.
7. Explain how the mean is affected by outliers. Give an example of a situation where using
the mean leads to misleading conclusions.
8. Why is the coefficient of variation (CV) a useful tool for comparing variability across
datasets with different units? What are its limitations?
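
As a small illustration of the idea behind question 8, the sketch below computes the CV as 100 × (standard deviation / mean). The height and weight values are made up, the function name `coefficient_of_variation` is an assumption of this sketch, and the sample standard deviation is assumed.

```python
import statistics

def coefficient_of_variation(data):
    """Return the sample CV as a percentage: 100 * s / mean."""
    mean = statistics.mean(data)
    if mean == 0:
        # One limitation: the CV is undefined when the mean is zero.
        raise ValueError("CV is undefined when the mean is zero")
    return 100 * statistics.stdev(data) / mean

# Hypothetical measurements for the same five people (made-up values).
heights_cm = [160, 165, 170, 175, 180]
weights_kg = [50, 60, 70, 80, 90]

cv_height = coefficient_of_variation(heights_cm)
cv_weight = coefficient_of_variation(weights_kg)
print(f"CV of height: {cv_height:.1f}%")
print(f"CV of weight: {cv_weight:.1f}%")

# Changing the unit (cm to m) rescales the mean and the standard deviation
# by the same factor, so the CV does not change.
heights_m = [h / 100 for h in heights_cm]
cv_height_m = coefficient_of_variation(heights_m)
print(f"CV of height in metres: {cv_height_m:.1f}%")
```

Because the CV has no unit, the two percentages can be compared directly even though one variable is in centimetres and the other in kilograms.
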
9. List two drawbacks of using the range as a measure of variation.
10. A study measures the monthly income of two groups of workers. Group A's income data
is tightly clustered, while Group B's data has significant variability. Suggest the most
appropriate measure of variation to describe each group and justify your answer.
11. A researcher analyzes two datasets and finds that one has a larger standard deviation but a
smaller IQR than the other. What does this suggest about the distribution of data in the two datasets?
Can a dataset have a small variance but a large range? Justify your answer with an
example.
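
The sketch below shows one way to explore question 11 numerically. The dataset is made up for illustration: 98 identical values with a single extreme value at each end. The population variance is assumed, and the quartiles use the default method of Python's `statistics.quantiles`, which may differ slightly from the method taught in class.

```python
import statistics

# Hypothetical dataset: 98 identical values plus one extreme value at each end.
data = [0] + [50] * 98 + [100]

data_range = max(data) - min(data)
variance = statistics.pvariance(data)          # population variance
std_dev = statistics.pstdev(data)              # population standard deviation
q1, _, q3 = statistics.quantiles(data, n=4)    # quartile cut points
iqr = q3 - q1

print(f"range = {data_range}")
print(f"variance = {variance}, standard deviation = {std_dev:.2f}")
print(f"IQR = {iqr}")
```

Here the range is set entirely by the two extreme values, while the variance stays small relative to the range because almost every observation sits at the mean. The IQR ignores the extremes altogether, which is why a dataset with a few far-out values can show a large standard deviation next to a small IQR.
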
