Module_2_Answers_Corrected
1. Define Probability.
Probability is a measure of the likelihood of an event occurring. It is defined as the ratio of the
number of favorable outcomes to the total number of possible outcomes.
1. Experiment: Any process that generates well-defined outcomes (e.g., tossing a coin).
2. Event: A specific outcome or a set of outcomes from an experiment (e.g., getting heads).
3. Medical diagnosis.
4. Weather forecasting.
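The experiment and event definitions above can be illustrated with a short Python sketch. It assumes the classical setting in which every outcome is equally likely; the function name probability and the coin and die sample spaces are illustrative choices, not part of the original answers.

```python
from fractions import Fraction


def probability(sample_space, event):
    """Classical probability: favorable outcomes / total outcomes.

    Assumes every outcome in the sample space is equally likely.
    """
    outcomes = set(sample_space)
    favorable = outcomes & set(event)  # ignore anything outside the sample space
    return Fraction(len(favorable), len(outcomes))


# Experiment: tossing a coin. Event: getting heads.
coin = ["heads", "tails"]
p_heads = probability(coin, {"heads"})

# Experiment: rolling a six-sided die. Event: getting an even number.
die = range(1, 7)
p_even = probability(die, {2, 4, 6})

print(p_heads)  # 1/2
print(p_even)   # 1/2
```

Using Fraction keeps the result exact, which matches the "ratio" wording of the definition.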
A data collection method where respondents answer a set of pre-designed questions. It is used for
Bayes' theorem gives the probability of an event A given that a related event B has occurred,
using the reverse conditional probability P(B|A) and the individual probabilities of A and B.
Formula: P(A|B) = [P(B|A) * P(A)] / P(B)
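As a minimal sketch, the formula can be applied to a medical-diagnosis style question. All numbers below (prevalence, true-positive rate, false-positive rate) are hypothetical values chosen for illustration, and bayes_posterior is an assumed helper name.

```python
def bayes_posterior(p_b_given_a, p_a, p_b):
    """Return P(A|B) = [P(B|A) * P(A)] / P(B)."""
    if not 0 < p_b <= 1:
        raise ValueError("P(B) must be in the interval (0, 1]")
    return p_b_given_a * p_a / p_b


# Hypothetical inputs: A = "patient has the disease", B = "test is positive".
p_disease = 0.01             # P(A): assumed prevalence
p_pos_given_disease = 0.95   # P(B|A): assumed true-positive rate
p_pos_given_healthy = 0.05   # P(B|not A): assumed false-positive rate

# P(B) via the law of total probability.
p_pos = (p_pos_given_disease * p_disease
         + p_pos_given_healthy * (1 - p_disease))

posterior = bayes_posterior(p_pos_given_disease, p_disease, p_pos)
print(f"P(disease | positive) = {posterior:.3f}")  # 0.161
```

Under these assumed numbers, a positive test still leaves the disease fairly unlikely because the disease itself is rare.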
A measure of variability quantifies the spread or dispersion of a dataset. Examples include Range,
Variance, and Standard Deviation.
The process of cleaning and transforming raw data into a usable format for analysis.
Data enrichment involves enhancing raw data by adding context or supplementary information from
external sources.
The process of converting data from one format or structure to another to make it more suitable for
analysis.
It refers to ensuring the accuracy, consistency, and reliability of data by performing checks and
Correlation measures the strength and direction of a relationship between two variables, while
18. Compute Mean, Median, and Mode for the Following Data Sets:
a) 45, 55, 60, 60, 63, 63, 63, 63, 65, 65, 70
b) 10, 8, 5, 0, 1, 7, 9, 2, 1
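One way to check the answers for both data sets in question 18 is Python's standard statistics module. This is a sketch rather than the worked solution from the original answer key; the summarize helper is an assumed name.

```python
from statistics import mean, median, multimode

datasets = {
    "a": [45, 55, 60, 60, 63, 63, 63, 63, 65, 65, 70],
    "b": [10, 8, 5, 0, 1, 7, 9, 2, 1],
}


def summarize(values):
    """Return the three measures of central tendency for a list of numbers."""
    return {
        "mean": mean(values),         # sum of values / number of values
        "median": median(values),     # middle value of the sorted data
        "modes": multimode(values),   # most frequent value(s), as a list
    }


results = {label: summarize(values) for label, values in datasets.items()}

for label, stats in results.items():
    print(f"{label}) mean = {stats['mean']:.2f}, "
          f"median = {stats['median']}, modes = {stats['modes']}")
# a) mean = 61.09, median = 63, modes = [63]
# b) mean = 4.78, median = 5, modes = [1]
```

multimode is used instead of mode so that a data set with several equally frequent values would report all of them.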
4. Removing outliers.
20. Crowdsourcing:
a) Define Crowdsourcing: It involves obtaining data, ideas, or services from a large group of people,
typically via the internet.
1. Surveys.
2. Interviews.
3. Observation.
Mean, Median, and Mode are used to describe the central value of a dataset.
1. Removing duplicates.
3. Correcting errors.
4. Standardizing formats.
7. Handling outliers.
8. Finalizing the cleaned dataset.
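Some of the cleaning steps listed above can be sketched in plain Python. The records, field names, and helper functions below are hypothetical, and two choices are assumptions rather than part of the original answer: duplicates are detected by the id field, and outliers are dropped using the 1.5 x IQR rule.

```python
from statistics import quantiles

# Hypothetical raw records with a duplicate, messy text, and one extreme value.
raw_records = [
    {"id": 1, "city": " pune ", "temp_c": 31.0},
    {"id": 2, "city": "Mumbai", "temp_c": 33.5},
    {"id": 2, "city": "Mumbai", "temp_c": 33.5},    # duplicate row
    {"id": 3, "city": "DELHI", "temp_c": 29.0},
    {"id": 4, "city": "chennai", "temp_c": 34.0},
    {"id": 5, "city": "Kolkata", "temp_c": 320.0},  # implausible reading
    {"id": 6, "city": "Jaipur", "temp_c": 30.0},
    {"id": 7, "city": "nagpur ", "temp_c": 32.0},
]


def remove_duplicates(records):
    """Keep the first record seen for each id (assumed unique key)."""
    seen, unique = set(), []
    for record in records:
        if record["id"] not in seen:
            seen.add(record["id"])
            unique.append(record)
    return unique


def standardize_formats(records):
    """Trim whitespace and use consistent capitalization for city names."""
    return [{**r, "city": r["city"].strip().title()} for r in records]


def drop_outliers(records, field):
    """Drop records whose value lies outside 1.5 * IQR of the quartiles."""
    q1, _, q3 = quantiles([r[field] for r in records], n=4)
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return [r for r in records if low <= r[field] <= high]


cleaned = drop_outliers(
    standardize_formats(remove_duplicates(raw_records)), "temp_c"
)
for record in cleaned:
    print(record)
```

Dropping outliers is only one way of handling them; capping the value or flagging the record for review are common alternatives when the data point may be genuine.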
Crowdsourcing allows researchers to gather vast amounts of labeled data quickly. It is often used in
machine learning projects for tasks like image labeling or sentiment analysis. Challenges include