Distributions in Data Science

Uploaded by

lavanyaaverma7

Please choose the correct option in the questions below:

(1) If a card is chosen from a standard deck of cards, what is the probability of getting a five
or a seven?

(a) 4/52
(b) 1/26
(c) 8/52
(d) 1/169

Ans: (c) 8/52
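
The answer to question (1) can be checked by counting: a standard deck has four fives and four sevens, and no card is both, so the two probabilities add. The following is a minimal sketch of that arithmetic; the variable names are illustrative and are not part of the original worksheet.

```python
from fractions import Fraction

DECK_SIZE = 52  # cards in a standard deck
FIVES = 4       # one five per suit
SEVENS = 4      # one seven per suit

# "Five" and "seven" are mutually exclusive events, so their probabilities add.
p_five_or_seven = Fraction(FIVES, DECK_SIZE) + Fraction(SEVENS, DECK_SIZE)

print(p_five_or_seven)                     # 2/13, the reduced form of 8/52
print(p_five_or_seven == Fraction(8, 52))  # True
```

Using `Fraction` keeps the result exact, which makes it easy to compare against the answer options.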

(2) Which of the following is the condition for a uniform distribution?

(a) Each value in the set of possible values has exactly the same probability of occurring.
(b) There is a constant probability of success.
(c) There are only two possible outcomes.
(d) There must be at least 3 trials.

Ans: (a) Each value in the set of possible values has exactly the same probability of
occurring.

(3) The collection of one or more outcomes from an experiment is called

(a) Probability
(b) Distribution
(c) Event
(d) Random Experiment

Ans: (c) Event

(4) Which of the following are types of distributions?

(a) Continuous

(b) Discrete

(c) Both of them

Ans: (c) Both of them

(5) Which of the following is not an example of a discrete probability distribution?

(a) The sale or purchase price of a house

(b) The number of bedrooms in a house

(c) The number of bathrooms in a house

(d) Whether or not a home has a swimming pool in it

Ans: (a) The sale or purchase price of a house

(6) A discrete probability distribution may be represented by

(a) A table
(b) A graph
(c) A mathematical equation
(d) All of these

Ans: (d) All of these

(7) What is the probability that a ball is drawn at random from a jar?

(a) 0.1

(b) 1

(d) 0

(e) Cannot be determined from given information

Ans: (e) Cannot be determined from given information

(8) The statistical investigative process has which of the following components?

(a) Formulate Statistical Investigative Questions

(b) Collect/Consider the Data
(c) Interpret the Data

(d) All of the above

Ans: (d) All of the above

Standard Questions

(1) Explain what a distribution in data science is, with the help of two examples.

Ans: A distribution in data science is a function that shows the possible values of a
variable and how often they occur.

While the concept of probability gives us the mathematical calculations, distributions help
us visualize what is happening underneath. For example, consider a coin, which has two
sides, head and tail. The probability of getting a head is 0.5, and the probability of
getting a tail is 0.5. You can be sure that you have exhausted all the values when the
probabilities sum to 1 (that is, 100%). For every value other than these, the probability
of occurrence is zero.

Probability Table for Tossing a coin

Every probability distribution is associated with a graph that describes the likelihood of
each event. The graph below represents our example. This type of distribution is called a
uniform distribution.

Uniform Distribution Graph for Tossing a coin

However, the point to note here is that a distribution in statistics is defined by its
underlying probabilities and not by the graph.

Probability Table for Tossing Two Coins

Now, let us extend our problem statement to tossing two coins. From the graph we can see
that the probability of getting a head on both coins is 0.25. Similarly, the probability of
getting a head on the first coin and a tail on the second is 0.25, the probability of
getting a tail on the first coin and a head on the second is 0.25, and the probability of
getting a tail on both coins is 0.25.
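
The one-coin and two-coin probabilities described above can be reproduced by listing every equally likely outcome. The sketch below assumes fair coins; the helper name `toss_distribution` is hypothetical and used only for illustration.

```python
from fractions import Fraction
from itertools import product

SIDES = ("H", "T")  # a fair coin: head or tail

def toss_distribution(n_coins):
    """Map each ordered outcome of n fair coin tosses to its probability."""
    outcomes = list(product(SIDES, repeat=n_coins))
    p = Fraction(1, len(outcomes))  # every outcome is equally likely
    return {"".join(outcome): p for outcome in outcomes}

one_coin = toss_distribution(1)
two_coins = toss_distribution(2)

for outcome, prob in two_coins.items():
    print(outcome, float(prob))  # HH 0.25, HT 0.25, TH 0.25, TT 0.25

# The probabilities of a valid distribution sum to 1.
assert sum(one_coin.values()) == 1
assert sum(two_coins.values()) == 1
```

Because every outcome gets the same probability, both results are uniform distributions, which matches the description in the text.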

Uniform Distribution Graph for Tossing two Coins

(2) Explain what the statistical problem-solving process is.

Ans: The statistical problem-solving process is a method of collecting and analysing data
in order to answer statistical investigative questions.

This process has four components:

(a) Formulate Statistical Investigative Questions: This step involves anticipating the
differences (variability) before starting the actual investigation. Framing statistical
questions helps us identify the differences that lead to productive investigations. Below
are some examples of statistical questions that identify such differences and guide the
subsequent collection and analysis of data.

• How fast can my plant grow?

• Do plants that get more exposure to sunlight grow faster?
• Does sunlight affect the growth of a plant? How?

Some questions are asked only to collect data, such as "How tall is the plant?" Many such
data collection questions can be asked in order to answer a statistical investigative
question such as "Do plants that get more exposure to sunlight grow faster?"

Statistical investigative questions have some important features that need to be
understood before anticipating the differences. The variables of interest must be clear.
The group or population that the question focuses on must be clear. The intent must be
clear: whether the question asks for a description of data, compares a variable across
two or more groups, or looks at an association between two variables. The question should
be about the whole group and not about an individual. The question should be answerable
through data collection or with the data in hand. Finally, the question should be
purposeful.

(b) Collect/consider the data: This step is also described as acknowledging variability
while designing for differences.

Data collection designs must acknowledge the differences in the data. Statistical process
control and random sampling are two methods that can help detect variation in the data
and reduce it. Design of experiments is the method used to test induced variability.

Data, whether first-hand (newly collected) or second-hand (collected from other sources),
needs interrogation. For example, we need to ask what type each variable is, what the
possible outcomes of each variable are, and how the data was collected. Such questions
establish whether the data can answer the statistical investigative questions. The data
collection design affects the scope of generalizability and the possible limitations in
analysis and interpretation.

(c) Analyse the data: This step is also described as accounting for variability by using
distributions. When analysing data, we have to understand its variability. Reasoning about
distributions is the key to accounting for and describing variability at all developmental
levels. Graphical displays and numerical summaries are used to explore, describe, and
compare the variability of distributions. For example, box plots or comparative dot plots
can show the batting averages of two teams, the Indian cricket team and the Australian
cricket team, for a specific year. These graphs help us compare the two teams' batting
average distributions. We can consider the variability by noting how far apart the two
teams' distributions are or by describing how much they overlap.
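
The numerical summaries behind a box plot can be sketched in a few lines. The batting averages below are made-up values used purely for illustration (they are not real team statistics), and `five_number_summary` is a hypothetical helper name.

```python
import statistics

# Made-up batting averages, for illustration only (not real team data).
team_a = [28.5, 31.0, 35.2, 40.1, 42.7, 45.3, 51.8]
team_b = [22.4, 27.9, 30.6, 33.3, 38.0, 41.5, 47.2]

def five_number_summary(values):
    """Return (min, Q1, median, Q3, max), the numbers a box plot draws."""
    q1, median, q3 = statistics.quantiles(values, n=4)
    return min(values), q1, median, q3, max(values)

summary_a = five_number_summary(team_a)
summary_b = five_number_summary(team_b)
print("Team A:", summary_a)
print("Team B:", summary_b)

# The ranges overlap when each team's minimum is below the other's maximum.
ranges_overlap = summary_a[0] <= summary_b[4] and summary_b[0] <= summary_a[4]
print("Ranges overlap:", ranges_overlap)
```

Comparing the two summaries side by side is one way to describe how far apart the distributions are and how much they overlap.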

(d) Interpret the data: This step is also described as allowing for variability while
considering the data. Most statistical interpretations are made in the presence of
variability and must take it into account. Two sources of variability, randomization to
treatment groups and variability from individual to individual, should be kept in mind
when interpreting the results of a randomized comparative medical experiment. We consider
these sources of variability when the results are generalized and when we look back at
how the data was collected and studied.

(3) Explain how distributions are broadly categorized. Support your answer with an
appropriate example for each category.

Ans:

Types of statistical distributions

Depending on the type of data, distributions are grouped into two categories: discrete
distributions for discrete data (countable outcomes) and continuous distributions for
continuous data (outcomes anywhere on a continuous range).

Continuous data

Continuous data can take any value between two extremes and is usually measured on a
scale, such as temperature or weight. It can be presented as a histogram, which makes it
easier to compare and understand different sets of data. Continuous data can reveal
trends and relationships that might not be visible in other types of data.

Discrete data

Discrete data takes values from a limited, countable set, such as the number of students
in a classroom or the number of cars passing through an intersection. Bar graphs represent
this kind of data in a way that can be understood at a glance.
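
The difference between the two categories can be illustrated by drawing simulated values of each kind. This is a sketch only: the value ranges for bedrooms and house prices are assumptions chosen for the example, and the numbers are randomly generated rather than real data.

```python
import random

rng = random.Random(0)  # fixed seed so the sketch is repeatable

# Discrete: the number of bedrooms can only be a whole number (assumed 1 to 5).
bedrooms = [rng.randint(1, 5) for _ in range(10)]

# Continuous: a house price can fall anywhere in an interval (assumed range).
prices = [rng.uniform(100_000, 500_000) for _ in range(10)]

print(sorted(set(bedrooms)))  # a handful of distinct whole numbers
print(prices[:3])             # arbitrary real values inside the interval
```

A bar graph suits the bedroom counts because there are only a few distinct values, while the prices would need to be grouped into bins, as in a histogram.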

(4) Explain in detail how we formulate statistical investigative questions.

Ans: This step involves anticipating the differences (variability) before starting the
actual investigation. Framing statistical questions helps us identify the differences
that lead to productive investigations. Below are some examples of statistical questions
that identify such differences and guide the subsequent collection and analysis of data.

• How fast can my plant grow?

• Do plants that get more exposure to sunlight grow faster?
• Does sunlight affect the growth of a plant? How?

A question such as "How tall is the plant?" is answered with a single height, so it is
not a statistical question. Questions of this kind are asked to collect data, and many
such data collection questions can be asked in order to answer a statistical
investigative question such as "Do plants that get more exposure to sunlight grow faster?"

Different heights are observed for different exposures to sunlight. This means that plant
growth under sunlight has to be judged from measurements of the plants, and these
measurements may differ. While statistical investigative questions begin worthwhile
studies, questioning is used throughout all four components of the statistical
problem-solving process. Such patterns of questioning can be explained in detail with the
help of examples at different levels.

Statistical investigative questions have some important features that need to be
understood before anticipating the differences. The variables of interest must be clear.
The group or population that the question focuses on must be clear. The intent must be
clear: whether the question asks for a description of data, compares a variable across
two or more groups, or looks at an association between two variables. The question should
be about the whole group and not about an individual. The question should be answerable
through data collection or with the data in hand. Finally, the question should be
purposeful.

(5) Name five instances where you have observed a uniform distribution.

Ans:

Real-Life Examples of the Uniform Distribution

• Guessing a Birthday.
• Rolling a Die.
• Raffle Tickets.
• Deck of Cards.
• Spinning a Spinner.
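
One of the examples above, rolling a die, can be simulated to see the uniform pattern emerge. This is a minimal sketch that assumes a fair six-sided die; the number of rolls and the seed are arbitrary choices.

```python
import random
from collections import Counter

rng = random.Random(42)  # fixed seed so the sketch is repeatable
N_ROLLS = 60_000         # arbitrary; more rolls give steadier frequencies

# Roll a simulated fair die and count how often each face appears.
counts = Counter(rng.randint(1, 6) for _ in range(N_ROLLS))
relative_freq = {face: counts[face] / N_ROLLS for face in range(1, 7)}

for face, freq in relative_freq.items():
    print(face, round(freq, 3))  # each value should be close to 1/6
```

In a uniform distribution every face has the same theoretical probability, 1/6, and the simulated relative frequencies approach that value as the number of rolls grows.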

Higher Order Thinking Skills (HOTS)

(1) Consider that there are 60 students in your class, out of which 20 are affected by cold
and flu every semester. Note down five statistical investigative questions for determining
a student's immunity to catching a cold or flu.

(2) Consider that you are taking part in an animal welfare campaign. One of the most recent
concerns raised by people is that dogs are not able to tolerate a sudden rise in
temperature due to global warming. Note down five statistical investigative questions to
understand how dogs react to changing weather.
