Concise Biostatistics Manual
Concise Biostatistics Manual
Concise Biostatistics Manual
1
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Concise
Biostatistics
Manual
By
Dr Prashant R Rao
MBBS, MS, DNB, MNAMS, FMAS, FIAGES
DNB (Surgical Gastroenterology)
Assistant Professor in Surgical Gastroenterology
LTMMC & LTMGH, Sion, Mumbai
&
Dr Sarika P Rao
MBBS, MS, MCh, DNB (Plastic & Reconstructive Surgery)
Fellowship in Microvascular & Aesthetic Surgery
Assistant Professor in Plastic & Reconstructive Surgery
LTMMC & LTMGH, Sion, Mumbai
2
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Dedicated to
Our Parents
3
Concise Biostatistics Manual Prashant Rao, Sarika Rao
© Leelavathi Publications
First Edition: 2019
All rights reserved.
The authors have taken special care to ensure that the information provided in
the text are correct to the best of their abilities. However, mistakes are inevitable.
Hence the readers are requested to check and confirm the information provided
in the book in case of any doubt. The authors are not liable to anyone for any loss
or damage caused by the errors.
Cost: free. I just hope this book reaches to whoever needs it and it helps you in
any way possible to pass your theory exams.
Help us make this book better by providing your valuable feedback, positive
criticisms and suggestions to us, via email on concisecancermanual@gmail.com
4
Concise Biostatistics Manual Prashant Rao, Sarika Rao
This book has been prepared by compiling and editing “DATA” from various study
materials and notes provided to us by our seniors and colleague friends, along
with some very good articles and books.
Multiple topics in the subject of Biostatistics which are frequently asked in the
medical examinations have been covered in this book.
Further reading
If one has the time and patience please try and go through the following:
5
Concise Biostatistics Manual Prashant Rao, Sarika Rao
6
Concise Biostatistics Manual Prashant Rao, Sarika Rao
7
Concise Biostatistics Manual Prashant Rao, Sarika Rao
8
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• Descriptive studies
• Analytical studies
o Ecological studies
o Cohort studies
• Experimental studies
o Uncontrolled trial
• Integrative studies
o Systematic review
o Meta-analysis
9
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• It is also known as: backward looking study, effect to cause study, disease
to risk factor study, outcome to exposure study
• Distinguishing features:
o Both exposure and disease have occurred before the start of study
10
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• Basic steps:
o Matching
o Assessment of exposure
• Study design:
Yes a b
No c d
11
Concise Biostatistics Manual Prashant Rao, Sarika Rao
o Short duration
o No follow up required
o No ethical problems
o No Hawthrone effects
o No risk to subjects
• Disadvantages
12
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Odds ratio
• It is a measure of the strength of association between the risk factor and
outcome
Case Control
Exposed a b
Non exposed c d
• Interpretation:
13
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Cohort study
• Distinguishing features:
14
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• Indication:
o Follow up
15
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• Analysis
16
Concise Biostatistics Manual Prashant Rao, Sarika Rao
o Relative risk:
o Attributable risk
▪ Expressed as percentage
17
Concise Biostatistics Manual Prashant Rao, Sarika Rao
18
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• Disadvantages:
o Attrition is a problem
o It is expensive
19
Concise Biostatistics Manual Prashant Rao, Sarika Rao
20
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• It’s a type of experimental study design which aims to reduce bias when
testing a new treatment/ intervention
• Basic steps:
o Drawing up a protocol
21
Concise Biostatistics Manual Prashant Rao, Sarika Rao
o Randomization
22
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Protocol
Randomization
Assessment
23
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• Classification of RCTs:
o On basis of hypothesis:
▪ Superiority trial
▪ Non inferiority trial
▪ Equivalence trial
o On basis of outcome of interest:
▪ Explanatory
▪ Pragmatic
o Study designs in randomized control trial:
▪ Concurrent parallel study design
▪ Crossover type study design
o Types of RCT:
▪ Animal experiment
▪ Human clinical trial
▪ Preventive trial
▪ Risk factor trial
▪ Cessation experiment
24
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• Advantages of RCT
o Considered most reliable form of scientific evidence (level 1
evidence)
o Used to find cause and effect relationship
o No selection bias
o Multiple outcome variables can be measured in a single study
• Disadvantages of RCT
o Expensive
o Longer study duration
o Ethical restrictions and administrative issues
o Participant and observer bias
o Noncompliance of controls threatens the validity of study
25
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• It ensures that the investigator has no control over the allocation process and
this helps eliminate selection bias
• It means that every individual study subject has an equal chance of being
allocated to either study or control group
• Methods:
o Simple randomization (easiest method)
o Systematic randomization
o Block randomization
o Stratified randomization
• Advantages:
o Eliminates bias, especially selection bias and confounding bias
o Allows for comparability between study and control group
o Facilitates the concept of ‘blinding’
o It permits the use of probability theory to express the likelihood that any
difference in outcome between treatment groups merely indicates
chance
26
Concise Biostatistics Manual Prashant Rao, Sarika Rao
27
Concise Biostatistics Manual Prashant Rao, Sarika Rao
28
Concise Biostatistics Manual Prashant Rao, Sarika Rao
29
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Meta-analysis
• first done by Karl Pearson in 1904, term coined by Gene V Glass in 1940
• Steps:
o Literature search
o Report results
30
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• Meta-analysis flowchart:
Select studies
Extract data
Analyze data
Statistical analysis
Report results
31
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• Types of bias
o Selection bias:
32
Concise Biostatistics Manual Prashant Rao, Sarika Rao
o Confounding bias:
▪ exposure outcome
confounding variable
▪ Types:
o Randomization
o Blinding
o Allocation concealment
33
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Types of data
o Ordinal data
o Nominal data
o Interval data
34
Concise Biostatistics Manual Prashant Rao, Sarika Rao
o Ratio
35
Concise Biostatistics Manual Prashant Rao, Sarika Rao
✓ Mean
✓ Median
✓ Mode
• Mean
o Example:
mean = a + b + c + d/ 4
o Advantages
▪ Easy to calculate
▪ Easy to understand
o Disadvantage
• Median
o Advantage
▪ It is easy to calculate
▪ Easy to understand
o Disadvantage
o Example
37
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• Mode
o Types
o Advantage
▪ Easy to calculate
▪ Easy-to-understand
o Disadvantage
38
Concise Biostatistics Manual Prashant Rao, Sarika Rao
39
Concise Biostatistics Manual Prashant Rao, Sarika Rao
40
Concise Biostatistics Manual Prashant Rao, Sarika Rao
P value
• Definition:
• For example, if you Toss a Coin the probability of getting head or tail is 50%
so the P value is 0.5
• Lesser the p-value, lesser is the probability of the event occurring by chance
• Interpretation of P value
41
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• P value for null hypothesis is usually kept at less than 0.05, it means that
null hypothesis is true in less than 5% cases. So, if P < 0.05, the null
hypothesis is rejected, that is alternative hypothesis is accepted and the
difference is statistically significant
• Significance:
42
Concise Biostatistics Manual Prashant Rao, Sarika Rao
o True positive
o True negative
o False positive
o False negative
• Error in research results when false positive or false negative outcomes are
accepted
o Alpha error.
o Example: type I error would mean that the effects of two drugs
studied were found to be different by statistical analysis, when in fact
there was no difference between them.
43
Concise Biostatistics Manual Prashant Rao, Sarika Rao
o Beta error.
o Example: type II error would mean that the effects of two drugs
studied were not found different by statistical analysis, when in fact
there was difference
44
Concise Biostatistics Manual Prashant Rao, Sarika Rao
45
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Sample size
o Type of study?
46
Concise Biostatistics Manual Prashant Rao, Sarika Rao
47
Concise Biostatistics Manual Prashant Rao, Sarika Rao
o Larger sample size can minimise the sampling error. That is, larger
samples tend to be associated with smaller margin of error.
However, there is a point at which increasing sample size no longer
impacts the sampling error: this is known as law of diminishing
returns
▪ Population size
▪ Confidence limit
• Significance:
Statistical Tests
▪ Wilcoxon test
• Parametric tests:
• Paired T test
• Unpaired T test
• One-way ANOVA
• Two-way ANOVA
49
Concise Biostatistics Manual Prashant Rao, Sarika Rao
50
Concise Biostatistics Manual Prashant Rao, Sarika Rao
51
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Source:
52
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Source: https://cyfar.org/types-statistical-tests
53
Concise Biostatistics Manual Prashant Rao, Sarika Rao
54
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Correlation
• Correlation is relationship between the two sets of continuous data
o Example: relationship between height and body weight; relationship
between fasting blood sugar and body weight
• Correlation statistics is used to determine the extent to which two
independent variables are related and yields a number called coefficient of
correlation.
• Correlation coefficient may be positive or negative and may vary from -1 to
+1
• Positive correlation means that values of two different variables increase
and decrease together (direct relationship).
o For example, speed of running and pulse rate correlates positively.
• Negative correlation means that if value of one variable decreases then
value of the other variable increases (inverse relationship).
o For example, age and number of scalp hair may correlates negatively.
• The strength of a correlation is determined by absolute value of correlation
coefficient
• Closer is the value to 1, stronger is the correlation.
• Correlation between two variables is shown by scatter plot
• P value in a correlation statistics indicates whether the correlation (or no
correlation) observed is real or by chance.
• Correlation analysis is important because it can be used to predict values of
one variable on the basis of value of other variable.
55
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• A correlation does not mean causation but it also does not mean absence
of causation, that is, if two variables exhibit strong correlation then one of
the variables may cause the other.
• Correlation data is therefore not sufficient evidence for causation.
• Pearson correlation is applied for parametric data while Spearman
correlation is applied for nonparametric data.
• Combined effect of a group of variable upon a variable not included in the
group is called as multiple correlation.
56
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Regression
57
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Incidence
IR = number of new cases of specific disease diagnosed during a given time period x 1000
• Uses
58
Concise Biostatistics Manual Prashant Rao, Sarika Rao
o Attack rate
• Attack rate:
59
Concise Biostatistics Manual Prashant Rao, Sarika Rao
o Uses
o Limitation
60
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Prevalence
• Definition: it is the total number of all individuals who have the disease at
particular time period divided by the population at risk of having the
disease in this time period
• Prevalence is a ratio
• The term refers to all current cases (new and old existing cases) at a given
time period in a given population
o Point prevalence
o Period prevalence
• Point prevalence
o It is defined as the total number of all current cases (old and new) of
a disease at one point of time in a defined population
o It can be made specific for age, sex and other relevant factors
61
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• Period prevalence:
o It measures the frequency of all current cases (old and new) existing
during a defined period of time in a defined population
o It includes cases arising before but extending into the defined period
as well as those arising during the defined time period
• Uses
62
Concise Biostatistics Manual Prashant Rao, Sarika Rao
63
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Screening Test
64
Concise Biostatistics Manual Prashant Rao, Sarika Rao
o Educational purposes
• Types of screening
65
Concise Biostatistics Manual Prashant Rao, Sarika Rao
o Test should be able to detect the disease prior to the onset of sign
and symptoms
o Repeatability
▪ The test must give consistent results when repeated more than
once on the same individual under the same conditions
• Biological variation
67
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• Sensitivity
• Specificity
• Sensitivity
o Defined as the ability of a test to correctly identify all those who have
the disease that is true positive
o Sensitivity = [a / a + c] x 100
• Specificity
o Specificity = [d / b + d] x 100
68
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• In general
69
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• Basic concepts
70
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• Example:
• Advantage
71
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Forest plots
72
Concise Biostatistics Manual Prashant Rao, Sarika Rao
o The diamond in the plot represents the overall measure of effect of the
meta-analysis
73
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• Advantage
o Easy to understand
74
Concise Biostatistics Manual Prashant Rao, Sarika Rao
75
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• I: intervention
• C: comparison group
• O: outcome
76
Concise Biostatistics Manual Prashant Rao, Sarika Rao
o Self evaluation
• Advantages
77
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• Disadvantage
o It takes time to learn the methods and to put them into clinical
practice
o Research is costly
Cochrane collaboration
• One of the international Agencies which has taken up the task of building
evidence-based medicine is Cochrane collaboration
78
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Levels of evidence
From the Centre for Evidence-Based Medicine, http://www.cebm.net.
2B Individual Cohort study (including low quality RCT, e.g. <80% follow-up)
Grades of Recommendation
“Extrapolations” are where data is used in a situation that has potentially clinically
important differences than the original study situation.
79
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Ethics in Research
• Principles of ethics:
o Beneficience
o Non maleficence
o Justice
o This principle gives the individual subject, the right to gather as much
information as possible so that they can make their informed choice
whether to go forward with the intervention or not
80
Concise Biostatistics Manual Prashant Rao, Sarika Rao
• Justice
81
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Informed consent
o Volunteerism
o Information disclosure
82
Concise Biostatistics Manual Prashant Rao, Sarika Rao
o Alternatives available
83
Concise Biostatistics Manual Prashant Rao, Sarika Rao
84
Concise Biostatistics Manual Prashant Rao, Sarika Rao
Notes
85