BIOSTAT
BIOSTAT
BIOSTAT
C. a nonparametric test.
D. none of the above
32. The Pearson correlation coefficient is best used to determine the association of
A. two ratio variables to each other.
B. three or more ratio variables to each other.
C. two nominal variables to each other.
D. three or more ordinal variables to each other
33. The Pearson correlation coefficient is most appropriate to use when
A. neither of the variables is normally distributed.
B. one of the variables is normally distributed.
C. both of the variables are normally distributed.
D. none of the above
34. The Pearson correlation coefficient provides a measure of
A. U-shaped relationships.
B. the strength of curvilinear relationships.
C. the strength of linear relationships.
D. all of the above.
35. Correlation coefficients measure
A. positive relationships.
B. inverse relationships.
C. curvilinear relationships.
D. a and b only
36. The Spearman correlation coefficient is best used to examine the relationship of
A. three or more ratio variables to each other.
B. two nonnormally distributed ordinal or interval or ratio variables to each
other.
C. two nominal variables to each other.
D. a nominal variable to a ratio variable
37. The Spearman correlation coefficient should be used instead of the Pearson
correlation coefficient when
A. neither of the variables is normally distributed.
B. one of the variables is normally distributed.
C. both of the variables are normally distributed.
D. both a and b.
38. A perfect inverse relationship would have an r of
A. 1.
B. 0.
C. +1.
D. 100.
C. both a and b.
D. none of the above
46. Dummy variables are used to
A. recode the dependent variable.
B. represent ratio variables in regression models.
C. represent ordinal variables in regression models.
D. represent nominal variables in regression models
47. Linear regression allows you to test the significance of the following:
A. The overall model
B. Each regression coefficient
C. The risk ratio comparing those with a characteristic to those without
D. a and b only
48. When the outcome is nominal and dichotomous, which form of regression would be
appropriate?
A. Linear regression
B. Logistic regression
C. Multinomial or polytomous regression
D. All of the above
49. Multivariate regression models, be they linear, logistic, or any other form, allow us
to do which of the following?
A. Simultaneously consider the effects of several independent variables on the
dependent variable of interest
B. Look at the association between two variables of nominal scale
C. Minimize the risk of obtaining spurious results
D. a and c
50. Adding an interaction term to the linear regression model allows us to
A. interpret the adjusted association between each independent variable and the
outcome.
B. assess whether there is significant interaction between the two variables used
to create the interaction term.
C. calculate odds ratios.
D. all of the above
51. How would you interpret a correlation coefficient of -1?
A) Strong positive correlation - as one variable increases, so does the other
B) Weak negative correlation - as one variable increases, the other variable
decreases slightly
C) No correlation at all
D) Perfect negative correlation - as one variable increases, the other variable
decreases
52. A study looking at the correlation between cholesterol level and distance between
home and work found a correlation coefficient of 0.13. How much of the variance in
cholesterol level is explained by distance between home and work (ie, what is the
coefficient of determination)?
A) 13%
B) 1.7%
C) 0.13%
D) 0.17%
53. Which of the following are assumptions required for the Pearson correlation
coefficient to be valid?
A) The two variables must have a linear relationship.
B) Homoscedasticity
C) The two variables must be of interval or ratio measurement scale.
D) The two variables must be normally distributed.
E) The two variables must be of ordinal, interval, or ratio measurement scale.
F) There are no outliers
54. Under which of the following conditions would you need to use the Spearman
correlation coefficient and not the Pearson correlation coefficient?
A) The two variables are normally distributed.
B) The two variables do not have a linear relationship.
C) One variable on the ordinal measurement scale and one is on the interval
measurement scale.
D) There are only three outliers.
E) Neither variable is normally distributed.
F) There is only one variable to be examined
55. A study was conducted looking at the association between number of polyps in the
colon and mean grams of fiber consumed per day based on a standard food frequency
questionnaire. The Pearson correlation coefficient was 0.9 with a p-value of 0.003 and
the Spearman correlation coefficient was 0.28 with a p-value of 0.049. Which form of
correlation would be more appropriate to report and why?
A) The Pearson correlation results should be reported because it shows a stronger
correlation with a smaller p-value (more significant).
B) The Pearson correlation results should be reported because the two variables
are normally distributed.
C) The Spearman correlation results should be reported because at least one of
the variables does not meet the distribution assumption required to use Pearson
correlation.
D) The Spearman correlation results should be reported because the p-value is
closer to 0.05
56. In a study looking at the correlation between annual income and cholesterol level,
one concern was that the age of the participants might be a confounder because both
income and cholesterol generally increase with age. Which of the following correlation
methods could the researchers use to remove the influence of age on the correlation
between income and cholesterol level?
A) Kendall's Tau
B) Spearman correlation coefficient
C) Partial correlation
D) Point-Biserial
57. Squaring the Pearson correlation coefficient (r2) tells us that
A) the direction of the association (positive or negative).
B) whether the correlation is statistically significant at the 0.05 level.
C) how much variance is shared by the two variables.
D) whether or not we need to conduct a Spearman test of correlation instead
58. One of the important benefits of multivariate regression over many other statistical
tests is that it
A) allows you to prevent measurement error.
B) ascertains temporal order that is necessary for causal inference.
C) ensures that you are fulfilling all the required assumptions for statistical
testing.
D) allows you to adjust for potentially confounding variables
59. What type of relationship between a dependent and independent variable is
described by linear regression?
A) An exponential relationship
B) A parabolic relationship
C) A linear relationship
D) A threshold effect
60. Linear regression is appropriate for what type of dependent variable?
A) Ratio scale and normally distributed
B) Count
C) Dichotomous
D) Polytomous
61. In a study looking at the association of mean hours worked per day (hours) and age
(years) with body mass index (BMI), the researchers used multivariate liner regression.
The regression equation is below. What BMI would you predict from this equation for
someone age 25 years who works 45 hours per week?
Y = -3.95 + 0.04(hours worked) + 1.21(age)
A) 32.05
B) 28.1
C) -2.7
D) 66.05
62. What measure(s) of the strength and direction of association can we get from a
linear regression model? (Select all that apply.)
A) One-way ANOVA
B) Chi-square
C) Risk ratio
D) Odds ratio
B) observation.
C) systemic review of research evidence.
D) trial-and-error
68. One response to the nursing shortage has resulted in the recruitment of nurses with
various educational levels. The results of one research study found that the level of
nursing education is critical in caring for certain patients. Therefore, we can conclude
that
A) proper education results in better patient care.
B) increasing the number of nurses without concern for educational level has
serious implications for critically ill patients.
C) proper mentoring will ensure the nursing staff is well prepared.
D) on-the-job training will provide adequate exposure to proper nursing
techniques for all levels of patients
69. Simple numerical or graphical summaries are referred to as
A) explanation statistics.
B) inferential statistics.
C) control statistics.
D) descriptive statistics
70. Techniques that allow conclusions to be drawn about the relationships found among
different variables in a population sample are referred to as
A) descriptive statistics.
B) inferential statistics.
C) predictive statistics.
D) explanatory statistics
71. A study that often attempts to understand how variables are related to each other is
referred to as
A) predictive studies.
B) explanatory studies.
C) inferential statistics.
D) descriptive statistics
72. A research study needs to be well planned before starting. A common mistake seen
in research is
A) reviewing existing literature on the subject.
B) developing a detailed study plan.
C) defining general and ambiguous terms or language.
D) failing to recognize approach limitations
73. The study plan hypotheses
A) will be changed as the study evolves.
B) must be very specific.
C) can be decided after the data are collected.
D) can be generalized
200219 4
220239 3
A) histogram.
B) pie chart.
C) stem-and-leaf plot.
D) flow chart
81. Data values that do not fit the pattern of the rest of the data are
A) deleted from the study.
B) considered outliers.
C) reinvestigated.
D) put in a separate category
82. The marginal probability is computed by
A) the total number of times that the event could have occurred divided by the
number of times the event occurred.
B) the number of times the event occurred divided by the total number of times
that it could have occurred.
C) dividing the subset by the total number of times and then multiplying by 100.
D) subtracting the number of times the event occurred from 100 and then
dividing by the total number of times that it could have occurred
83. The set of probabilities associated with each possible outcome in the sample space is
referred to as
A) t distribution.
B) marginal distribution.
C) probability distribution.
D) conditional distribution
84. The normal distribution is a frequency polygon that demonstrates which of the
following properties?
A) Bell shaped
B) The population mean is shown as .
C) Mean, median, and mode are equal.
D) Total area under the curve above the x-axis equals 1
85. The _____ of a test is the probability that the test result and the diagnosis agree.
A) sensitivity.
B) specificity.
C) predictive value.
D) efficiency
86. Confidence intervals can be computed around the population mean from a single
simple mean using the
A) central limit theorem.
B) normal distribution.
C) predictive value.
D) joint probability
87. ________ probability is the probability that one event will occur given that another
event has occurred.
A) Joint
B) Conditional
C) Marginal
D) Addition
88. The process for testing hypotheses includes which of the following?
A) Make sure the data meet the necessary assumptions to compute the test
statistic.
B) Obtain the p-value of computed statistic.
C) Define significance level.
D) Compute the parameters that are being compared by the test statistic.
E) Compute the test statistic
89. The local health department is interested in determining if there is a difference in
the incidence of childhood caries among children living in rural areas drinking
untreated well water and children living in cities drinking fluorinated water. A null
hypothesis could read
A) the oral health of the rural children will be significantly worse than that of the
city children.
B) the oral health of the city children will be significantly worse than that of the
rural children.
C) the oral health of the city children will be significantly better than that of the
rural children.
D) the oral health of the rural children will not differ from that of the city
children
89. The alpha-level is defined by the researcher before any statistical tests are
conducted. This specific level is defined as
A) the p-value.
B) the standard deviation.
C) error of inference.
D) statistically significant
90. The researcher has determined the project has an = 0.05. This indicates all values
outside ____ will be rejected.
A) 45%
B) 55%
C) 75%
D) 95%
91. A null hypothesis was not rejected, and later it was found that the results were false.
This is an example of
A) a type I error.
B) a type II error.
C) a type III error.
D) a type IV error
92. A potential mistake when interpreting the data is to reject the H0 when it is true.
This is referred to as
A) a type I error.
B) a type II error.
C) a type III error.
D) a type IV error
93. Researchers articulate the expected relationships between the variables by the
_______.
A) null hypothesis
B) alternative hypothesis
C) parameters
D) statistic
94. Data values that do not fit the pattern of the rest of the data are
A)reinvestigated.
B)put in a separate category.
C)considered outliers.
D)deleted from the study
95. A strong correlation (>0.85) between the independent variables that you want to
include in one multivariate linear regression model is an indication that which
assumption has been violated?
A)Normal distribution of the dependent variable
B)Residual analysis
C)Multicolinearity
D)Homoscedasticity
96. Commonly used techniques of presenting and organizing statistical data include
which of the following?
A)Descriptive statistics
B)Graphical displays
C)Frequency distributions
D)Summary charts
97. A psychologist is interested in learning how well his clients are doing with a certain
treatment. He asks his clients to complete the following survey. The data collected will
be used to create
After taking the medication, I feel
0Depressed after 15 hours
1Depressed after 10 hours
2Depressed after 5 hours
3No change
C)95%
D)55%
115. Under which of the following conditions would you need to use the Spearman
correlation coefficient and not the Pearson correlation coefficient?
A)One variable on the ordinal measurement scale and one is on the interval
measurement scale.
B)Neither variable is normally distributed.
C)There is only one variable to be examined.
D)The two variables are normally distributed.
E)There are only three outliers.
F)The two variables do not have a linear relationship
116. Confidence intervals can be computed around the population mean from a single
simple mean using the
A)predictive value.
B)central limit theorem.
C)joint probability.
D)normal distribution
117. The alpha-level is defined by the researcher before any statistical tests are
conducted. This specific level is defined as
A)the standard deviation.
B)error of inference.
C)statistically significant.
D)the p-value
118. Which of the following is not a part of the research project that is used statistically?
A)Interpretation of data
B)Peer review of data
C)Organization of data
D)Collection of data
119. Researchers rely on ________ to assist them in drawing meaningful conclusions
on data collected on a population through their research project.
A)the probability theory
B)the empirical distribution
C)central limit theorem
D)the theoretical distribution
120. The probability that the laboratory test is positive when given to a group of patients
who have the disease is referred to as
A)efficiency.
B)specificity.
C)sensitivity.
D)predictive value
121 A study looking at the correlation between cholesterol level and distance between
home and work found a correlation coefficient of -0.13. How much of the variance in
cholesterol level is explained by distance between home and work (ie, what is the
coefficient of determination)?
A)0.17%
B)0.13%
C)13%
D)1.7%
122. The first step to deciding which statistical technique to use is
A)to identify the variables.
B)to determine which measurement scale to use.
C)to determine the hypotheses.
D)to identify the question to be answered
123. Health care practices have improved due in part to the use of
A)trial-and-error.
B)systemic review of research evidence.
C)observation.
D)experimentation
124. Techniques that allow conclusions to be drawn about the relationships found
among different variables in a population sample are referred to as
A)inferential statistics.
B)explanatory statistics.
C)predictive statistics.
D)descriptive statistics
125. When determining the class intervals in creating a relative frequency chart, it is
important to remember each class must be
A)mutually inclusive.
B)limited in number.
C)mutually exclusive.
D)broad in their width
126. Simple numerical or graphical summaries are referred to as
A)control statistics.
B)explanation statistics.
C)inferential statistics.
D)descriptive statistics
127. What is the difference between stepwise methods versus the standard approach and
hierarchical methods of selecting independent variables to include in a linear regression
model?
A)Stepwise is better for hypothesis testing than the other approaches because it is
less influenced by chance.
B)Stepwise uses statistical criteria for selecting the variables to be included,
whereas the other approaches use theoretical rational.
C)Stepwise methods allow you to include more variables without increasing the
probability of making a type I error.
D)Stepwise uses theoretical rational, whereas the other approaches use statistical
criteria for selecting the variables for inclusion
128. The hypotheses of the study should
A)be based on expert opinion.
B)be updated at the end of the study.
C)predetermine the study results.
D)state the expected relationship between the variables
129. The process for testing hypotheses includes which of the following?
A)Define significance level.
B)Compute the test statistic.
C)Compute the parameters that are being compared by the test statistic.
D)Obtain the p-value of computed statistic.
E)Make sure the data meet the necessary assumptions to compute the test
statistic
130. The _______ can be used to test whether the sample mean is significantly
different from a population mean.
A)one-sample z score
B)one-sample z-test
C)one-sample f-score
D)one-sample t test
131. How would you interpret a correlation coefficient of -1?
A)No correlation at all
B)Strong positive correlationas one variable increases, so does the other
C)Perfect negative correlationas one variable increases, the other variable
decreases
D)Weak negative correlationas one variable increases, the other variable
decreases slightly
132. A research study needs to be well planned before starting. A common mistake seen
in research is
A)reviewing existing literature on the subject.
B)developing a detailed study plan.
C)defining general and ambiguous terms or language.
D)failing to recognize approach limitations
133. Linear regression is appropriate for what type of dependent variable?
A)Ratio scale and normally distributed
B)Count
C)Dichotomous
D)Polytomous
134. A pediatrician will compare the 6-week-old infant with the accepted normal growth
progress using the
A)frequency table.
B)percentile rank.
C)stemplots.
D)nominal scale
135. The normal distribution is a frequency polygon that demonstrates which of the
following properties?
A)Mean, median, and mode are equal.
B)Bell shaped
C)Total area under the curve above the x-axis equals 1.
D)The population mean is shown as
136. In ________ probability, the distribution of events can be inferred without
collecting data.
A)classical
B)relative frequency
C)theoretical
D)empirical
137. ________ probability is the probability that one event will occur given that
another event has occurred.
A)Addition
B)Joint
C)Conditional
D)Marginal
138. Which of the following are assumptions required for the Pearson correlation
coefficient to be valid?
A)Homoscedasticity
B)The two variables must be normally distributed.
C)The two variables must be of ordinal, interval, or ratio measurement scale.
D)There are no outliers.
E)The two variables must have a linear relationship.
F)The two variables must be of interval or ratio measurement scale
139. The marginal probability is computed by
A)the number of times the event occurred divided by the total number of times
that it could have occurred.
B)dividing the subset by the total number of times and then multiplying by 100.
C)subtracting the number of times the event occurred from 100 and then dividing
by the total number of times that it could have occurred.
D)the total number of times that the event could have occurred divided by the
number of times the event occurred
140. A study that often attempts to understand how variables are related to each other is
referred to as
A)descriptive statistics.
B)explanatory studies.
C)predictive studies.
D)inferential statistics
141. What do we mean by "regression toward the mean?"
A)The linear regression equation can be used to identify the average value of each
variable in the model.
B)Linear regression normalizes the scale of the variables so they have a mean of
zero and standard deviation of 1.
C)Outliers in the model should always be excluded when using linear regression.
D)The phenomenon that if a variable is extreme on its first measurement, it will
tend to be closer to the average on a second measurement
142. What measure(s) of the strength and direction of association can we get from a
linear regression model? (Select all that apply.)
A)Risk ratio
B)One-way ANOVA
C)Chi-square
D)Regression coefficient (beta)
E)F-statistic
F)Odds ratio
143. When defining a good research study, which of the following should be included?
A)Description of the planned statistical analysis
B)Statement of the problem
C)List of the hypotheses to be tested
D)Description of the sample and how it was obtained
E)The expected outcome
144. A study was conducted looking at the association between number of polyps in the
colon and mean grams of fiber consumed per day based on a standard food frequency
questionnaire. The Pearson correlation coefficient was 0.9 with a p-value of 0.003 and
the Spearman correlation coefficient was 0.28 with a p-value of 0.049. Which form of
correlation would be more appropriate to report and why?
A)The Pearson correlation results should be reported because the two variables
are normally distributed.
B)The Spearman correlation results should be reported because the p-value is
closer to 0.05.
C)The Pearson correlation results should be reported because it shows a stronger
correlation with a smaller p-value (more significant).
D)The Spearman correlation results should be reported because at least one of
the variables does not meet the distribution assumption required to use Pearson
correlation