BIOSTAT

1.
The equation xi2 is best known as the:

A. The sum of all the squared values of the variable x in the data set
B. Product of all the values of the variable x in the data set
C. Square of the sum of all the values of the variable x in the data set
2. The equation (xi)2 is best known as the:
A. Sum of all the squared values of the variable x in the data set
B. Product of all the values of the variable x in the data set
C. Square of the sum of all the values of the variable x in the data set
D. Sum of all the values of the variable x in the data set
3. A nominal level of measurement is used when the values of a variable have which of
the following properties?
A. They can be placed in meaningful order, but there is no information about the
size of the interval between each value.
B. They can be placed in meaningful order, have meaningful intervals, and have a
true zero.
C. They can be placed in meaningful order and have meaningful intervals
between the times, but there is no true zero.
D. They simply represent categories
4. Class intervals should be:
A. Of equal length
B. Mutually exclusive
C. Overlapping intervals
D. A and B only
5. Symmetrical distributions are best described by which of the following statements?
A. They have an equal number of data points that appear to the left and right of
the center
B. They are normally distributed
C. They have a U-shaped distribution
D. They have a standard deviation
6. Skewness refers to the :
A. Shape of the top of the curve
B. Extent to which data are not symmetrical
C. Presence of outliers
D. Sample
7. Which of the following statements best describes a stem-and-leaf display?
A. It shows the range of values of the variable.
B. It shows the shape of the distribution of the variable.
C. It preserves the individual values of the variable.
D. All of the above are correct
8. Standard deviation is best described by which of the following statements?
A. It can be used to compare variation between two or more variables.

B. It is the average distance of each point from the mean.
C. It is the variance squared.
D. Both a and b are correct
9. Descriptive statistics have which of the following properties?
A. They are numerical or graphical summaries of data.
B. They are used to examine relationships between variables in a data set.
C. They are used to see how well sample data can be generalized to the
population.
D. All of these.
10. Inferential statistics have which of the following properties?
A. They are numerical or graphical summaries of data.
B. They are used to examine relationships between variables in a data set.
C. They are used to see how well sample data can be generalized to the
population.
D. Both b and c are correct
11. The p(A) is most accurately defined as the:
A. Joint probability of event A
B. Marginal probability of event A
C. Proof of event A
D. Marginal probability that event A will not occur
12. The p(B) is most accurately defined as the:
A. Joint probability of event B
B. Marginal probability of event B
C. Proof of event B
D. Marginal probability that event B will not occur
13. The p(B]A) is most accurately defined as the:
A. Joint probability that events A and B will occur
B. Addition rule
C. Conditional probability that event B will occur
D. Multiplication rule
14. The p(AB)=p(A)+p(B)-p(AB) describes the:
A. Joint probability that events A and B will occur
B. Addition rule
C. Conditional probability that event B will occur
D. Multiplication rule
15. If the p(A | B) = p(A), then
A. two events are independent.
B. two events are mutually exclusive.
C. p(A) = p(B).
D. none of the above apply
16. A normal distribution is characterized by

A. a bell shape.
B. a mean, median, and mode that are equal.
C. a total area under the curve above the x-axis that is 1.
D. all of the above
17. A z-score can give information about
A. the mean of a distribution.
B. the standard deviation (SD) of a distribution.
C. the percentile rank of a data point.
D. none of the above
18. A z-score of 0 corresponds to the
A. mean.
B. SD.
C. interquartile range.
D. 75th percentile
19. The 50th percentile is always the
A. Mean
B. Median
C. SD
D. A and B
20. A sample population cure is more likely to look like the population curve when:
A. The bell shape is wide
B. The sample size is small
C. The sample size is >30
D. None of the above
21. The null hypothesis states:
A. The expected direction of the relationship between the variables.
B. That no relationship will be found
C. That a relationship will be found, but it will not state the direction
D. None of the above
22. The a-level is defined by:
A. The probability of making a Type I error
B. The probability of making a Type II error
C. The researcher at the start of a study
D. A and C only
23. The one sample t-test is used to compare a sample mean to a population mean
when:
A. The population mean is known
B. The population standard deviation (SD) is not known
C. The sample size is at least 30
D. All of the above occur
24. Power is defined by

A. the -level.
B. the sample size.
C. the effect size ().
D. all of the above
25. A type I error occurs when the
A. null hypothesis is accepted when it is false.
B. null hypothesis is rejected when it is true.
C. sample size is too small.
D. effect size () is not defined in advance
26. A type II error occurs when the
A. null hypothesis is accepted when it is false.
B. null hypothesis is rejected when it is true.
C. sample size is too small.
D. effect size () is not defined in advance
27. Power can be increased by doing which of the following?
A. Increasing the -level.
B. Increasing the sample size.
C. Increasing the effect size ().
D. All of the above.
28. A researcher conducts a small study and finds that no statistically significant
relationship exists between smoking and lung cancer. This is most likely
A. a type I error.
B. a type II error.
C. both a and b.
29. Which of the following is more likely to contain the "true" population value of the
mean?
A. A 90% confidence interval (CI).
B. A 95% CI.
C. A 99% CI.
D. All of the above
30. If a statistical test is significant, it means that
A. it has important clinical applications.
B. the study had acceptable power.
C. the null hypothesis was rejected.
D. all of the above are true
31. The Pearson correlation test is best described as
A. a type of ANOVA.
B. a parametric test.
C. a nonparametric test.
32. The Pearson correlation coefficient is best used to determine the association of
A. two ratio variables to each other.
B. three or more ratio variables to each other.
C. two nominal variables to each other.
D. three or more ordinal variables to each other
33. The Pearson correlation coefficient is most appropriate to use when
A. neither of the variables is normally distributed.
B. one of the variables is normally distributed.
C. both of the variables are normally distributed.
34. The Pearson correlation coefficient provides a measure of
A. U-shaped relationships.
B. the strength of curvilinear relationships.
C. the strength of linear relationships.
D. all of the above.
35. Correlation coefficients measure
A. positive relationships.
B. inverse relationships.
C. curvilinear relationships.
D. a and b only
36. The Spearman correlation coefficient is best used to examine the relationship of
A. three or more ratio variables to each other.
B. two nonnormally distributed ordinal or interval or ratio variables to each
other.
C. two nominal variables to each other.
D. a nominal variable to a ratio variable
37. The Spearman correlation coefficient should be used instead of the Pearson
correlation coefficient when
A. neither of the variables is normally distributed.
B. one of the variables is normally distributed.
C. both of the variables are normally distributed.
D. both a and b.
38. A perfect inverse relationship would have an r of
A. 1.
B. 0.
C. +1.
D. 100.
39. The r2 measures

A. the variance not shared by the two variables.
B. strength of the positive relationship of the two variables.
C. the variance shared by the two variables.
40. The Pearson and Spearman correlation coefficients can be used to measure
A. the relationship of two independent variables to each other.
B. the relationship of two related variables to each other.
C. test-retest reliability.
D. all of the above
41. Multiple linear regression models are best used with
A. a dichotomous dependent variable.
B. any ratio-level variable.
C. a normally distributed ratio-level variable.
D. all of the above
42. The unadjusted regression coefficients (bs) in a multiple linear regression model
give information about
A. the strongest predictor of the dependent variable.
B. the change in the dependent variable per unit increase in the independent
variable.
C. the adjusted odds of having the condition represented by the dependent
variable given that the independent variable is present.
D. a and b only
43. The adjusted regression coefficient in a multiple linear regression contains
information about
variable.
D. a and b only.
44. The coefficient of determination in a multiple linear regression contains information
about
variable.
D. the amount of variance in the dependent variable explained by the model
45. Linear regression models describe
A. curvilinear relationships only.
B. linear relationships only.
C. both a and b.
46. Dummy variables are used to
A. recode the dependent variable.
B. represent ratio variables in regression models.
C. represent ordinal variables in regression models.
D. represent nominal variables in regression models
47. Linear regression allows you to test the significance of the following:
A. The overall model
B. Each regression coefficient
C. The risk ratio comparing those with a characteristic to those without
D. a and b only
48. When the outcome is nominal and dichotomous, which form of regression would be
appropriate?
A. Linear regression
B. Logistic regression
C. Multinomial or polytomous regression
D. All of the above
49. Multivariate regression models, be they linear, logistic, or any other form, allow us
to do which of the following?
A. Simultaneously consider the effects of several independent variables on the
dependent variable of interest
B. Look at the association between two variables of nominal scale
C. Minimize the risk of obtaining spurious results
D. a and c
50. Adding an interaction term to the linear regression model allows us to
A. interpret the adjusted association between each independent variable and the
outcome.
B. assess whether there is significant interaction between the two variables used
to create the interaction term.
C. calculate odds ratios.
D. all of the above
51. How would you interpret a correlation coefficient of -1?
A) Strong positive correlation - as one variable increases, so does the other
B) Weak negative correlation - as one variable increases, the other variable
decreases slightly
C) No correlation at all
D) Perfect negative correlation - as one variable increases, the other variable
decreases
52. A study looking at the correlation between cholesterol level and distance between
home and work found a correlation coefficient of 0.13. How much of the variance in
cholesterol level is explained by distance between home and work (ie, what is the
coefficient of determination)?
A) 13%
B) 1.7%
C) 0.13%
D) 0.17%
53. Which of the following are assumptions required for the Pearson correlation
coefficient to be valid?
A) The two variables must have a linear relationship.
B) Homoscedasticity
C) The two variables must be of interval or ratio measurement scale.
D) The two variables must be normally distributed.
E) The two variables must be of ordinal, interval, or ratio measurement scale.
F) There are no outliers
54. Under which of the following conditions would you need to use the Spearman
correlation coefficient and not the Pearson correlation coefficient?
A) The two variables are normally distributed.
B) The two variables do not have a linear relationship.
C) One variable on the ordinal measurement scale and one is on the interval
measurement scale.
D) There are only three outliers.
E) Neither variable is normally distributed.
F) There is only one variable to be examined
55. A study was conducted looking at the association between number of polyps in the
colon and mean grams of fiber consumed per day based on a standard food frequency
questionnaire. The Pearson correlation coefficient was 0.9 with a p-value of 0.003 and
the Spearman correlation coefficient was 0.28 with a p-value of 0.049. Which form of
correlation would be more appropriate to report and why?
A) The Pearson correlation results should be reported because it shows a stronger
correlation with a smaller p-value (more significant).
B) The Pearson correlation results should be reported because the two variables
are normally distributed.
C) The Spearman correlation results should be reported because at least one of
the variables does not meet the distribution assumption required to use Pearson
correlation.
D) The Spearman correlation results should be reported because the p-value is
closer to 0.05
56. In a study looking at the correlation between annual income and cholesterol level,
one concern was that the age of the participants might be a confounder because both
income and cholesterol generally increase with age. Which of the following correlation
methods could the researchers use to remove the influence of age on the correlation
between income and cholesterol level?
A) Kendall's Tau
B) Spearman correlation coefficient
C) Partial correlation
D) Point-Biserial
57. Squaring the Pearson correlation coefficient (r2) tells us that
A) the direction of the association (positive or negative).
B) whether the correlation is statistically significant at the 0.05 level.
C) how much variance is shared by the two variables.
D) whether or not we need to conduct a Spearman test of correlation instead
58. One of the important benefits of multivariate regression over many other statistical
tests is that it
A) allows you to prevent measurement error.
B) ascertains temporal order that is necessary for causal inference.
C) ensures that you are fulfilling all the required assumptions for statistical
testing.
D) allows you to adjust for potentially confounding variables
59. What type of relationship between a dependent and independent variable is
described by linear regression?
A) An exponential relationship
B) A parabolic relationship
C) A linear relationship
D) A threshold effect
60. Linear regression is appropriate for what type of dependent variable?
A) Ratio scale and normally distributed
B) Count
C) Dichotomous
D) Polytomous
61. In a study looking at the association of mean hours worked per day (hours) and age
(years) with body mass index (BMI), the researchers used multivariate liner regression.
The regression equation is below. What BMI would you predict from this equation for
someone age 25 years who works 45 hours per week?
Y = -3.95 + 0.04(hours worked) + 1.21(age)
A) 32.05
B) 28.1
C) -2.7
D) 66.05
62. What measure(s) of the strength and direction of association can we get from a
linear regression model? (Select all that apply.)
A) One-way ANOVA
B) Chi-square
C) Risk ratio
D) Odds ratio
E) Regression coefficient (beta)

F) F-statistic
63. A multivariate regression model looking at the association of intraocular pressure
(mm hg) with (1) having seasonal allergies, (2) wearing contact lenses, and (3) using eye
drops found a coefficient of determination (r2) of 0.23. How would you interpret this
statistic?
A) The association between having seasonal allergies, use of contact lenses, and
use of eye drops with intraocular pressure is not significant at the 0.05 level.
B) The intraocular eye pressure of those who have seasonal allergies is
0.23
mm hg higher compared to those without allergies, adjusting for use of
contact lenses and eye drops.
C) Those who wear contact lenses, have seasonal allergies, and use eye drops
have a 37% chance of having high intraocular pressure.
D) seasonal allergies, contact lens use, and eye drop use taken together explain
23% of the variation in intraocular eye pressure
64. What do we mean by regression toward the mean?
A) The linear regression equation can be used to identify the average value of
each variable in the model.
B) The phenomenon that if a variable is extreme on its first measurement, it will
tend to be closer to the average on a second measurement.
C) Linear regression normalizes the scale of the variables so they have a mean of
zero and standard deviation of 1.
D) Outliers in the model should always be excluded when using linear regression
65. A strong correlation (>0.85) between the independent variables that you want to
include in one multivariate linear regression model is an indication that which
assumption has been violated?
A) Multicolinearity
B) Residual analysis
C) Homoscedasticity
D) Normal distribution of the dependent variable
66. What is the difference between stepwise methods versus the standard approach and
hierarchical methods of selecting independent variables to include in a linear regression
model?
A) Stepwise uses theoretical rational, whereas the other approaches use statistical
criteria for selecting the variables for inclusion.
B) Stepwise methods allow you to include more variables without increasing the
probability of making a type I error.
C) Stepwise uses statistical criteria for selecting the variables to be included,
whereas the other approaches use theoretical rational.
D) Stepwise is better for hypothesis testing than the other approaches because it
is less influenced by chance
67. Health care practices have improved due in part to the use of
A) experimentation.
B) observation.
C) systemic review of research evidence.
D) trial-and-error
68. One response to the nursing shortage has resulted in the recruitment of nurses with
various educational levels. The results of one research study found that the level of
nursing education is critical in caring for certain patients. Therefore, we can conclude
that
A) proper education results in better patient care.
B) increasing the number of nurses without concern for educational level has
serious implications for critically ill patients.
C) proper mentoring will ensure the nursing staff is well prepared.
D) on-the-job training will provide adequate exposure to proper nursing
techniques for all levels of patients
69. Simple numerical or graphical summaries are referred to as
A) explanation statistics.
B) inferential statistics.
C) control statistics.
D) descriptive statistics
70. Techniques that allow conclusions to be drawn about the relationships found among
different variables in a population sample are referred to as
A) descriptive statistics.
B) inferential statistics.
C) predictive statistics.
D) explanatory statistics
71. A study that often attempts to understand how variables are related to each other is
referred to as
A) predictive studies.
B) explanatory studies.
C) inferential statistics.
D) descriptive statistics
72. A research study needs to be well planned before starting. A common mistake seen
in research is
A) reviewing existing literature on the subject.
B) developing a detailed study plan.
C) defining general and ambiguous terms or language.
D) failing to recognize approach limitations
73. The study plan hypotheses
A) will be changed as the study evolves.
B) must be very specific.
C) can be decided after the data are collected.
D) can be generalized
74. The hypotheses of the study should

A) be based on expert opinion.
B) state the expected relationship between the variables.
C) be updated at the end of the study.
D) predetermine the study results
75. A research project that follows the participants forward in time to identify future
outcomes is referred to as
A) a quasi-experimental.
B) a cohort study.
C) a cross-sectional.
D) a casecontrol study
76. When defining a good research study, which of the following should be included?
A) Description of the sample and how it was obtained
B) List of the hypotheses to be tested
C) Statement of the problem
D) Description of the planned statistical analysis
E) The expected outcome
77. A pediatrician will compare the 6-week-old infant with the accepted normal growth
progress using the
A) percentile rank.
B) nominal scale.
C) frequency table.
D) stemplots.
78. Commonly used techniques of presenting and organizing statistical data include
which of the following?
A) Graphical displays
B) Frequency distributions
C) Summary charts
D) Descriptive statistics
79. The first step to deciding which statistical technique to use is
A) to identify the question to be answered.
B) to determine the hypotheses.
C) to identify the variables.
D) to determine which measurement scale to use
80. The proper graph to demonstrate the following research data would be
Weight Range Raw Frequency
100119 2
120139 4
140159 5
160179 6
180199 11
200219 4
220239 3
A) histogram.
B) pie chart.
C) stem-and-leaf plot.
D) flow chart
81. Data values that do not fit the pattern of the rest of the data are
A) deleted from the study.
B) considered outliers.
C) reinvestigated.
D) put in a separate category
82. The marginal probability is computed by
A) the total number of times that the event could have occurred divided by the
number of times the event occurred.
B) the number of times the event occurred divided by the total number of times
that it could have occurred.
C) dividing the subset by the total number of times and then multiplying by 100.
D) subtracting the number of times the event occurred from 100 and then
dividing by the total number of times that it could have occurred
83. The set of probabilities associated with each possible outcome in the sample space is
referred to as
A) t distribution.
B) marginal distribution.
C) probability distribution.
D) conditional distribution
84. The normal distribution is a frequency polygon that demonstrates which of the
following properties?
A) Bell shaped
B) The population mean is shown as .
C) Mean, median, and mode are equal.
D) Total area under the curve above the x-axis equals 1
85. The _____ of a test is the probability that the test result and the diagnosis agree.
A) sensitivity.
B) specificity.
C) predictive value.
D) efficiency
86. Confidence intervals can be computed around the population mean from a single
simple mean using the
A) central limit theorem.
B) normal distribution.
C) predictive value.
D) joint probability
87. ________ probability is the probability that one event will occur given that another
event has occurred.
A) Joint
B) Conditional
C) Marginal
D) Addition
88. The process for testing hypotheses includes which of the following?
A) Make sure the data meet the necessary assumptions to compute the test
statistic.
B) Obtain the p-value of computed statistic.
C) Define significance level.
D) Compute the parameters that are being compared by the test statistic.
E) Compute the test statistic
89. The local health department is interested in determining if there is a difference in
the incidence of childhood caries among children living in rural areas drinking
untreated well water and children living in cities drinking fluorinated water. A null
hypothesis could read
A) the oral health of the rural children will be significantly worse than that of the
city children.
B) the oral health of the city children will be significantly worse than that of the
rural children.
C) the oral health of the city children will be significantly better than that of the
rural children.
D) the oral health of the rural children will not differ from that of the city
children
89. The alpha-level is defined by the researcher before any statistical tests are
conducted. This specific level is defined as
A) the p-value.
B) the standard deviation.
C) error of inference.
D) statistically significant
90. The researcher has determined the project has an = 0.05. This indicates all values
outside ____ will be rejected.
A) 45%
B) 55%
C) 75%
D) 95%
91. A null hypothesis was not rejected, and later it was found that the results were false.
This is an example of
A) a type I error.
B) a type II error.
C) a type III error.
D) a type IV error
92. A potential mistake when interpreting the data is to reject the H0 when it is true.
This is referred to as
A) a type I error.
B) a type II error.
C) a type III error.
D) a type IV error
93. Researchers articulate the expected relationships between the variables by the
_______.
A) null hypothesis
B) alternative hypothesis
C) parameters
D) statistic
94. Data values that do not fit the pattern of the rest of the data are
A)reinvestigated.
B)put in a separate category.
C)considered outliers.
D)deleted from the study
95. A strong correlation (>0.85) between the independent variables that you want to
include in one multivariate linear regression model is an indication that which
assumption has been violated?
A)Normal distribution of the dependent variable
B)Residual analysis
C)Multicolinearity
D)Homoscedasticity
96. Commonly used techniques of presenting and organizing statistical data include
which of the following?
A)Descriptive statistics
B)Graphical displays
C)Frequency distributions
D)Summary charts
97. A psychologist is interested in learning how well his clients are doing with a certain
treatment. He asks his clients to complete the following survey. The data collected will
be used to create
After taking the medication, I feel
0Depressed after 15 hours
3No change
A)an interval scale.

B)a ratio scale.
C)a summary scale.
D)an ordinal scale
98. The set of probabilities associated with each possible outcome in the sample space is
referred to as
A)probability distribution.
B)t distribution.
C)conditional distribution.
D)marginal distribution
99. A null hypothesis was not rejected, and later it was found that the results were false.
This is an example of
A)a type IV error.
B)a type III error.
C)a type I error.
D)a type II error.
100. One of the important benefits of multivariate regression over many other statistical
tests is that it
A)ensures that you are fulfilling all the required assumptions for statistical
testing
.
B)allows you to adjust for potentially confounding variables.
C)allows you to prevent measurement error.
D)ascertains temporal order that is necessary for causal inference
101. A research project that follows the participants forward in time to identify future
outcomes is referred to as
A)a cross-sectional.
B)a cohort study.
C)a case-control study.
D)a quasi-experimental
102. In a study looking at the correlation between annual income and cholesterol level,
one concern was that the age of the participants might be a confounder because both
income and cholesterol generally increase with age. Which of the following correlation
methods could the researchers use to remove the influence of age on the correlation
between income and cholesterol level?
A)Point-Biserial
B)Kendall's Tau
C)Spearman correlation coefficient
D)Partial correlation
103. Statistics use distinct symbols to represent specific information. Parameters are
usually represented by
A)lowercase English letters.
B)lowercase Roman numerals.
C)owercase Roman letters.

D)lowercase Greek letters
104. Researchers conducting a study with ____ power have a _____ probability of
committing a type ____ error.
A)high; low; II
B)low; high; I
C)high; low; I
D)low; high; II
105. The ______ of any statistical test represents the probability that the results were
obtained by chance alone.
A)-level
B)-level
C)p-value
D)-value
106. A multivariate regression model looking at the association of intraocular pressure
(mm hg) with (1) having seasonal allergies, (2) wearing contact lenses, and (3) using eye
drops found a coefficient of determination (r2) of 0.23. How would you interpret this
statistic?
A)seasonal allergies, contact lens use, and eye drop use taken together explain
23% of the variation in intraocular eye pressure.
B)The association between having seasonal allergies, use of contact lenses, and
use of eye drops with intraocular pressure is not significant at the 0.05 level.
C)Those who wear contact lenses, have seasonal allergies, and use eye drops have
a 37% chance of having high intraocular pressure.
D)The intraocular eye pressure of those who have seasonal allergies is 0.23 mm
hg higher compared to those without allergies, adjusting for use of contact lenses
and eye drops
107. The local health department is interested in determining if there is a difference in
the incidence of childhood caries among children living in rural areas drinking
untreated well water and children living in cities drinking fluorinated water. A null
hypothesis could read
A)the oral health of the city children will be significantly worse than that of the
rural children.
B)the oral health of the rural children will not differ from that of the city children.
C)the oral health of the rural children will be significantly worse than that of the
city children.
D)the oral health of the city children will be significantly better than that of the
rural children
108. A potential mistake when interpreting the data is to reject the H0 when it is true.
This is referred to as
A)a type IV error.
B)a type I error.
C)a type II error.

D)a type III error
109. Researchers articulate the expected relationships between the variables by the
_______.
A)statistic
B)null hypothesis
C)alternative hypothesis
D)parameters
110. The study plan hypotheses
A)will be changed as the study evolves.
B)must be very specific.
C)can be decided after the data are collected.
D)can be generalized
111. The _____ of a test is the probability that the test result and the diagnosis agree.
A)specificity.
B)predictive value.
C)efficiency.
D)sensitivity
112. In a study looking at the association of mean hours worked per day (hours) and age
(years) with body mass index (BMI), the researchers used multivariate liner regression.
The regression equation is below. What BMI would you predict from this equation for
someone age 25 years who works 45 hours per week?
Y = - 3.95 + 0.04(hours worked) + 1.21(age)
A)66.05
B)28.1
C)32.05
D)-2.7
113. One response to the nursing shortage has resulted in the recruitment of nurses with
various educational levels. The results of one research study found that the level of
nursing education is critical in caring for certain patients. Therefore, we can conclude
that
A)proper education results in better patient care.
B)increasing the number of nurses without concern for educational level has
serious implications for critically ill patients.
C)proper mentoring will ensure the nursing staff is well prepared.
D)on-the-job training will provide adequate exposure to proper nursing techni
ques for all levels of patients
114. The researcher has determined the project has an = 0.05. This indicates all values
outside ____ will be rejected.
A)75%
B)45%
C)95%
D)55%
115. Under which of the following conditions would you need to use the Spearman
correlation coefficient and not the Pearson correlation coefficient?
A)One variable on the ordinal measurement scale and one is on the interval
measurement scale.
B)Neither variable is normally distributed.
C)There is only one variable to be examined.
D)The two variables are normally distributed.
E)There are only three outliers.
F)The two variables do not have a linear relationship
116. Confidence intervals can be computed around the population mean from a single
simple mean using the
A)predictive value.
B)central limit theorem.
C)joint probability.
D)normal distribution
117. The alpha-level is defined by the researcher before any statistical tests are
conducted. This specific level is defined as
A)the standard deviation.
B)error of inference.
C)statistically significant.
D)the p-value
118. Which of the following is not a part of the research project that is used statistically?
A)Interpretation of data
B)Peer review of data
C)Organization of data
D)Collection of data
119. Researchers rely on ________ to assist them in drawing meaningful conclusions
on data collected on a population through their research project.
A)the probability theory
B)the empirical distribution
C)central limit theorem
D)the theoretical distribution
120. The probability that the laboratory test is positive when given to a group of patients
who have the disease is referred to as
A)efficiency.
B)specificity.
C)sensitivity.
D)predictive value
121 A study looking at the correlation between cholesterol level and distance between
home and work found a correlation coefficient of -0.13. How much of the variance in
cholesterol level is explained by distance between home and work (ie, what is the
coefficient of determination)?
A)0.17%
B)0.13%
C)13%
D)1.7%
122. The first step to deciding which statistical technique to use is
A)to identify the variables.
B)to determine which measurement scale to use.
C)to determine the hypotheses.
D)to identify the question to be answered
123. Health care practices have improved due in part to the use of
A)trial-and-error.
B)systemic review of research evidence.
C)observation.
D)experimentation
124. Techniques that allow conclusions to be drawn about the relationships found
among different variables in a population sample are referred to as
A)inferential statistics.
B)explanatory statistics.
C)predictive statistics.
D)descriptive statistics
125. When determining the class intervals in creating a relative frequency chart, it is
important to remember each class must be
A)mutually inclusive.
B)limited in number.
C)mutually exclusive.
D)broad in their width
126. Simple numerical or graphical summaries are referred to as
A)control statistics.
B)explanation statistics.
C)inferential statistics.
D)descriptive statistics
127. What is the difference between stepwise methods versus the standard approach and
hierarchical methods of selecting independent variables to include in a linear regression
model?
A)Stepwise is better for hypothesis testing than the other approaches because it is
less influenced by chance.
B)Stepwise uses statistical criteria for selecting the variables to be included,
whereas the other approaches use theoretical rational.
C)Stepwise methods allow you to include more variables without increasing the
probability of making a type I error.
D)Stepwise uses theoretical rational, whereas the other approaches use statistical
criteria for selecting the variables for inclusion
128. The hypotheses of the study should
A)be based on expert opinion.
B)be updated at the end of the study.
C)predetermine the study results.
D)state the expected relationship between the variables
129. The process for testing hypotheses includes which of the following?
A)Define significance level.
B)Compute the test statistic.
C)Compute the parameters that are being compared by the test statistic.
D)Obtain the p-value of computed statistic.
E)Make sure the data meet the necessary assumptions to compute the test
statistic
130. The _______ can be used to test whether the sample mean is significantly
different from a population mean.
A)one-sample z score
B)one-sample z-test
C)one-sample f-score
D)one-sample t test
131. How would you interpret a correlation coefficient of -1?
A)No correlation at all
B)Strong positive correlationas one variable increases, so does the other
C)Perfect negative correlationas one variable increases, the other variable
decreases
D)Weak negative correlationas one variable increases, the other variable
decreases slightly
132. A research study needs to be well planned before starting. A common mistake seen
in research is
A)reviewing existing literature on the subject.
B)developing a detailed study plan.
C)defining general and ambiguous terms or language.
D)failing to recognize approach limitations
133. Linear regression is appropriate for what type of dependent variable?
A)Ratio scale and normally distributed
B)Count
C)Dichotomous
D)Polytomous
134. A pediatrician will compare the 6-week-old infant with the accepted normal growth
progress using the
A)frequency table.
B)percentile rank.
C)stemplots.
D)nominal scale
135. The normal distribution is a frequency polygon that demonstrates which of the
following properties?
A)Mean, median, and mode are equal.
B)Bell shaped
C)Total area under the curve above the x-axis equals 1.
D)The population mean is shown as
136. In ________ probability, the distribution of events can be inferred without
collecting data.
A)classical
B)relative frequency
C)theoretical
D)empirical
137. ________ probability is the probability that one event will occur given that
another event has occurred.
A)Addition
B)Joint
C)Conditional
D)Marginal
138. Which of the following are assumptions required for the Pearson correlation
coefficient to be valid?
A)Homoscedasticity
B)The two variables must be normally distributed.
C)The two variables must be of ordinal, interval, or ratio measurement scale.
D)There are no outliers.
E)The two variables must have a linear relationship.
F)The two variables must be of interval or ratio measurement scale
139. The marginal probability is computed by
A)the number of times the event occurred divided by the total number of times
that it could have occurred.
B)dividing the subset by the total number of times and then multiplying by 100.
C)subtracting the number of times the event occurred from 100 and then dividing
by the total number of times that it could have occurred.
D)the total number of times that the event could have occurred divided by the
number of times the event occurred
140. A study that often attempts to understand how variables are related to each other is
referred to as
A)descriptive statistics.
B)explanatory studies.
C)predictive studies.
D)inferential statistics
141. What do we mean by "regression toward the mean?"
A)The linear regression equation can be used to identify the average value of each
variable in the model.
B)Linear regression normalizes the scale of the variables so they have a mean of
zero and standard deviation of 1.
C)Outliers in the model should always be excluded when using linear regression.
D)The phenomenon that if a variable is extreme on its first measurement, it will
tend to be closer to the average on a second measurement
142. What measure(s) of the strength and direction of association can we get from a
linear regression model? (Select all that apply.)
A)Risk ratio
B)One-way ANOVA
C)Chi-square
D)Regression coefficient (beta)
E)F-statistic
F)Odds ratio
143. When defining a good research study, which of the following should be included?
A)Description of the planned statistical analysis
B)Statement of the problem
C)List of the hypotheses to be tested
D)Description of the sample and how it was obtained
E)The expected outcome
144. A study was conducted looking at the association between number of polyps in the
colon and mean grams of fiber consumed per day based on a standard food frequency
questionnaire. The Pearson correlation coefficient was 0.9 with a p-value of 0.003 and
the Spearman correlation coefficient was 0.28 with a p-value of 0.049. Which form of
correlation would be more appropriate to report and why?
A)The Pearson correlation results should be reported because the two variables
are normally distributed.
B)The Spearman correlation results should be reported because the p-value is
closer to 0.05.
C)The Pearson correlation results should be reported because it shows a stronger
correlation with a smaller p-value (more significant).
D)The Spearman correlation results should be reported because at least one of
the variables does not meet the distribution assumption required to use Pearson
correlation
145. What type of relationship between a dependent and independent variable is

described by linear regression?
A)An exponential relationship
B)A parabolic relationship
C)A linear relationship
D)A threshold effect
146. Squaring the Pearson correlation coefficient (r2) tells us that
A)whether or not we need to conduct a Spearman test of correlation instead.
B)how much variance is shared by the two variables.
C)the direction of the association (positive or negative).
D)whether the correlation is statistically significant at the 0.05 level
147. Why is it important for the nurse to use evidence-based practice guidelines in the
clinical setting?
A. to meet federal requirements
B. to attain credits towards promotion
C. to improve patient health outcomes
D. to follow state board of nursing procedure
148. Why is it vital for nurses to inform policy makers of current health care problems?
A. Legal issues can be prevented for patients.
B. Policies can be developed to resolve problems.
C. Policy makers can get more votes during elections.
D. Nurses can meet mandates of state boards of nursing

BIOSTAT

Uploaded by

Copyright:

Available Formats

BIOSTAT

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

BIOSTAT

Uploaded by

Copyright:

Available Formats

What are the different types of measurement scales?

What is the difference between descriptive and inferential statistics?

1.

The equation xi2 is best known as the:

A. It can be used to compare variation between two or more variables.

16. A normal distribution is characterized by

24. Power is defined by

39. The r2 measures

E) Regression coefficient (beta)

74. The hypotheses of the study should

A)an interval scale.

C)owercase Roman letters.

C)a type II error.

145. What type of relationship between a dependent and independent variable is

You might also like