Treatment testing_Lecturs slides part 1
Treatment testing_Lecturs slides part 1
Parametric or non-parametric?
• A treatment test always has a null hypothesis and an alternative
hypothesis. Usually, the null hypothesis is that there is no effect.
* If alternative hypothesis specifies the direction of the effect, one-tailed tests can
be conducted and the p-value is only half of two-tailed test (usually, not always!).
1. Non-normal distributions:
* Sufficiently large sample sizes make it possible to appeal to the central limit
theorem (CLT): the standardized mean of a sample follows a normal distribution
even when the sample is drawn from a distribution that is not normal.
*If sample sizes are low, bootstrapping ensures that inferences made from
parametric tests are valid regardless of the distribution.
2. Heterogeneous variances:
Some parametric tests are still robust. Additionally, for some tests there are test
statistics for unequal variances along with the test for equal variances.
7
• If the seller has invested, the buyer proposes how to split the 100
units.
The treatments
The game is played with three different treatment groups:
T2: Buyer can send a message to the seller before the seller
makes the investment decision.
T3: Seller can send a message to the buyer along with the
investment decision.
You can do the analysis on your own. Use the following data set:
„IndSamples MeanDifferences.dta“. This part of the lecture is based on
Moffat (2016), chapter 3.
10
hist depvar, disc freq normal Normal density superimposed on the histogram,
with the same mean and std as the data
20
Minimum for
executing the
command
15
Frequency
10
5
0
0 20 40 60 80 100
buyer's offer to seller
Conclusion:
Sample means
Pooled std as a weighted average
of the two individual sample std Sample sizes
Group Obs Mean Std. Err. Std. Dev. [95% Conf. Interval]
• In case of a p-value smaller than 0.05 you reject the null hypothesis that there is
no difference between the two groups.
• Here: Using the two-sided(!) test there is only „mild/ suggestive evidence“ for a
treatment effect.
16
Group Obs Mean Std. Err. Std. Dev. [95% Conf. Interval]
• Procedure:
1 14 220.5 252
3 21 409.5 378
Given that nonparametric tests are more suitable for this specific data, we find no
evidence for a statistically significant difference between the treatments.
• Rank sum test, however, is more powerful since it uses the rank of
each observation instead of only the relation of a score to the
median value in the distribution (above or below).
• In case of ordinal data, you have no choice: only rank sum test
appropriate!
80
• cibar depvar, over(treatment)
70
Some graphical adjustments still
mean of offer
60
necessary, use the graph editor. You
50
can also add text, e.g. p-values…
40
30
25
Example:
Testing H1 to H4 with p1=0.01, p2=0.04, p3=0.03 and p4=0.005.
outcome subgroup treatme~1 treatme~2 diff_in~s Remark3_1 Thm3_1 Remark3_7 Bonf Holm
outcome subgroup treatme~1 treatme~2 diff_in~s Remark3_1 Thm3_1 Remark3_7 Bonf Holm
• Attention: The subgroup should not be coded as 0 vs. 1, use instead 1 vs. 2!
The treatment variable, however, has to contain values of 0.