1
Anchoring Climate Change Communications
Adam J. L. Harris
University College London
Han-Hui Por
Fordham University
Stephen B. Broomell
Carnegie Mellon University
Author Note
Adam J. L. Harris, Department of Experimental Psychology, University College
London; Han-Hui Por, Department of Psychology, Fordham University; Stephen B.
Broomell, Department of Social and Decision Sciences, Carnegie Mellon University.
Han-Hui Por is now at Educational Testing Service, Princeton, New Jersey.
We thank David Budescu for discussions and comments on a previous draft.
Correspondence concerning this article should be addressed to Adam J. L. Harris,
Department of Experimental Psychology, University College London, London, WC1E 6BT,
UK. E-mail: adam.harris@ucl.ac.uk
2
Abstract
Verbal probability expressions (VPEs) are frequently used to communicate risk and
uncertainty. The Intergovernmental Panel on Climate Change attempts to standardise the use
and interpretation of these expressions through a translation scale of numerical ranges to
VPEs. A common issue in interpreting VPEs is the tendency for individuals to interpret VPEs
around the mid-point of the scale (i.e., around 50%). Previous research has shown that
compliance with the IPCC’s standards can be improved if the numerical translation is
presented simultaneously with the VPE, reducing the regressiveness of interpretations. We
show that an explicit statement of the lower or upper bound implied by the expression (e.g.,
0-33%; 66-100%) leads to better differentiated estimates of the probability implied by ‘likely’
and ‘unlikely’ than when the bound is not explicitly identified (e.g., less than 33%; greater
than 66%).
Keywords: risk communication; verbal probability expressions; pragmatics;
Intergovernmental Panel on Climate Change; International Accounting Standards;
3
Anchoring Climate Change Communications
Tackling climate change is a global challenge that requires a unified understanding of
potential risks and losses attributable to human activities. The Intergovernmental Panel on
Climate Change (IPCC) is the body charged with the dissemination of information about
climate change to both policy makers and the general public. As with any scientific evidence,
there exists some degree of uncertainty in any particular observation or prediction. In some
instances, the amount of agreement or evidence will be insufficient to quantify this
uncertainty. In these instances, standardised qualitative reports of confidence are prescribed
(see Figure 1 in Mastrandrea et al., 2010). Where such quantification is, however, possible,
the IPCC prescribes the use of words, also known as verbal probability expressions (VPEs),
rather than numbers to communicate likelihood (e.g., “It is very likely that hot extremes, heat
waves, and heavy precipitation events will continue to become more frequent” (IPCC, 2007,
p. 15).
VPEs effectively convey the understanding that probability estimates are often fuzzy
concepts (e.g., Wallsten, 1990). It has long been known, however, that there is considerable
interpersonal variation in people’s interpretation of VPEs (e.g., Budescu & Wallsten, 1985,
1995; Beyth-Marom, 1982; Dhami & Wallsten, 2005; Karelitz & Budescu, 2004), suggesting
that VPEs can give rise to an “illusion of communication” (Budescu & Wallsten, 1995, p.
299). Additionally, the usage of VPEs can change depending on context, adding another layer
of complexity to standardizing the use of VPEs (e.g., Beyth-Marom, 1982).
In an effort to reduce the variability in the interpretation of its VPEs, the IPCC
provides guidelines for the numerical ranges that should be communicated with each VPE
(Table 1). Recent research on the interpretations of VPEs in the IPCC reports has
demonstrated large amounts of between person variability in these interpretations (Budescu,
4
Broomell, & Por, 2009; Budescu, Por, & Broomell, 2012; Budescu, Por, Broomell, &
Smithson, 2014; Harris, Corner, Xu, & Du, 2013). Moreover, overall, interpretations are
typically highly regressive (i.e., interpretations tend to be closer to 50% than the prescribed
meaning of the phrase). The regressiveness of interpretations results in less differentiation
between phrases such as ‘likely’ and ‘unlikely’ than is intended by the IPCC (since estimates
of both are ‘pulled’ towards 50%). For example, in Budescu et al. (2009), 64% of ‘best
estimates’ of the terms ‘very unlikely’, ‘unlikely’, ‘likely’ and ‘very likely’ were regressive
and outside the prescribed range for those terms.
Efforts to standardize the meaning of VPEs by providing a translation table (Table 1)
somewhat reduce the variability in interpretations and increase correspondence with the IPCC
guidelines (54% were inconsistent with the prescribed range - Budescu et al., 2009). Budescu
and colleagues (Budescu et al., 2009; Budescu et al., 2012; Budescu et al., 2014) have
additionally shown that the correspondence between interpretations and the IPCC’s
guidelines can be further increased with the use of a joint (verbal-numerical) presentation
format. This format reduces the variability of interpretations across participants as well as
the regressiveness in interpretations of VPEs. The joint presentation format provides the
numerical definition directly alongside each usage of a VPE (e.g., “It is very likely (greater
than 90%) that hot extremes, heat waves, and heavy precipitation events will continue to
become more frequent”). Despite the greater differentiation between VPEs, Budescu and
colleagues found interpretations to remain highly regressive, even with the joint verbalnumerical format (47% of responses were still inconsistent with the prescribed range). We
build upon this past work, testing whether another presentation difference can further reduce
the regressiveness of interpretations.
The IPCC (2007) guidelines for the fourth assessment report (AR4; see Table 1) were
somewhat ambiguous as to whether the numerical ranges for different VPEs were intended to
5
overlap. Indeed, a pragmatic interpretation of the IPCC’s meaning of ‘likely’ might lead one
to the assumption that (for example) the range for ‘likely’ is really 67-90% (i.e., suggesting a
lack of overlap with the range prescribed for ‘very likely’). For if the communicator knows
the probability is greater than 90%, they should maximise the informativeness of their
communication by choosing the more precise term (e.g., Grice, 1975/2001). We term this a
‘curtailed range’ assumption. The guidelines for AR5 (Mastrandrea et al., 2010; see Table 1)
were amended to make clear, for example, that the range of acceptable values for ‘likely’
extended as far as 100%, and did not stop at 90%. In the present paper, we test the
effectiveness of this strategy by comparing interpretations of verbal-numerical presentation
formats with numerical labels presented as in AR4 (single-anchor) versus AR5 (two-anchor).
There are two reasons to predict that interpretations should be less regressive in the
two-anchor condition than the single-anchor condition:
Firstly, in line with the intentions of Mastrandrea et al. (2010), making explicit the
fact that the range of (e.g.) ‘likely’ extends to 100%, rather than being curtailed at 90%,
effectively increases the upper limits of the estimate, allowing estimates to be spread over a
larger range. We term this the ‘extended range’ account where the midpoint of the perceived
range is higher in the explicit extended range than in the ambiguous curtailed range.
Secondly, the effect might be seen as an instance of anchoring (e.g., Tversky &
Kahneman, 1974), where the bounds pull judgments towards them. By not explicitly stating
the implied lower bound (0%) for ‘very unlikely’ (or upper bound of 100% for ‘very likely’)
the single-anchor presentation draws attention to the upper bound (10%) for ‘very unlikely’
(and the lower bound of 90% for ‘very likely’). Such an effect would be countered by the
value of 0 or 100 presented in a two-anchor condition. Anchoring effects have been
demonstrated in the laboratory using a variety of methodologies (for a review see Furnham &
Boo, 2011). Most commonly, participants first determine whether a target value is greater or
6
less than an anchor value. For example, demonstrating anchoring in probability judgments,
Plous (1989) asked participants ‘Is the chance of nuclear war between the United States and
the Soviet Union greater or less than 1%.’ Participants who first answered this question later
judged the likelihood of nuclear war as 9%, compared with an estimate of 19% for those who
didn’t first answer this question. Other studies have, however, observed anchoring effects in
consequential applied domains without an initial comparison question. Stewart (2009; see
also, Navarro-Martinez, Salisbury, Lemon, Stewart, Matthews, & Harris, 2012), for example,
observed that participants paid off less of a hypothetical credit card statement when a
minimum payment was specified than when it was not. Stewart proposed that the minimum
payment amount acted as an anchor, which reduced people’s estimates of how much they
should repay.1
On the basis of the mechanisms outlined above, we predict that best estimates of the
numerical probability will be less regressive with a two-anchor presentation than with a
single-anchor presentation. ‘Less regressive’ means that estimates of low probability
expressions (below 50%) should be lower, whilst those of high probability expressions
(above 50%) should be higher. We therefore predict an interaction between verbal probability
expression and presentation format, such that numerical estimates for ‘likely’ and ‘very
likely’ are predicted to be higher and estimates for ‘unlikely’ and ‘very unlikely’ are
predicted to be lower with a two-anchor presentation (such that both move further from
50%).
Although the current study is not intended to tease apart the extended range and
anchoring explanations, there are certain patterns of results predicted to be generated by each
mechanism. Consider a hypothetical participant who believed that ‘unlikely’ and ‘very
Strictly speaking, ‘anchoring’ is an effect rather than a mechanistic explanation. We use the term here,
however, to refer to a general assimilative effect of a provided numerical value on an estimate, rather than being
concerned with the precise underlying mechanism (for discussions of the major theories of anchoring see e.g.,
Furnham & Boo, 2011; Mochon & Frederick, 2013; Newell & Shanks, 2014).
1
7
unlikely’ were not intended to overlap and who picked the central value of the range as their
best estimate. A possible response is one whereby the minimum, best and maximum
estimates of ‘unlikely’ in the single anchor condition are 10%, 21% and 33% respectively.
Upon understanding that the lower end of the range extended all the way to zero (in the twoanchor condition for example), a participant with this response strategy would update their
estimates to 0%, 16% and 33%. Although consistent with an anchoring account, the most
parsimonious explanation for such an effect (whereby the maximum estimate is unchanged
for ‘unlikely’ and the minimum estimate is unchanged for ‘likely’) would seem to be the
extended range account. In contrast, if both minimum and maximum estimates are similarly
affected by the manipulation, this result would seem to be more consistent with a general
anchoring account.
Our conceptualisation of the AR4 guidelines as a single-anchor format and AR5 as a
two-anchor format can be thought of as synonymous with Teigen, Halberg and Fostervold’s
(2007a, 2007b) terminology of single bound and range, respectively. Teigen et al. (2007a,
Study 2) reported that best estimates of the price of skis described as costing less than 1500
Norwegian Krone (NOK 1500) were higher than estimates of skis described as costing
between NOK 500 and 1500 NOK. Similarly, estimates for shoes described as costing more
than NOK 500 were lower than for shoes costing between NOK 500 and NOK 1500. The
direction of effects is therefore as predicted in the current study. The situation is, however,
rather different. This difference arises from our focus on a probability scale, which is
bounded. With unbounded scales (at least at the upper end) such as price, there is no
indication as to what a plausible range is. Consequently, a Gricean interpretation would be
that the price should be quite close to the given value, otherwise a range would have been
specified. The range presentation thus provides additional information in such situations. In
Table 1, and the forthcoming experiments, the bounded probability scale ensures that an
8
upper and lower bound is present in both presentation formats. Notably, in the information
provided to participants, this bound is formally equivalent in the single- and two-anchor
conditions. As a result of this equivalency, there is no guarantee that the results observed in
Teigen et al. (2007a) will generalise to the present scenarios.
Judgments about climate change are highly politicised (e.g., Leiserowitz, Maibach,
Roser-Renouf, & Hmielowski, 2011), and may provide a difficult and unique context for
communicating uncertainty. VPEs can be (and have been) used in a number of contexts to
present uncertainty information. To enhance the generality of the present research, we
additionally test our manipulation of the single and two anchor formats in sentences taken
from the International Accounting Standards (IAS; Deloitte, 2008).
Method
Participants
Two hundred and eighty two US-based Mechanical Turk workers completed the
experiment. Sixty one of these failed the attention check (or did not complete it as they did
not finish the survey). Of the remaining 221 participants, 69 were female, and the age range
was 18-71 (median = 30 years; IQR = 11 years).
Design and Materials
A 2 (anchor) x 4 (VPE) mixed design was employed, with anchor condition
manipulated between-participants and VPE manipulated within-participants. The anchor
condition corresponded to whether the IPCC translations for the VPEs were presented with a
single anchor (e.g., “less than 10%” or “more than 90%”) or with two anchors (e.g., “0-10%”
or “90-100%”). The 4 VPEs used were ‘very unlikely’, ‘unlikely’, ‘likely’, and ‘very likely.’
9
Each VPE was embedded in two separate statements from the IPCC (2007, see Table 2). The
VPEs and their numerical translations were highlighted in yellow in the provided text (see
Table 2). The order of presentation of the sentences was randomised across participants. Four
additional sentences containing the terms ‘likely’ and ‘unlikely’ from the IAS (Deloitte,
2008) were also used, and these items were presented in the same anchor format as the IPCC
items. The IPCC items were always presented before the IAS items, as the IPCC items were
the main focus of the study.
All VPEs were presented with their numerical translations next to them (see Table 1),
and so the presentation format in the single-anchor condition was identical to the verbalnumerical condition of Budescu et al. (2009). The IAS items were presented with the same
numerical translations as the IPCC items.
Participants were asked to indicate the minimum, best and maximum probabilities that
they thought “the authors intended to communicate” [emphasis added] in each sentence.
Responses were constrained such that the best estimate was equal or more than the minimum
estimate and less than or equal to the maximum estimate. Responses were made by moving
sliders to provide estimates between 0 and 100% (see Figure 1).
At the end of the experiment, participants completed the same 5-item numeracy test
(Online Resource 1)2 as in Budescu et al. (2012). Participants also completed a short
demographic questionnaire, which included asking for participants’ year of birth, gender and
political affiliation: Strong Republican; Lean Republican; Independent; Lean Democrat;
Strong Democrat; Others. In analyses including this covariate, the first five options were
coded 1-5, whilst respondents reporting ‘other’ were excluded.
2
In this pre-print, all online resources are included at the end of this document.
10
Procedure
After participants consented to participate in the study, they were asked to indicate
their age and gender. At the start of both the IPCC items and the IAS items, participants were
introduced to these organisations and their guidelines for the interpretation of their
probability terms (in a table format, corresponding to the appropriate anchor condition – see
Table 1, although the inequality sign was presented verbally, i.e., “greater than / less than”).
Before proceeding to the main experimental task, participants were provided with a practice
example using the phrase “about as likely as not (33-66%)”, to ensure they were comfortable
using the response sliders. At the end of the IPCC and IAS tasks, participants completed the
numeracy test and the demographic questionnaire. Consistency between responses to the age
question at the start of the experiment, and the year of birth question in the final demographic
questionnaire served as an attention check.
Results
We first report analyses of the ‘best estimates’, before considering the range endorsed
by participants. We focus our analyses on the items taken from the IPCC report, and
subsequently report the analysis including the IAS context for ‘likely’ and ‘unlikely’ (as these
were the only two expressions included in the IAS context). The latter analysis reveals no
differences between the two contexts. All analyses used the average of participants’
interpretations for each VPE, across the items within each individual context.
IPCC
Mean ‘best estimates’ for the four VPEs across both anchor conditions are plotted in
Figure 2. A visual inspection of Figure 2 shows that, directionally, estimates are further from
11
50% (less regressive) in the two-anchor condition than the single-anchor condition for all
four VPEs, as predicted. A 2 (anchor condition) x 4 (VPE) mixed ANOVA revealed a main
effect of VPE, F(1.3, 283.2) = 3736, p < .001, etap2 = .95 (Greenhouse-Geisser correction
applied in cases when sphericity is violated.). The main effect of anchor condition was not
significant, F(1, 219) = 1.43, p = .233, but the predicted VPE x anchor condition interaction
was, F(1.3, 283.2) = 6.71, p = .006, etap2 = .03. Simple effects tests (following Howell, 1997)
showed that estimates were significantly different (and further from 50%) in the two-anchor
condition for both ‘unlikely’, F(1, 873.3) = 4.67, p = .03, etap2 = .02, and ‘likely’, F(1, 873.3)
= 16.32, p < .001, etap2 = .08. There was no anchor effect for either ‘very likely’ or ‘very
unlikely’ (Fs < 1).
IAS and IPCC
In an analysis including the IAS context, interpretations of ‘likely’ and ‘unlikely’ did
not differ between the contexts: main effect of context, F < 1, interaction between context
and VPE, F(1, 219) = 2.18, p = .142. Figure 3 therefore plots the mean estimates for ‘likely
and ‘unlikely’ in both anchor conditions, collapsed across context. Directionally, estimates
are further from 50% in the two-anchor condition than the single-anchor condition. This
result was borne out with a significant VPE x anchor condition interaction, F(1, 219) = 21.55,
p < .001, etap2 = .09, but this was not qualified by a 3-way interaction with context, F(1, 219)
= 1.32, p = .251, suggesting that the effect is comparable across both the IPCC and IAS
contexts. Separate ANOVAs performed on ‘likely’ and ‘unlikely’ suggested that the effect of
anchor condition was significant for both: ‘likely’, F(1, 219) = 24.41, p < .001, etap2 = .10;
‘unlikely’, F(1, 219) = 9.14, p = .003, etap2 = .040, with no effects of, or interactions
12
involving, context3. Finally, an ANCOVA confirmed that the overall pattern of results was
consistent when controlling for numeracy, political affiliation, age and gender (see Online
Resource 2 for distributions of political affiliations and numeracy scores).
To better understand the nature of the effect, we considered the range endorsed by
participants for the VPEs. To determine this range, participants’ minimum estimates were
subtracted from their maximum estimates. Considering only ‘likely’ and ‘unlikely’, in a 2
(context) x 2 (VPE) x 2 (anchor condition) ANOVA, there was a significant effect of anchor
condition, F(1, 219) = 44.92, p < .001, etap2 = .17, and a context x anchor condition
interaction, F(1, 219) = 5.03, p = .026, etap2 = .022. We therefore analysed the endorsed
range for the IPCC and IAS contexts separately.
Figure 4 plots the ‘minimum’, ‘best’ and ‘maximum’ estimates across the anchor
conditions for the four expressions used in the IPCC context. A visual inspection suggests
that the results are more consistent with the predictions of the extended range account. For
‘very likely’ and ‘likely’, the maximum estimate appears to increase more than the minimum
estimate. For ‘unlikely’ and ‘very unlikely’, the minimum estimate appears to decrease more
than the maximum estimate. A 4x2 (VPE x anchor condition) ANOVA revealed a significant
effect of VPE, F(2.2, 486.2) = 501.73, p < .001, etap2 = .70. Of more interest, there was also a
main effect of anchor condition, F(1, 219) = 13.71, p < .001, etap2 = .06, as well as a
significant VPE x anchor condition interaction, F(2.2, 486.2) = 8.60, p < .001, etap2 = .04
(there was no main effect of anchor condition, F < 1). Simple effects revealed that there was
a significant effect of anchor condition for ‘likely’, F(1, 614.9) = 26.7, p < .001, etap2 = .14,
and ‘unlikely’, F(1, 614.9) = 17.5, p < .001, etap2 = .09, with a larger range endorsed in the
two-anchor condition than the single-anchor condition. There was no effect of anchor
For ‘likely’, the main effect of context approached significance, F(1, 219) = 3.17, p = .076, with slightly higher
estimates in the IAS context (mean = 78.7) than the IPCC context (mean = 78.0).
3
13
condition on the endorsed range for either ‘very likely’ or ‘very unlikely’ (Fs < 1). As
suggested in Figure 4, the increased range for ‘likely’ in the two-anchor condition stems from
participants providing higher maximum estimates, t(164.3) = 5.70, p < .001, d = 0.77, with no
corresponding change in their minimum estimates, t(219) = 1.21, p = .230. Likewise, the
increased range for ‘unlikely’ in the two-anchor condition stems from participants providing
lower minimum estimates, t(193.1) = 4.27, p < .001, d = 0.57, with no change in maximum
estimates (t < 1). The results of the overall ANOVA held when numeracy, political
affiliation, age and gender were included as covariates in an ANCOVA.
Despite the interaction with context, the pattern of results from the IAS context
mirrored those from the IPCC context. A significant effect of VPE was observed, F(1, 219) =
6.56, p = .011, etap2 = .03. More importantly, however, a significant effect of anchor
condition was also observed, F(1, 219) = 20.40, p < .001, etap2 = .19, with a larger range
endorsed in the two-anchor condition than the single-anchor condition. As with the IPCC, the
increased range for ‘likely’ in the two-anchor condition stemmed from participants providing
higher maximum estimates (97.9% vs. 90.0%, t[163.7] = 6.601, p < .001, d = 0.90) with no
difference in minimum estimates across anchor conditions (64.6% vs. 64.0%, t < 1).
Likewise, the increased range for ‘unlikely’ in the two-anchor condition stemmed from
participants providing lower minimum estimates (1.9% vs. 8.6%, t[151.8] = 5.63, p < .001, d
= 0.77) with no difference in maximum estimates (33.7% vs. 34.3%, t < 1; see Online
Resource 3 for figure displaying full descriptive statistics). The overall effect of anchor
condition was also significant in an ANCOVA controlling for numeracy, political affiliation,
age and gender as covariates.4 Thus, the results from both the IPCC and IAS contexts are in
line with the predictions of the extended range account.
4
In contrast to the ANOVA results, there was no main effect of VPE on endorsed range in the ANCOVA.
14
Consistency with prescribed ranges
The reduced regressiveness of interpretations might lead to a greater number of best
estimates and ranges consistent with the prescribed ranges of the IPCC. Across all items, 94%
of best estimates were consistent in the two-anchor condition, versus 90% in the singleanchor condition, χ2(1) = 6.2, p = .013, with more estimates consistent in the two-anchor
condition across all VPEs (Online Resource 4, Table A). Following Budescu et al. (2009), we
defined an endorsed range as consistent if both upper and lower bounds were within the
prescribed range, as inconsistent if both were outside the prescribed range, and as partially
consistent otherwise. Eighty one percent of endorsed ranges were consistent in the twoanchor condition, compared with 77% in the single-anchor condition, χ2(2) = 6.23, p = .044,
with more estimates consistent in the two-anchor condition across all VPEs (Online Resource
4, Table B).
General Discussion
The overall pattern of results is clear, and consistent with the results of an additional
experiment, which recruited university students (Online Resource 5). For ‘likely’ and
‘unlikely,’ best estimates were less regressive with the two-anchor presentation than with the
single-anchor presentation. These results are aligned with the findings using absolute values
(e.g., cost, number of tables) from Teigen et al. (2007a, 2007b). Considering the analysis of
the possible range endorsed by participants, the pattern of results is as predicted by the
extended range account, with an increased endorsed range for both ‘likely’ and ‘unlikely.’
15
The lack of any effects for the extreme expressions, ‘very likely’ and ‘very unlikely’ is also
in line with the extended range account. In AR4, an individual assuming that the range only
extended to 99% (in the case of ‘very likely’) would only have this range extended by a
single percentage point when the extended range is made explicit (as in AR5 – although it
should be noted that simple ceiling and floor effects might also explain these results).5
Although the aim of the present paper was not to choose between two plausible
explanations for the effects we observe, the pattern of results is more consistent with the
extended range account, although we cannot rule out the additional potential influence of a
more general anchoring contribution, which, if present, would appear to exert a smaller effect
than the perceived extended range. Nonetheless, the effects themselves seem robust, holding
when controlling for the influence of potential covariates across two experiments from two
different populations.
The pattern of results observed was consistent across the IPCC and IAS contexts. The
similar pattern across contexts makes us confident that providing two anchors rather than one
in a VPE translation will reduce the regressiveness of interpretations across a variety of
contexts, not solely the ones considered here. Because interpretations of the IPCC’s VPEs
have typically been shown to be too regressive, the reduced regressiveness observed in the
two-anchor condition is an improvement, and indeed results in greater consistency with the
prescribed interpretations. The evaluation might, however, be considered more complex than
this. Although we observed greater differentiation between interpretations of ‘unlikely’ and
‘likely’, because no effect was observed for ‘very unlikely’ and ‘very likely’ this effect
simultaneously reduces the differentiations between ‘(un)likely’ and ‘very (un)likely.’
Ultimately, it is for policy makers to decide which terms are more important to differentiate.
5
Despite obtaining broadly consistent results, the experiment reported in Online Resource 5 found that a
lower best estimate for ‘unlikely’ was not associated with an increase in range. This experiment also did not
report a significant difference in consistency rates between conditions, although numerically the trend was in
the predicted direction in 11 out of 12 instances.
16
Currently, however, with ‘very (un)likely’ nested within ‘(un)likely’ (e.g., 5% risk can be
represented by either ‘unlikely’ or ‘very unlikely’), people’s interpretations in the two-anchor
condition would appear to be more in line with the intentions of the IPCC. Ultimately, it
would be beneficial for future research to identify a means for reducing the regressiveness of
interpretations of the extreme terms ‘very (un)likely,’ which are typically those for which the
most regressive responses have been observed in the past (e.g., Budescu et al., 2009).
The current research has been concerned with numerical interpretations of VPEs, in
the tradition of much work in this area (e.g., Budescu & Wallsten, 1985; Budescu et al., 2009,
2012, 2014; Harris & Corner, 2011; Harris et al., 2013; Ho, Budescu, Dhami, & Mandel, in
press; Mandel, 2015; Smithson, Budescu, Broomell, & Por, 2012). The effect of the current
format manipulation (or indeed those in Budescu et al., 2009, 2012, 2014) on decision
making is, however, less clear, and is an important topic for future research. Previous findings
that are likely to be relevant in this context include those showing that the use of only upper
bounds in describing a range (e.g., ‘less than 33%’ in the case of probability estimates)
encourages downwards comparisons, in the case of probabilities presumably directing
attention to an event’s non-occurrence, whilst lower bounds (e.g., ‘greater than 66%’)
encourage upwards comparisons (Teigen 2008; Teigen et al., 2007a). Such pragmatic
influences suggest further potential advantages of the use of the two-anchor format, since
reducing these influences can be expected to enhance standardisation in interpretation of the
IPCC’s probability phrases.
Conclusion
A number of researchers have criticised the verbal probability scale used by the IPCC
(Table 1). In light of such criticism, it is important that researchers not only highlight
improvements that could be made (e.g., Ho et al., in press, who argued that organisations
17
should use VPEs to represent the probabilities that empirical research shows people best
associate them with), but also acknowledge where changes made by the IPCC are
improvements. Following work demonstrating a benefit of using a verbal-numerical joint
presentation format (e.g., Budescu et al., 2014), we provide evidence that the explicit upper
and lower boundaries prescribed in IPCC AR5 further reduce the regressiveness of people’s
interpretations. The high profile of the reports produced by the IPCC, combined with their
global readership, ensures the importance of attention to any factor that can enhance
communication effectiveness. The inclusion of items from the IAS suggests that the benefit
conferred by a two-anchor format is not unique to climate related contexts. We therefore
recommend that such verbal-numerical presentations explicitly state both the upper and lower
bounds wherever a standardised treatment of VPEs is intended.
References
Beyth-Marom, R. (1982). How probable is probable? A numerical translation of verbal
probability expressions. Journal of Forecasting, 1, 257-269.
Budescu, D. V., Broomell, S., & Por, H-H. (2009). Improving communication of uncertainty
in the reports of the intergovernmental panel on climate change. Psychological
Science, 20, 299-308.
Budescu, D. V., Por, H-H., & Broomell, S. B. (2012). Effective communication of
uncertainty in the IPCC reports. Climatic Change, 113, 181-200.
Budescu, D. V., Por, H-H., Broomell, S. B., & Smithson, M. (2014). The interpretation of
IPCC probabilistic statements around the world. Nature Climate Change, 4, 508-512.
Budescu, D. V., & Wallsten, T. S. (1985). Consistency in interpretation of probabilistic
phrases. Organizational Behavior and Human Decision Processes, 36, 391-405.
18
Budescu, D. V., & Wallsten, T. S. (1995). Processing linguistic probabilities: General
principles and empirical evidence. In J. R. Busemeyer, R. Hastie, & D. L. Medin
(Eds.), Psychology of Learning and Motivation: Advances in Research and Theory
(Vol. 32: Decision Making from a Cognitive Perspective) (pp. 275-318). San Diego,
CA: Academic Press.
Deloitte (2008). Assets held for sale and discontinued operations: A guide to IFRS 5.
Retrieved October 29, 2014 from http://www.iasplus.com/enus/publications/global/guides/pub1923
Dhami, M. K. & Wallsten, T. S. (2005). Interpersonal comparison of subjective probabilities:
Toward translating linguistic probabilities. Memory & Cognition, 33, 1057-1068.
Furnham, A., & Boo, H. C. (2011). A literature review of the anchoring effect. The Journal of
Socio-Economics, 40, 35-42.
Grice, H. P. (1975/2001). Logic and conversation. In A. P. Martinich (Ed.), The Philosophy
of Language (4th Edition) (pp. 165-175). Oxford, UK: Oxford University Press.
Harris, A. J. L., & Corner, A. (2011). Communicating environmental risks: Clarifying the
severity effect in interpretations of verbal probability expressions. Journal of
Experimental Psychology: Learning, Memory, and Cognition, 37, 1571-1578.
Harris, A. J. L., Corner, A., Xu, J., & Du, X. (2013). Lost in translation? Interpretations of the
probability phrases used by the Intergovernmental Panel on Climate Change in China
and the UK. Climatic Change, 121, 415-425.
Ho, E. H., Budescu, D. V., Dhami, M. K., & Mandel, D. R. (in press). Improving the
communication of uncertainty in climate science and intelligence analysis. Behavioral
Science & Policy.
19
Intergovernmental Panel on Climate Change. (2007). A Report of Working Group I of the
Intergovernmental Panel on Climate Change: Summary for policymakers. Retrieved
April 2010 from http://www.ipcc.ch/pdf/assessment-report/ar4/wg1/ar4-wg1-spm.pdf
Intergovernmental Panel on Climate Change (2013). Climate Change 2013: The Physical
Science Basis. Summary for Policymakers. Retrieved June 2015 from
http://www.ipcc.ch/pdf/assessment-report/ar5/wg1/WG1AR5_SPM_FINAL.pdf
Karelitz, T. M., & Budescu, D. V. (2004). You say “probable” and I say “likely”: Improving
interpersonal communication with verbal probability phrases. Journal of
Experimental Psychology: Applied, 10, 25-41.
Leiserowitz A, Maibach E, Roser-Renouf C, Hmielowski JD (2011) Politics & global
warming: democrats, republicans, independents, and the tea party. Yale University
and George Mason University. New Haven, CT: Yale Project on Climate Change
Communication. Retrieved from
http://environment.yale.edu/climate/files/PoliticsGlobalWarming2011.pdf
Mandel, D. R. (2015). Accuracy of intelligence forecasts from the intelligence consumer's
perspective. Policy Insights from the Behavioral and Brain Sciences, 2, 111-120.
Mastrandrea, M. D., Field, C. B., Stocker, T. F., Edenhofer, O., Ebi, K. L., Held, H., et al.
(2010). Guidance note for lead authors of the IPCC fifth assessment report on
consistent treatment of uncertainties. IPCC cross-working group meeting on
consistent treatment of uncertainties, Jasper Ridge, CA. [retrieved from
http://www.ipcc.ch/pdf/supporting-material/uncertainty-guidance-note.pdf August
14th, 2013]
Mochon, D., & Frederick, S. (2013). Anchoring in sequential judgments. Organizational
Behavior and Human Decision Processes, 122, 69-79.
20
Navarro-Martinez, D., Salisbury, L. C., Lemon, K. N., Stewart, N., Matthews, W. J., &
Harris, A. J. L. (2011). Minimum required payment and supplemental information
disclosure effects on consumer debt repayment decisions. Journal of Marketing
Research, 48, S60-S77.
Newell, B. R., & Shanks, D. R. (2014). Prime numbers: Anchoring and its implications for
theories of behavior priming. Social Cognition, 32, 88-108.
Plous, S. (1989). Thinking the unthinkable: The effects of anchoring on likelihood estimates
of nuclear war. Journal of Applied Social Psychology, 19, 67-91.
Smithson, M., Budescu, D. V., Broomell, S. B., & Por, H-H. (2012). Never say “not”: Impact
of negative wording in probability phrases on imprecise probability judgments.
International Journal of Approximate Reasoning, 53, 1262-1270.
Stewart, N. (2009). The cost of anchoring on credit-card minimum repayments.
Psychological Science, 20, 39-41.
Teigen, K. H. (2008). More than X is a lot: Pragmatic implicatures of one-sided uncertainty
intervals. Social Cognition, 26, 379-400.
Teigen, K. H., Halberg, A-M., & Fostervold, K. I. (2007a). Single-limit interval estimates as
reference points. Applied Cognitive Psychology, 21, 383-406.
Teigen, K. H., Halberg, A-M., & Fostervold, K. I. (2007b). More than, less than, or
minimum, maximum: how upper and lower bounds determine subjective interval
estimates. Journal of Behavioral Decision Making, 20, 179 - 201.
Tversky, A., & Kahneman, D. (1974). Judgment under uncertainty: Heuristics and biases.
Science, 185, 1124 –1131.
Wallsten, T. S. (1990). The costs and benefits of vague information. In R. M. Hogarth (Ed.),
Insights in Decision Making: A Tribute to Hillel J. Einhorn (pp. 28-43). Chicago, IL:
The University of Chicago Press.
21
Witteman, C., & Renooij, S. (2003). Evaluation of a verbal-numerical probability scale.
International Journal of Approximate Reasoning, 33, 117-131.
Table 1. Likelihood scale of the IPCC.
Term
Virtually certain
Very likely
Likely
About as likely as not
Unlikely
Very unlikely
Exceptionally unlikely
Likelihood of the Outcome
AR4 (single-anchor)
AR5 (2 anchor)
> 99%
99-100%
> 90%
90-100%
> 66%
66-100%
33 to 66%
33 to 66%
< 33%
0-33%
< 10%
0-10%
< 1%
0-1%
22
Table 2. The statements used in the experiment (examples shown are as the text would appear
in the single anchor condition).
No.
Type
Statement
1
Continued greenhouse gas emissions at or above current rates would cause further warming and
induce many changes in the global climate system during the 21st century that would very likely
(greater than 90%) be larger than those observed during the 20th century.
2
It is very likely (greater than 90%) that hot extremes, heat waves, and heavy precipitation events
will continue to become more frequent.
3
The Greenland ice sheet and other Arctic ice fields likely (greater than 66%) contributed no more
than 4 m of the observed sea level rise.
4
5
6
Temperatures of the most extreme hot nights, cold nights and cold days are likely (greater than
66%) to have increased due to anthropogenic forcing. [Note: Anthropogenic forcing refers to the
IPCC influences on the environment by human, rather than natural, factors.]
Over the past 3,000 to 5,000 years, oscillations in global sea level on time-scales of 100 to 1,000
years are unlikely (less than 33%) to have exceeded 0.3 to 0.5 m.
Reconstructions of climate data for the past 1,000 years also indicate that this warming was
unusual and is unlikely (less than 33%) to be entirely natural in origin.
7
It is very unlikely (less than 10%) that the MOC will undergo a large abrupt transition during the
21st century. [Note: MOC stands for Meridional Overturning Circulation, and refers to the global
ocean currents.]
It is very unlikely (less than 10%) that climate changes of at least the seven centuries prior to 1950
were due to variability generated within the climate system alone.
8
9
IAS 36(21) notes that the fair value less costs to sell of an asset to be disposed of will often
approximate its value in use, as the value in use calculation will consist mainly of the net disposal
proceeds. This is because the future cash flows from continuing use of the asset until its disposal
are likely (greater than 66%) to be negligible. [IFRS5]
10
Investment property shall be recognized as an asset when, and only when it is likely (greater than
66%) that the future economic benefits or service potential that are associated with the investment
property will flow to the entity. [IPSAA 9]
Conversely, where an asset is still in the course of construction, and significant activities will need
to be performed before it can be transferred, it is unlikely (less than 33%) that it could be regarded
as available for immediate sale. [IFRS5]
During the initial one-year period, circumstances arise that were previously considered unlikely
(less than 33%) and, as a result, a non-current asset (or disposal group) previously classified as
held for sale is not sold by the end of that period. [IFRS5]
IAS
11
12
23
Figure 1. Example of an IPCC item with two anchors. The order of minimum, maximum and
best estimates was always as shown here. The sliders turned blue after they were moved and
their numerical value was shown on the right.
24
Mean Best Estimate (%)
100
90
single anchor
80
two anchors
70
60
50
40
30
20
10
0
Very likely
Likely
Unlikely
Very unlikely
Verbal Probability Expression
Figure 2. Mean ‘best estimates’ provided in response to the IPCC sentences. Error bars are
95% confidence intervals. Dashed horizontal lines represent the prescriptions of the IPCC for
the lower bounds of ‘(very) likely’ and the upper bounds of ‘(very) unlikely’ (Table 1).
25
100
Mean Best Estimate (%)
80
single anchor
two anchors
60
40
20
0
Likely
Unlikely
Verbal Probability Expression
Figure 3. Mean ‘best estimates’ for interpretations of Likely and Unlikely, collapsed across
the IPCC and IAS contexts. Error bars are 95% confidence intervals. Dashed horizontal lines
represent the prescriptions of the IPCC for the lower bound of ‘likely’ and the upper bound
for ‘unlikely’ (Table 1).
26
100
Minimum
80
Best
Mean Estimate (%)
Maximum
60
40
20
0
Single
Two
Very likely
Single
Two
Single
Two
Likely
Unlikely
VPE and Anchor Condition
Single
Two
Very unlikely
Figure 4. Mean estimates in the IPCC context. Error bars are 95% confidence intervals.
Dashed horizontal lines represent the prescriptions of the IPCC for the lower bounds of
‘(very) likely’ and the upper bounds of ‘(very) unlikely’ (Table 1).
27
Online Resource 1
Numeracy Items used in the questionnaire (questions taken from Frederick, 2005; Peters,
Västfjäll, Slovic, Mertz, Mazzocco, & Dickert, 2006).
No.
Question
Answer
N1.
Imagine that we roll a fair, six-sided die 1,000 times. (That would mean
that we roll one die from a pair of dice.) Out of 1,000 rolls, how many
times do you think the die would come up as an even number?
Number of
times: 500
N2
In the BIG BUCKS LOTTERY, the chances of winning a $10.00 prize are
1%. What is your best guess about how many people would win a $10.00
prize if 1,000 people each buy a single ticket from BIG BUCKS?
Number of
people: 10
N3
A bat and a ball cost $1.10 in total. The bat costs $1.00 more than the ball.
How much does the ball cost?
Cost of ball:
$0.05
N4
In a lake, there is a patch of lilypads. Every day, the patch doubles in size.
If it takes 48 days for the patch to cover the entire lake, how many days
would it take for the patch to cover half of the lake?
Number of
days: 47
N5
If it takes 5 machines 5 minutes to make 5 widgets, how long would it take
100 machines to make 100 widgets?
Number of
minutes: 5
References
Frederick, S. (2005). Cognitive reflection and decision making. Journal of Economic
Perspectives, 19, 24-42.
Peters, E., Västfjäll, D., Slovic, P., Mertz, C. K., Mazzocco, K., & Dickert, S. (2006).
Numeracy and decision making. Psychological Science, 17, 407-413.
28
Online Resource 2
Distribution of political affiliations.
Political affiliation
N
Cumulative percent
Strong Right Wing
Right to Center
Center
Center to Left
Strong Left Wing
Other
Didn’t answer question
14
33
56
68
45
5
0
6%
21%
47%
77%
98%
100%
--
Number of questions
correct (/5)
N
Cumulative percent
Zero
One
Two
Three
Four
Five
6
19
32
30
50
84
2.7
11.3
25.8
39.4
62.0
100.0
Distribution of numeracy scores.
29
Online Resource 3
100
Minimum
Best
Maximum
Mean Estimate (%)
80
60
40
20
0
Single
Two
Single
Two
Likely
Unlikely
VPE and Anchor Condition
Figure. Mean estimates in the IAS context. Error bars are 95% confidence intervals. Dashed
horizontal lines represent the prescriptions of the IPCC for the lower bound of ‘likely’ and
the upper bound for ‘unlikely’ (Table 1).
30
Online Resource 4
Online Resource 4: Consistency of estimates with the prescriptions in Table 1.
Best estimates were labelled as consistent if they were within the range prescribed by
the IPCC (Table 1). For example, best estimates for ‘unlikely’ were considered to be
consistent if they were less than or equal to 33% (see Table A for consistency rates of best
estimates)
Ranges were labelled as consistent if both upper and lower bounds were within the
prescribed range, as inconsistent if both were outside the prescribed range, and as partially
consistent otherwise. For example, endorsed ranges for ‘unlikely’ were considered to be
consistent if the higher bound was 33% or lower, inconsistent if the lower bound was higher
than 33%, and partially consistent if the lower bound was 33% or lower but the higher bound
was greater than 33% (see Table B for consistency rates of endorsed ranges).
Note that there were no significant differences between conditions for any of the
individual probability phrases, although all were numerically in the predicted direction.
31
Table A. Consistency of best estimates with the prescriptions of Table 1.
Single-anchor format
Probability
phrase
Consistent Inconsistent
Two-anchor format
Consistent Inconsistent
IPCC very
likely
88%
12%
91%
9%
IPCC likely
89%
11%
93%
7%
IPCC
unlikely
91%
9%
96%
4%
IPCC very
unlikely
91%
9%
92%
8%
IAS
likely
92%
8%
97%
3%
IAS
unlikely
92%
8%
96%
4%
All terms
90%
10%
94%
6%
32
Table B. Consistency of endorsed ranges with the prescriptions of Table 1.
Single-anchor format
Two-anchor format
Probability
phrase
Consistent
Partially Inconsistent
consistent
Consistent
Partially Inconsistent
consistent
IPCC very
likely
73%
21%
6%
81%
17%
2%
IPCC likely
71%
25%
4%
75%
24%
1%
IPCC
unlikely
81%
16%
3%
84%
12%
4%
IPCC very
unlikely
78%
16%
6%
82%
15%
3%
IAS
likely
75%
23%
2%
79%
20%
1%
IAS unlikely
81%
16%
3%
86%
13%
1%
All terms
77%
19%
4%
81%
17%
2%
33
Online Resource 5
Additional experiment with university students
Method
Participants
Nineteen male and 98 female (aged 18-22; median = 19 years; IQR = 1.0) first year
psychology undergraduates at University College London (UCL) participated in the
experiment as part of a course requirement. A further 30 male and 32 female (aged 18-33;
median = 20.5 years; IQR = 2.0) social science undergraduates at Carnegie Mellon University
(CMU) participated for course credit.
Design and Materials
The design was the same as in the main experiment.
The only difference in the materials was in the introduction to the IPCC and the IAS
that participants were presented with prior to a practice trial using “about as likely as not (3366%).” The translation table presented on this page in the single-anchor condition presented
the range with an inequality sign (e.g., “>99%”) instead of in words (i.e., “greater than
99%”). Words were used elsewhere throughout the experiment. Replacing the inequality
signs with words was a ‘fix’ employed in the main experiment, to ensure that there could be
no confusion with participants misunderstanding the directions of the inequalities.
Procedure
34
Participants at UCL completed the experiment in two large (approximately equalsized) groups. Participants at CMU accessed the experiment from the online undergraduate
participant pool in their own time. The experiment used the same qualtrics computer program
and was completed on individual desktop computers. Participants were provided with the
experimental link and all instructions were presented on the computer.
Other than this, all aspects of the procedure were identical to the main experiment
except for the fact that no consistency check was conducted, with the age and gender
questions being asked at the end of the experiment, with no additional year of birth question.
Results
We follow the same procedure and structure for reporting our results as in the main
experiment.
IPCC
Mean ‘best estimates’ for the four VPEs across both anchor conditions are plotted in
Figure A. A visual inspection of Figure A shows that the predicted result of estimates being
further from 50% in the two-anchor condition than the single-anchor held, directionally, for
the expressions ‘likely’, ‘unlikely’ and ‘very unlikely.’ A 2 (anchor condition) x 4 (VPE)
mixed ANOVA revealed a main effect of VPE, F(1.4, 251.5) = 3579, p < .001, etap2 = .95
(Greenhouse-Geisser correction applied), as well as a main effect of anchor condition, F(1,
177) = 4.89, p = .028, etap2 = .03. The main effect was qualified by the predicted interaction
between VPE and anchor condition, F(1.4, 251.5) = 4.37, p = .024, etap2 = .02. Simple effects
tests showed that the effect was significant only for ‘unlikely’, F(1, 704.8) = 12.36, p < .001.
35
Mean Best Estimate (%)
100
90
single anchor
80
two anchors
70
60
50
40
30
20
10
0
Very likely
Likely
Unlikely
Very unlikely
Verbal Probability Expression
Figure A. Mean ‘best estimates’ provided in response to the IPCC sentences. Error bars are
95% confidence intervals. Dashed horizontal lines represent the prescriptions of the IPCC for
the lower bounds of ‘(very) likely’ and the upper bounds of ‘(very) unlikely’ (Table 1).
IAS and IPCC
In the analysis of ‘likely’ and ‘unlikely’ including the IAS context, interpretations of
‘likely’ and ‘unlikely’ did not significantly differ between the contexts: main effect of
context, F < 1, interaction between context and VPE, F(1, 177) = 2.77, p = .098, etap2 = .015.
Figure B therefore plots the mean estimates for ‘likely and ‘unlikely’ across both anchor
conditions, collapsed across context. Directionally, estimates are further from 50% in the
two-anchor condition than the single-anchor condition. This result was borne out with a
significant VPE x anchor condition interaction, F(1, 177) = 13.20, p < .001, etap2 = .069.
Separate ANOVAs performed on ‘likely’ and ‘unlikely’ suggested that the anchor condition
was significant for both: ‘likely’, F(1, 177) = 7.44, p = .007, etap2 = .040; ‘unlikely’, F(1,
36
177) = 12.14, p = .001, etap2 = .064, with no effects of, or interactions involving, context.
Finally, an ANCOVA confirmed that the overall pattern of results was consistent when
controlling for numeracy, political affiliation6, age and gender. Thus, overall these results are
consistent with those of the main experiment.
Mean Best Estimate (%)
100
80
single anchor
two anchors
60
40
20
0
Likely
Unlikely
Verbal Probability Expression
Figure B. Mean ‘best estimates’ for interpretations of ‘likely’ and ‘unlikely’, collapsed across
the IPCC and IAS contexts. Error bars are 95% confidence intervals. Dashed horizontal lines
represent the prescriptions of the IPCC for the lower bound of ‘likely’ and the upper bound
for ‘unlikely’ (Table 1).
To better understand the nature of the effect, participants’ minimum and maximum
estimates of ‘likely’ and ‘unlikely’ were analysed. Figure C plots the ‘minimum’, ‘best’ and
6
The distribution of political affiliations and numeracy scores are included in an appendix at the end of this
document.
37
‘maximum’ estimates across the anchor conditions for these expressions. For ‘likely’, the
maximum estimate increased to a greater degree than the minimum estimate in the twoanchor condition, resulting in a greater endorsed range in the two-anchor condition, as in the
main experiment and predicted by the Extended Range hypothesis. This pattern appears less
clear for ‘unlikely’, however, as both the minimum and maximum estimates appear to
decrease in the two-anchor condition – a result more consistent with an anchoring account.
To determine the range endorsed by participants for each VPE, participants’ minimum
estimates were subtracted from their maximum estimates. In a 2 (context) x 2 (VPE) x 2
(anchor condition) ANOVA, a main effect of VPE was observed, F(1, 177) = 11.89, p = .001,
etap2 = .063. The interaction between VPE and anchor condition was again significant, F(1,
177) = 12.88, p < .001, etap2 = .001. There was no main effect of context, F(1, 177) = 1.64, p
= .202, nor were there any interactions involving context (all Fs < 1, except VPE x context,
F(1, 177) = 1.05, p = .307), suggesting that the documented effects do not systematically
differ between the IPCC and IAS contexts. Separate ANOVAs for ‘likely’ and ‘unlikely’,
revealed that the effect of anchor condition was significant for ‘likely’, F(1, 177) = 7.49, p =
.007, etap2 = .041, but not for ‘unlikely’, F < 1. As suggested in Figure C, the increased range
for ‘likely’ in the two-anchor condition stems from participants providing higher maximum
estimates, F(1, 177) = 19.62, p < .001, etap2 = .100, with no corresponding change in their
minimum estimates, F(1, 177) = 1.36, p = .246. By contrast, the consistent range endorsed for
‘unlikely’ in the two conditions, coupled with the effect of anchor condition for ‘best
estimates’ shows that both the minimum and maximum estimates were also less regressive in
the two-anchor condition (minimum: F(1, 177) = 15.07, p < .001, etap2 = .078; maximum:
F(1, 177) = 4.41, p = .037, etap2 = .024). In an ANCOVA including numeracy, political
38
affiliation, age and gender as covariates, the VPE x anchor condition interaction remained
significant, with no interactions or effects of context.7
100
Minimum
Mean Estimate (%)
80
Best
Maximum
60
40
20
0
Single anchor
Two anchors
Single anchor
Two anchors
Likely
Unlikely
VPEs and Anchor Condition
Figure C. Mean estimates for ‘likely’ and ‘unlikely’, collapsed across the IPCC and IAS
contexts. Error bars are 95% confidence intervals. Dashed horizontal lines represent the
prescriptions of the IPCC for the lower bound of ‘likely’ and the upper bound for ‘unlikely’
(Table 1).
Consistency with prescribed ranges
Best estimates were labelled as consistent if they were within the range prescribed by
the IPCC (Table 1). For example, best estimates for ‘unlikely’ were considered to be
consistent if they were less than or equal to 33% (see Table A for consistency rates of best
estimates). Unlike in the experiment reported in the main text, the effect of anchor condition
on consistency rates was not significant, χ2(1) = 0.61, p = .44, although the numerical trend
7
In contrast to the ANOVA, there was no main effect of VPE on endorsed range in the ANCOVA.
39
was in the predicted direction for all instances apart from ‘very likely’ in the IPCC context
(Table A).
Table A. Consistency of best estimates with the prescriptions of Table 1.
Single-anchor format
Probability
phrase
Consistent Inconsistent
Two-anchor format
Consistent Inconsistent
IPCC very
likely
83%
17%
82%
18%
IPCC likely
91%
9%
93%
7%
IPCC
unlikely
94%
6%
96%
4%
IPCC very
unlikely
86%
14%
89%
11%
IAS
likely
91%
9%
94%
6%
IAS
unlikely
95%
5%
96%
4%
All terms
90%
10%
91%
9%
Ranges were labelled as consistent if both upper and lower bounds were within the
prescribed range, as inconsistent if both were outside the prescribed range, and as partially
consistent otherwise. For example, endorsed ranges for ‘unlikely’ were considered to be
consistent if the higher bound was 33% or lower, inconsistent if the lower bound was higher
than 33%, and partially consistent if the lower bound was 33% or lower but the higher bound
was greater than 33% (see Table B for consistency rates of endorsed ranges). Unlike in the
experiment reported in the main text, the effect of anchor condition on consistency rates was
40
not significant, χ2(2) = 5.65, p = .059, although the numerical trend was in the predicted
direction for all instances (Table B).
Table B. Consistency of endorsed ranges with the prescriptions of Table 1.
Single-anchor format
Two-anchor format
Probability
phrase
Consistent
Partially Inconsistent
consistent
Consistent
Partially Inconsistent
consistent
IPCC very
likely
55%
41%
4%
59%
36%
5%
IPCC likely
54%
45%
1%
60%
38%
2%
IPCC
unlikely
75%
22%
3%
86%
12%
2%
IPCC very
unlikely
67%
28%
5%
77%
20%
3%
IAS
likely
58%
37%
5%
64%
36%
0%
IAS unlikely
79%
17%
4%
83%
13%
2%
All terms
65%
32%
3%
72%
26%
2%
Discussion
Generally, the pattern of results was the same as in the main experiment. There were
three differences observed in the patterns of results, which we draw attention to here.
Firstly, the regressiveness of participants’ best estimates of ‘likely’ in the IPCC
context was not significantly attenuated in this experiment, whilst it was in the main
experiment. In the analysis combining the IPCC and IAS data, however, the same result was
41
observed as in the main experiment, and there was no interaction with context. We are
therefore confident in our overall conclusion that the two-anchor presentation reduces the
regressiveness of ‘best estimate’ interpretations of ‘likely’ and ‘unlikely.’
In the main experiment, we posited that the extended range account seemed to provide
a better account of the present findings on the basis of the range of plausible estimates
increasing in the two-anchor condition. The present experiment yielded one result that did not
follow this pattern. This was for ‘unlikely.’ In this instance, the lower best estimate is not
associated with an increase in range, and is also associated with a lower maximum estimate
(as well as lower minimum estimate). In the Introduction to the paper, we stated that such a
pattern of results was difficult to predict with the extended range account, suggesting a role
for anchoring. As we state in the General Discussion of the main manuscript, although we
believe that the data are generally more consistent with the extended range hypothesis, we
could not have ruled out an influence of anchoring in any case, and this result potentially
underscores that point. Nonetheless, the effects themselves seem robust, holding when
controlling for the influence of potential covariates across two experiments from two
different populations.
Finally, we do not observe a significant effect in the consistency analyses. We have
no clear explanation for the difference between the two experiments in this regard, but do
note that numerically, across both experiments, the trends were in the predicted direction for
23 out of 24 comparisons. Consequently, we do not perceive there to be a strong discrepancy
in the overall conclusions that can be drawn from the two experiments.
42
Appendix
Distribution of political affiliations in the additional experiment.
Political affiliation
N
Cumulative percent
Strong Right Wing
Right to Center
Center
Center to Left
Strong Left Wing
Other
Didn’t answer question
4
25
58
47
17
26
2
2%
16%
49%
76%
85%
100%
--
Distribution of numeracy scores in the additional experiment.
Number of questions
correct (/5)
N
Cumulative percent
Zero
One
Two
Three
Four
Five
2
13
47
41
40
36
1.1
8.4
34.6
57.5
79.9
100.0