A Concise Guide To Compositional Data Analysis
A Concise Guide To Compositional Data Analysis
John Aitchison
Honorary Senior Research Fellow
Department of Statistics University of Glasgow
Email: john.aitchison@btinternet.com
A Concise Guide to
Compositional Data Analysis
Contents
Preface
5.1 Introduction
5.2 Differential perturbation processes
5.3 A simple example: Arctic lake sediment
5.4 Exploration for possible differential processes
5.5 Convex linear mixing processes
5.6 Distinguishing between alternative hypothesis
Postlude
Appendix Tables
Preface
and is claimed to be hongite. Can we say whether this is fairly typical of hongite? If
not, can we place some measure on its atypicality?
7
Chapter 1 The nature of compositional problems
8
Chapter 1 The nature of compositional problems
Interesting questions are readily formulated. To what extent does the pattern of budget
share of expenditures for men depend on the total amount spent? Are there differences
between men and women in their expenditure patterns? Are there some commodity
groups which are given priority in the allocation of expenditure?
9
Chapter 1 The nature of compositional problems
10
Chapter 1 The nature of compositional problems
Archaeology
Ceramic compositions
Developmental biology
Economics
Household budget compositions and income elasticities of demand
Portfolio compositions
11
Chapter 1 The nature of compositional problems
Environometrics
Pollutant compositions
Geography
Geology
Literary studies
Sentence compositions
Manufacturing
Global car production compositions
Medicine
Blood compositions
Renal calculi compositions
Urine compositions
Ornithology
Sea bird time budgets
Plumage colour compositions of greater bower birds
Palaeontology
Foraminifera compositions
Zonal pollen compositions
Psephology
US Presidential election voting proportions
12
Chapter 1 The nature of compositional problems
Psychology
Time budgets of various groups
Waste disposal
Waste composition
We must look back to 1897 for our starting point. Over a century ago Karl Pearson
published one of the clearest warnings (Pearson, 1897) ever issued to statisticians and
other scientists beset with uncertainty and variability: Beware of attempts to interpret
correlations between ratios whose numerators and denominators contain common
parts. And of such is the world of compositional data, where for example some rock
specimen, of total weight w, is broken down into mutually exclusive and exhaustive
parts with component weights w1 , . . . , wD and then transformed into a composition
(x 1 , . . . , x D ) = (w1 , . . . , wD )/(w1 + . . . + wD ).
Our reason for forming such a composition is that in many problems composition is
the relevant entity. For example the comparison of rock specimens of different
weights can only be achieved by some form of standardization and composition (per
unit weight) is a simple and obvious concept for achieving this. Equivalently we could
say that any meaningful statement about the rock specimens should not depend on the
largely accidental weights of the specimens.
13
Chapter 1 The nature of compositional problems
Sarmanov and Vistelius (1959), Mosimann (1962,1963). The main problem was
perceived as the impossibility of interpreting the product moment correlation
coefficients between the raw components and was commonly referred to as the
negative bias problem. For a D-part composition [ x1 , . . . , x D ] with the component
sum x1 + . . . + x D = 1 , since
cov( x1 , x1 + . . . + x D ) = 0
we have
The right hand side here is negative except for the trivial case where the first
component is constant. Thus at least one of the covariances on the left must be
negative or, equivalently, there must be at least one negative element in the first row
of the raw covariance matrix. The same negative bias must similarly occur in each of
the other rows so that at least D of the elements of the raw covariance matrix. Hence
correlations are not free to range over the usual interval (-1, 1) subject only to the
non-negative definiteness of the covariance or correlation matrix, and there are bound
to problems of interpretation.
The problem was described under different headings: the constant-sum problem, the
closure problem, the negative bias problem, the null correlation difficulty. Strangely
no attempt was made to try and establish principles of compositional data analysis.
The approach was essentially pathological with attempts to see what went wrong
when standard multivariate analysis was applied to compositional data in the hope
that some corrective treatments could be applied; see, for example, Butler (1979),
Chayes (1971, 1972), Chayes and Kruskal (1966), Chayes and Trochimczyk (1978),
Darroch and James (1975), Darroch and Ratcliff (1970, 1978).
14
Chapter 1 The nature of compositional problems
In the early 1980’s it seemed to the writer that there was an obvious way of analysing
compositional data. Since compositional data provide information only about the
relative magnitudes of the parts, not their absolute values, then the information
provided is essentially about ratios of the components. Therefore it seemed to make
sense to think in terms of ratios. There is clearly a one-to-one correspondence
between compositions and a full set of ratios. Moreover, since ratios are awkward to
handle mathematically and statistically (for example there is no exact relationship
between var( xi / x j ) and var( x j / x i ) ) it seems sensible to work in terms of logratios,
var{log( xi / x j )} = var{log( x j / xi )} .
15
Chapter 1 The nature of compositional problems
Since there is also a one-to-one correspondence between compositions and a full set
of logratios, for example,
with inverse
One might therefore have expected the logratio transformation technique to have been
an immediate happy and successful end of story. While it has eventually become so,
immediate opposition along Pearsonian lines undoubtedly came to the fore. The
16
Chapter 1 The nature of compositional problems
reader interested in pursuing the kinds of anti- transformation and other arguments
against logratio analysis may find some entertainment in the following sequence of
references published in the Mathematical Geology: Watson and Philip (1989),
Aitchison (1990a), Watson (1990), Aitchison (1990b), Watson (1991), Aitchison
(1991, 1992), Woronow (1997a, 1997b), Aitchison (1999), Zier and Rehder (1998),
Aitchison et al (2000), Rehder and Zier (2001), Aitchison et al (2001).
While much of this argumentative activity has been unnecessary and time-consuming,
there have been episodes of progress. While the transformation techniques of
Aitchison (1986) are still valid and provide a comprehensive methodology for
compositional data analysis, there is now a better understanding of the fundamental
principles which any compositional data methodology must adhere to. Moreover,
there is now an alternative approach to compositional data analysis which could be
termed the staying- in-the-simplex approach, whereby the tools introduced by
Aitchison (1986) are adapted to defining a simple algebraic-geometric structure on the
simplex, so that all analysis may be conducted entirely within this framework. This
makes the analysis independent of transformations and results in unconstrained
multivariate analysis. It should be said, however, that inferences will be identical
whether a transformation technique or a staying- in-the-simplex approach is adopted.
Which approach a compositional data analyst adopts will largely depend on the
analyst’s technical understanding of the algebraic-geometric structure of the simplex.
In this guide we will adopt a bilateral approach ensuring that we provide examples of
interpretations in both ways.
One of the disputed principles of compositional data analysis in the early part of the
sequence above is that of scale invariance. When we say that a problem is
compositional we are recognizing that the sizes of our specimens are irrelevant. This
trivial admission has far-reaching consequences.
A simple example can illustrate the argument. Consider two specimen vectors
17
Chapter 1 The nature of compositional problems
in R+3 as in Figure 1.4, representing the weights of the three parts (a, b, c) of two
specimens of total weight 8g and 15g, respectively. If we are interested in
compositional problems we recognize that these are of the same composition, the
difference in weight being taken account of by the scale relationship W =(15/8) w.
More generally two specimen vectors w and W in R+D are compositionally equivalent,
written W ∼ w, when there exists a positive proportionality constant p such that W=
pw. The fundamental requirement of compositional data analysis can then be stated as
follows: any meaningful function f of a specimen vector must be such that f(W)=f(w)
when W∼w, or equivalently
In other words, the function f must be invariant under the group of scale
transformations. Since any group invariant function can be expressed as a function of
any maximal invariant h and since
Note that there are many equivalent sets of ratios which may be used for the purpose
of creating meaningful functions of compositions. For example, a more symmetric set
1/D
of ratios such as w/g(w), where g(w) = (w1 . . . wD ) is the geometric mean of the
components of w, would equally meet the scale- invariant requireme nt.
18
Chapter 1 The nature of compositional problems
Fig. 1.4 Representation of equivalent specimen vectors as points on rays of the positive orthant
The marginal or projection concept for simplicial data is slightly more comple x than
that for unconstrained vectors in RD , where a marginal vector is simply a subvector of
the full D-dimensional vector. For example, a geologist interested only in the parts
(Na2 O, K2O, Al2 O3 ) of a ten-part major oxide composition of a rock commonly forms
the subcomposition based on these parts. Formally the subcomposition based on parts
(1, 2, . . . ,C) of a D-part composition [x 1 , ... , x D ] is the (1, 2, . . . ,C)-subcomposition
[s1 , . . . , sC ] defined by
[s1 , . . . , sC ] = [x 1 , . . . , xC ] / (x 1 + . . . + xC).
Note that this operation is a projection from a subsimplex to another subsimplex. See,
for example, Aitchison (1986, Section 2.5).
1.6 Compositional classes and the search for a suitable sample space
In my own teaching over the last 45 years I have issued a warning to all my students,
similar to that of Pearson. Ignore the clear definition of your sample space at your
19
Chapter 1 The nature of compositional problems
peril. When faced with a new situation the first thing you must resolve before you do
anything else is an appropriate sample space. On occasions when I have found some
dispute between students over some statistical issue the question of which of them had
appropriate sample spaces has almost always determined which students are correct in
their conclusions. If, for example, it is a question of association between the directions
of departure and return of migrating New York swallows then an appropriate sample
space is a doughnut.
In the first step of statistical modelling, namely specifying a sample space, the choice
is with the modeller. It is how the sample space is used or exploited to answer
relevant problems that is important. We might, as in our study of scale invariance
above, take the set of rays through the origin and in the positive orthant as our sample
space. The awkwardness here is that the notion of placing a probability measure on a
set of rays is less familiar than on a set of points. Moreover we know that as far as the
study of compositions is concerned any point on a ray can be used to represent the
corresponding composition. The selection of each representative point x where the
rays meet the unit hyperplane w1 + . . . + wD = 1 with x = w/(w1 + . . . + wD) is surely
the simplest form of standardization possible. We shall thus adopt the unit simplex
D
S = { [x 1 , . . . , x D ]: x i>0 (i = 1, . . . , D) , x 1 + . . . + x D = 1}.
20
Chapter 1 The nature of compositional problems
Our next logical requirement will reinforce the good sense of this formulation in
terms of ratios.
21
Chapter 1 The nature of compositional problems
Scientist A would report the correlation between animal and vegetable as corr(x 1 , x2 )
= 0.5 whereas B would report corr(s1 , s2 ) = -1. There is thus incoherence of the
product- moment correlation between raw components as a measure of dependence.
Note, however, that the ratio of two components remains unchanged when we move
from full composition to subcomposition: si / s j = xi / x j , so that as long as we work
with scale invariant functions, or equivalently express all our statements about
compositions in terms of ratios, we shall be subcompositionally coherent.
22
Chapter 1 The nature of compositional problems
requirement of invariance under the group of translations. This is the reason, though
seldom expressed because of its obviousness in this simple space, for the use of the
mean vector ?µ = E ( y ) and the covariance matrix Σ = cov( y ) = E {( y − µ )( y − µ ) T }
as meaningful measures of ‘central tendency’ and ‘dispersion’. Recall also, for further
reference, two basic properties: for a fixed translation t,
The second operation, that of scalar multiplication, also plays a substantial role in, for
example, linear forms of statistical analysis such as principal component analysis,
where linear combinations a1 y1 + . . . + a D y D with certain properties are sought.
Recall, again for further reference, that for a fixed scalar multiple a,
The perturbation operator can be motivated by the following observation within the
positive orthant representation of compositions. For any two equivalent compositions
w and W on the same ray there is a scale relationship W = pw for some p > 0, where
23
Chapter 1 The nature of compositional problems
X = p ⊕ x = [ p1 x1... p D x D ] /( p1 x1 +...+ p D x D )
= C[ p1 x1... p D x D ],
where C is the so-called closure operation which divides each component of a vector
by the sum of the components, thus scaling the vector to the constant sum 1. Note that
because of the nature of the scaling in this relationship it is not strictly necessary for
the perturbation p to be a vector in S D .
In mathematical terms the set of perturbations in S D form a group with the identity
perturbation e = [1 / D . . . 1 / D ] and the inverse of a perturbation p being the closure
p −1 = C[ p1−1 ... p D−1 ] . We use the notation x Θ p to denote the operation of this inverse
on x giving x Θ p = C[ x1 / p1 . . . x D / p D ] . The relation between any two
compositions x and X can always be expressed as a perturbation operation
X = ( X Θ x ) ⊕ x , where X Θ x is a perturbation in the group of perturbations in the
24
Chapter 1 The nature of compositional problems
invariance, subcompositional coherence and some other simple requirements, has led
Aitchison (1992) to advocate the follolowing definition:
2
x X
∆ ( x , X ) = ∑ log i − log i
2
i< j xj Xj
x1 = p1 ⊕ x 0 , x2 = p2 ⊕ x1 , . . . , xn = pn ⊕ x n−1
so that
x n = ( p1 ⊕ p2 ⊕ . . . ⊕ pn ) ⊕ x 0 .
It is clear that in this mechanism we have the makings of some form of central limit
theorem but we delay consideration of this until we have completed the more
mathematical aspects of the simplex sample space.
25
Chapter 1 The nature of compositional problems
imprecision or error. A simple example will suffice for the moment. In the process of
replicate analyses of aliquots of some specimen in an attempt to determine its
composition ξ ?we may obtain different compositions x 1 , . . . , x N because of the
imprecision of the analytic process. In such a situation we can model by setting
x n = ξ ⊕ pn ( n = 1, . . . , N ) ,
The simplicial operation analogous to scalar multiplication in real space is the power
operation. First we define the power operatio n and then consider its relevance in
compositional data analysis. For any real number a ∈ R1 and any composition x ∈ S D
we define
X = a ⊗ x = C [ x1a ...x aD ]
x = ξ ⊕{log d ⊗ β } ⊕ p ,
26
Chapter 1 The nature of compositional problems
It must be clear that together the operations perturbation ⊕ and power ⊗ play roles
in the geometry of S D analogous to translation and scalar multiplication in R D and
indeed can be used to define a vector space in S D . We shall take up the full algebraic-
geometric structure of the simplex sample space later in this guide.
There is a tendency in some compositional data analysts to expect too much in their
inferences from compositional data. For these the following situation may show the
nature of the limitations of compositional data.
Outside my home I have a planter consisting of water, soil and seed. One evening
before bedtime I analyse a sample and determine its (water, soil, seed) composition as
x = [3/6 2/6 1/6]. I sleep soundly and in the morning again analyse a sample finding
X = [6/9 2/9 1/9]. I measure the change as the perturbation
X Θ x = C[( 6 / 9) / ( 3 / 6) ( 2 / 9 ) / (2 / 6) (1 / 9) / (1 / 6)] = [1 / 2 1 / 4 1 / 4] .
Now I can picture two simple scenarios which could describe this change. Suppose
that the planter last evening actually contained [18 12 6] kilos of (water, soil, seed),
corresponding to the evening composition [3/6 2/6 1/6], and it rained during the
night increasing the water content only so that the morning content was [36 12 6]
kilos, corresponding to the morning composition [6/9 2/9 1/9]. Although this rain
only explanation may be true, is it the only explanation? Obviously not, because the
change could equally be explained by a wind only scenario, in which the overnight
wind has swept away soil and seed resulting in content of [18 6 3] kilos and the same
morning composition [6/9 2/9 1/9]. Even more complicated scenarios will produce a
similar change. For example a combination of rain and wind might have resulted in a
27
Chapter 1 The nature of compositional problems
combination of increased water and decreased soil and seed, say to a content of [27 9
4.5] kilos, again with morning composition [6/9 2/9 1/9].
The point here is that compositions provide information only about the relative
magnitudes of the compositional components and so interpretations involving absolute
values as in the above example cannot be justified. Only if there is evidence external
to the compositional information would such inferences be justified. For example, if I
had been wakened by my bedroom windows rattling during the night and I found my
rain gauge empty in the morning would I be justified in painting the wind only
scenario. But I slept soundly during the night.
A consequence of this example is that we must learn to phrase our inferences from
compositional data in terms which are meaningful and we have seen that the
meaningful operations are perturbation and power. In subsequent chapters we shall
how we may use these operations successfully.
28
Chapter 2 The simplex sample space
What has come to be known as logratio analysis for compositional data problems
arose in the 1980’s out of the realisation of the importance of the principle of scale
invariance and that its practical implementation required working with ratios of
components, This, together with an awareness that logarithms of ratios are
mathematically more tractable than ratios led to the advocacy of a transformation
technique involving logratios of the components. There were two obvious contenders
for this. Let x = [ x1 . . . x D ] ∈ S D be a typical D-part composition. Then the so-called
where the ratios involve the division of each of the first D – 1 components by the final
component. The inverse transformation alr −1: R D−1 → S D is
where C denotes the closure operation. Note that this transformation takes the
composition into the whole of R D −1 and so we have the prospect of using standard
unconstrained multivariate analysis on the transformed data, and because of the one-
to-one nature of the transformation transferring any inferences back to the simplex
and to the components of the composition.
One apparent drawback with this technique is the choice of the final component as
divisor, with a much asked question. Would we obtain the same inference if we chose
29
Chapter 2 The simplex sample space
where
x = C [exp( z1 )...exp( z D )] .
This transformation to a real space again opens up the possibility of using standard
unconstrained multivariate methods.
We note here that the mean vector µ = E {alr ( x)} and covariance matrix
Σ = cov{alr ( x )} of the logratio vector alr ( x ) will play an important role in our
compositional data analysis, as will do the centred logratio analogues λ = E {clr ( x )}
and cov{clr ( x)} .
30
Chapter 2 The simplex sample space
2.2 The unit simplex sample space and the staying-in the-simplex approach
Clearly the speaker has been very successful in fitting simple models to normal
transformed data, the counterpart to the simplicity of these models is the
complexity of corresponding relationships among the untransformed components.
This is hardly an original observation. Yet there are certain aromas rising from the
murky potage of compositional data problems which are redolent of some aspects
of problems with directional data, and herein lies the point. When attacking these
latter problems, one is ultimately better off working within the confines of the
original geometry (of the circle, sphere cylinder, . . .) and with techniques
particular thereto (vector methods, etc), in terms of perceiving simple underlying
ideas and modelling them in a natural way. Mapping from, say, the sphere into the
plane, and then back, rarely produces these elements, and usually introduces
unfortunate distortion. I still hold out some hope that simple models of dependence
can be found, peculiar to the simplex. . . . Meanwhile, I shall analyse data with the
normal transform method.
31
Chapter 2 The simplex sample space
The lack of success in transforming the sphere into the plane is that the spaces are
topologically different whereas the simplex and real space are topologically
equivalent. Nevertheless there is a challenge to confine the statistical argument to the
geometry of the simplex, and this approach has been emerging over the last decade,
based on the operations of perturbation and power and on the already indicated
simplicial metric. It is now certainly possible to analyse compositional data entirely
within simplicial geometry. Clearly the success of such an approach must depend
largely on the mathematical sophistication of the user. In the remainder of this guide
we shall adopt a bilateral approach, attempting to interpret inferences from our
compositional data problems both from the logratio analysis approach and the
staying- in-the-simplex approach.
2.3.1 Introduction
The purpose of this section is to provide a reasonably agreed account in terms of
terminology and notation of the algebraic-geometric structure of the unit simplex as a
standard sample space for those compositional data analysts wishing to adopt a
staying- in-the-simplex approach as an alternative to logratio transformation
techniques. Emphasis is placed on the metric vector space structure of the simplex,
with perturbation and power operations, the associated metric, the importance of
bases, power-perturbation combinations, and simplicial subspaces in range and null
space terms. Concepts of rates of compositional change, including compositional
differentiation and integration are also considered. For compositional data sets, some
basic ideas are discussed including concepts of distributional centre and dispersion,
and the fundamental simplicial singular value decomposition. The sources of the ideas
are dispersed through the References and will not be cited throughout the text.
32
Chapter 2 The simplex sample space
Note that the ‘closure’ operator C standardises the contained vector by dividing by the
sum of its components so that a subcomposition consists of components summing to
unity. In geometric terms formation of a subcomposition is geometrically a projection.
The fundamental operations of change in the simplex are those of perturbation and
power transformation. In their simplest forms these can be defined as follows. Given
33
Chapter 2 The simplex sample space
x ⊕ y = C [ x1 y1 ,..., x D y D ],
where C is the well known closure or normalizing operation in which the elements of
a positive vector are divided by their sum; and given a D-part composition x ∈ S D
and a real number, a the power transformed composition is
a ⊗ x = C [ x1a ,..., x aD ]
Note that we have used the operator symbols ⊕ and ⊗ to emphasize the analogy with
the operations of displacement or translation and scalar multiplication of vectors in
R D . It is trivial to establish that the internal ⊕ operation and the external ⊗ ?operation
properties of ⊕ and ⊗ :
x ⊕ y = y ⊕ x , ( x ⊕ y ) ⊕ z = x ⊕ ( y ⊕ z), a ⊗ ( x ⊕ y ) = ( a ⊗ x ) ⊕ (a ⊗ y ).
xΘ y = C [ x1 / y1 ,..., x D / y D ]
and plays an important role in compositional data analysis, for example in the
construction of compositional residuals.
∆: S D × S D → R≥0
defined as follows:
1/2
D yi
2 1/ 2
D yi
2
xi xi
∆ ( x, y ) = ∑ log − log = ∑ log − log ( x , y ∈ S D ),
i=1 g ( x) g( y)
i< j x j y j
34
Chapter 2 The simplex sample space
where g( ) is the geometric mean of the components of the composition. The metric
∆ satisfies the usual metric axioms:
M2 Symmetry: ∆ ( x, y ) = ∆ ( y , x)
M3 Power relationship: ∆ (a ⊗ x, a ⊗ y ) =| a | ∆ ( x, y )
M4 Triangular inequality: ∆ ( x, z ) + ∆ ( z , y ) ≥ ∆( x, y)
The fact that this metric has also desirable properties relevant and logically necessary,
such as scale, permutation and perturbation invariance and subcompositional
dominance, for meaningful statistical analysis of compositional data is now well
established and the relevant properties are recorded briefly here:
2
D
x
|| x || = ∆ ( x, e) = ∑ log i
2 2
i =1 g (x )
35
Chapter 2 The simplex sample space
D D
bi xi
∑ log g (b) log g (x ) = ∑ ai log x i
i =1 i =1
where a = log{ b / g (b)} and so a1 + . . . + a D = 0 . Thus inner products play the role of
logcontrasts, well established as the compositional ‘linear combinations’ required in
many forms of compositional data analysis such as principal component analysis and
investigation of subcompositions as concomitant or explanatory vectors.
x = ( u1 ⊗ β 1) ⊕...⊕ (u C ⊗ β C )
and such combinations play a central role. In such a specification the β ’s are
compositions regarded as generators, and the combination generates some subspace of
the unit simplex as the real number u-coefficient vary. When this subspace is the
whole of the unit simplex then the β ’s form a basis. Generally a basis should be
chosen such that the generators are ‘linearly independent’ in the sense that β 1,..., β C
are linearly independent if and only if
36
Chapter 2 The simplex sample space
bases a central role in a data-analytic sense in terms of the simplicial singular value
decomposition of a compositional data set.
As in standard multivariate analysis range and null spaces play an important and
complementary role in such areas of data investigation as compositional regression ,
principal logcontrast component analysis and in the study of compositional processes.
The set Β = [ β 1;...; β C ] of linearly independent generators identifies a range space
null ( Β) = {x : 〈 β 1, x 〉 = 0,..., 〈 β C , x〉 = 0}
37
Chapter 2 The simplex sample space
1 d
Dx (t ) = lim dt − > 0 ⊗ ( x(t + dt )Θx(t)) = C (exp( log x( t )))
dt dt
where d/dt denotes ‘ordinary’ derivation with respect to t. Thus, for example, if
x ( t) = ξ ⊕ h( t) ⊗ β , then Dx( t ) = h'( t ) ⊗ β . There are obvious extensions through
partial differentiation to compositional functions of more than one variable. We note
also that the inverse operation of integration of a compositional function x (t ) over an
interval (t 0 , t) can be expressed as
t0
There are a number of criteria which dictate the choice of any measure V(x) of
dispersion and dependence which forms the basis of characteristics of compositional
variability in terms of second order moments:
38
Chapter 2 The simplex sample space
p? (Recall the result in Section 1.8.1 that for y ∈ R D the covariance matrix V
is invariant under translation: V(t + y) = V(y)).
(d) Is the measure tractable mathematically?
To ensure a positive answer to (a) we must clearly work in terms of ratios of the
components of compositions to ensure scale invariance. At first thought this might
suggest the use of variances and covariances of the form
There are a number of useful and equivalent ways (Aitchison, 1986, Chapter 4) in
which to summarize such a sufficient set of second-order moment characteristics. For
example, the logratio covariance matrix?
using only the final component x D as the common ratio divisor, or the centered
39
Chapter 2 The simplex sample space
Note that T is symmetric, has zero diagonal elements, and cannot be expressed as the
standard covariance matrix of some vector. It is a fact, however, that S, G and T are
equivalent: each can be derived from any other by simple matrix operations
(Aitchison, 1986, Chapter 4). A first reaction to this variation matrix characterization
is surprise because it is defined in terms of variances only. The simplest statistical
analogue is in the use of a completely randomized block design in, say, an industrial
experiment . From such a situation information about var(yi - yj) for all i, j is a
sufficient description of the variability for purposes of inference.
So far we have emphasized criteria (a), (b) and (d). Fortunately criterion (c) is
automatically satisfied since, for each of the dispersion measures dis ( p ⊕ x) = dis ( x)
for any constant perturbation p. We should also note here that the dimensionality of
the covariance parameter is ½ D(D -1) and so is as parsimonious as corresponding
definitions in other essentially (D-1)-dimensional spaces.
To sum up, importantly these dispersion characteristics are consistent with the
simplicial metric defined above and satisfy the following properties:
40
Chapter 2 The simplex sample space
There are substantial implications in the above development for the analysis of a
N × D compositional data set X = [ x1 ; . . . ; x N ] . A main feature is that the estimate
There is for the simplex a central result, analogous to the singular value
decomposition for data sets associated with the sample space RD, which plays a
central role. Any N x D compositional data matrix X with nth row composition x n can
be decomposed in a perturbation-power form as follows
x n = ξ ⊕ (u n1s1 ⊗ β 1 ) ⊕ . . . ⊕ ( u nm s m ⊗ β m )
where ξ is the centre of the data set, the s’s are positive ‘singular values’ in
descending order of magnitude, the β ’s are compositions, m ( ≤ D − 1 ) is a readily
defined rank of the compositional data set, and the u’s are power components specific
to each composition. In a way similar to that for data sets in RD we may consider an
approximation of order r <m to the compositional data set given by
x n( r) = ξ ⊕ (u n1 s1 ⊗ β1 ) ⊕ . . . ⊕ (u nr s r ⊗ β r ).
of the total variability of the N × D compositio nal data matrix as measured by the
trace of the centered logratio covariance matrix or equivalently in terms of total
mutual distances as
N
( N ( N − 1)) −1 ∑ ∆2 ( x n' , x n ).
n ' <n
We may also note here that the power-perturbation expression of the singular value
decomposition has exactly the same form as regression of a composition on some set
41
Chapter 2 The simplex sample space
of variables. The form is exactly that of what would obtained if the logratio form of
regression analysis in Aitchison (1986, Chapter 7) were transformed back into terms
of the simplex.
2.4.1 Introduction
In this section we first present some results in distributional calculus leading to a
central limit theorem analogous to the role of the multivariate normal limit in real
space. This leads us to the definition of a number of useful parametric classes of
distributions on the simplex sample space.
U D = {[ u1 . . . u D ]: u1 + . . . + uD = 0} .
Suppose that a composition x ∈ S D has density function f ( x) . Then define its Mellin
M x ( u) = ∫
SD
x1u1 . . . x DuD f ( x ) dx .
Note that the restriction of the vector u to the hyperplane U D rather than R D is
dictated by the need to meet the requirement of scale invariance, here ensured by the
fact that integrand is expressible in terms of ratios of the components of x. The Mellin
generating function has perturbation, power and limit properties similar to additive
42
Chapter 2 The simplex sample space
Property M1. M x ( 0) = 1.
M x ⊕ y ( u) = M x ( u) M y ( u).
M b⊕ x (u ) = b1u1 . . . bDuD M x (u ).
where ξ and Τ are the centre and D × D variation matrix with (i, j) element
differentiation process but we shall not pursue that here; see Aitchison (2001,
Section 6) for details.
43
Chapter 2 The simplex sample space
for yn = n −1/2 xn :
D
M ( u) = exp ∑ ui log ξii − 14 uTu T .
i=1
If the perturbations are random then the sum within the brackets will, under certain
regularity conditions which need not divert us here, tend for large n towards a
multivariate normal pattern of variability. It is a simple application of distribution
theory to deduce the form of the probability density function f ( x) on the unit
simplex as
where µ is (D – 1) row vector, Σ a positive definite square matrix of order D-1. This
44
Chapter 2 The simplex sample space
In contrast the popular Dirichlet class Di(α ) on the simplex with density function
f ( x) ∝ x1α1 , , , x αDD (x ∈ X D )
f ( x ) ∝ x α 1 K x α d
1 d
has so many drawbacks that it has virtually no role to play in simplicia l inference. For
example, it has no simple perturbation or power transformation properties and so is
ill-suited to the basic operations of the simplex. Moreover, it has so many inbuilt
independence properties that, apart from being a model of extreme independence, it
has almost no role to play in the investigation of the nature of the dependence
structure of compositional variability.
There are other classes of distributions on S D . The fact that the LD −1 ( µ , Σ ) class is
45
Chapter 2 The simplex sample space
For comparison with the fitting of parametric distributions to simplicial data or for use
when there is no satisfactory parametric class, resort may be made to a non-parametric
approach through kernel density estimation (Aitchison and Lauder, 1985).
Such linear contrasts have also emerged naturally as inner product in our study of the
algebraic-geometric structure of the simplex space. Just as linear combinations can be
used to define subspaces of the vector space RD by way of null spaces or range spaces,
so logcontrasts can be used to identify subspaces of the already identified vector
46
Chapter 2 The simplex sample space
Property L1. If composition x has geometric center ? and variation matrix ? then the
vector l = [l1 , . . . , lC ] ∈ RC , where
D
l r = ∑ ari log x i ( r = 1, K, C )
i =1
We may comment here on the negative signs that appear in this last result. This is
because of the nature of the variation matrix ?. This can easily be shown to have a
restricted form of negative definiteness in the sense that, for any u ∈ U D , u? uT , so
that the covariance matrix − 12 A T ΤA in the above result is positive definite.
Compositional data, in the form of N compositions each with D parts can be set out in
the form of a N × D matrix X = [ x ni ] , where x ni is the ith component of the nth
composition. In such a compositional data matrix compositions are set out in the rows
and the part components are set out in the columns. We shall denote the nth row of the
matrix, the nth composition, by x n .
47
Chapter 2 The simplex sample space
∧
µ = (1 / N ) j NT alr ( X ),
∧ ∧ ∧ T
Σ = {1 / ( N − 1)}{alr ( X ) alr ( X ) T − N µ µ } ,
∧
λ = (1 / N ) j NT clr( X ),
∧ ∧ ∧ T
Γ = {1 / ( N − 1)}{clr( X ) clr ( X ) T − N λ λ } .
x3
x2
P
x1
2 3
48
Chapter 2 The simplex sample space
Albita
Blandita Cornita
Let us now consider the estimation process with the alr transformation. The estimate
∧
µ of µ = E {alr ( x)} is [1.600 0.799], and this is shown as the point Q (red) in
Figure 2.6.d. The question of interest is how this point transforms back into the
appropriate simplex sample space, in this case triangle ABC. This is achieved by
computing
and this composition is shown as the point G (red) in the triangle ABC of Figure
2.6.c. It clearly lies within the cluster of data points within the triangle.
This estimated centre is in sharp contrast to what is almost universally quoted in raw
compositional data analysis, namely the arithmetic mean vector of the compositional
data set: (1 / N ) j Nt X , which, for the (A, B, C) subcomposition of hongite, is [0.443
49
Chapter 2 The simplex sample space
0.229 0.148], substantially different from the centre arrived at through the
transformation process. This composition is plotted as the point A (green) in the
triangle ABC of Figure 2.6.c, and compared with centre G is more like an outlier than
a central characteristic.
Albita
Arithmetic Mean
Geometric Mean
Blandita Cornita
Fig. 2.6.c Arithmetic average composition (A = green) and the geometric centre (G = red) for 3-part
compositional data set in a ternary diagram
4-
Arithmetic Mean
3- Geometric Mean
2-
log(Blandita/Cornita)
1-
0-
-1 -
-2 -
0' 1' 2' 3' 4' 5' 6'
log(Albita/Cornita)
Fig. 2.6.d Arithmetic average composition (A = green) and the geometric centre (G = red) for 3-part
compositional data set in the logratio diagram.
50
Chapter 2 The simplex sample space
∧
µ = [log( g1 / g D )...log( g D−1 / g D )]
and then
∧
alr −1 ( µ ) = C [ g1...g D ] ,
We give below the estimates of Σ , Γ , Τ for the hongite compositional data matrix:
∧
Σ=
0.1386 0.2641 -0.2233 0.1214
0.2641 0.6490 -0.7020 0.1444
-0.2233 -0.7020 0.9476 0.0116
0.1214 0.1444 0.0116 0.1871
∧
Γ=
0.0644 0.1791 -0.2441 0.0145 -0.0140
0.1791 0.5530 -0.7337 0.0266 -0.0249
-0.2441 -0.7337 0.9803 -0.0419 0.0394
0.0145 0.0266 -0.0419 0.0475 -0.0467
-0.0140 -0.0249 0.0394 -0.0467 0.0462
51
Chapter 2 The simplex sample space
∧
Τ=
0 0.2593 1.5329 0.0828 0.1386
0.2593 0 3.0007 0.5473 0.6490
1.5329 3.0007 0 1.1115 0.9476
0.0828 0.5473 1.1115 0 0.1871
0.1386 0.6490 0.9476 0.1871 0
We shall see later as we develop our methodology the various ways in which these
measures of dispersion come into play. For the moment we concentrate on a simp le
point. Hopefully by now early warners of the fallacy of using raw product- moment
correlations such as Chayes (1960, 1962), Krumbein (1962), Sarmanov and Vistelius
(1959) have reinforced Karl Pearson’s century-old warning and have at least raised
uneasiness about interpretations of product- moment correlations cov(x i , x j). Relative
variances such as var{log(x i /x j )} provide some compensation for such deprivation of
relationship between x i and x j in the sense that the ratio x i / x j is constant, replacing
the unusable idea of perfect positive correlation between x i and x j by one of perfect
proportionality. Again, the larger the value of var{log( xi / x j )} the more the
idea of zero correlation or independence between x i and x j. For scientists who are
uneasy about scales that stretch to infinity we can easily provide a finite scale by
52
Chapter 2 The simplex sample space
var{log( x i / x j )}
cv =
| E{log( x i / x j )}|
giving
1 −2 cv 1 +2 cv
gi x g
≤ i ≤ i ,
gj xj gj
where g i , g j are the geometric means of the ith and jth components.
D
In the study of unconstrained variability in R it is often convenient to have available
a measure of total variability, for example in principal component analysis and in
biplots. For such a sample space the trace of the covariance matrix is the appropriate
measure. Here we might consider trace(G) the trace of the symmetric centered
logratio covariance matrix. Equally we might argue on common sense grounds that
the sum of all the possible relative variances in Τ , namely
∑ var{log( x
i< j
i / x j )} ,
would be equally good. These two measures indeed differ only by a constant factor
and so we can define totvar(x), a measure of total variability, as
1 x
totvar ( x) = trace( Γ) = ∑
D i< j
var log i
x j
We may also note here that our scalar measure of distance, the simplicial metric, is
compatible with the above definitions of covariance analogous to the compatibility of
Euclidean distance with the covariance matrix of an unconstrained vector. As an
illustration of this consider how we might construct a measure of the total variability
for a N × D compositional data set . The above definition suggests that we may
53
Chapter 2 The simplex sample space
obtain such a total measure, totvar1 say, by replacing each var{log( xi / x j )} in the
totvar2 ( x) = ∑ ∆2 ( x m , xn ) ,
m< n
where here x m , x n denote the mth and nth compositions in X. The easily established
proportional relationship totvar1 = [D/{N(N-1)}] totvar2 confirms the compatibility
of the defined covariance structures and scalar measures of distance for compositional
variability.
A caveat on the use of the centred logratio covariance matrix. Because of the
symmetry of the centred logratio covariance Γ there is a temptation to imagine that
corr [log{x i / g ( x)}, log{x j / g ( x )}] is somehow a sound measure of a relationship
A B D E
A 1,00000 0,74025 -0,86129 -0,02096
B 0,74025 1,00000 -0,89208 -0,34832
D -0,86129 -0,89208 1,00000 -0,08899
E -0,02096 -0,34832 -0,08899 1,00000
whereas the (A, B, D, E) correlations extracted from the centred logratio correlation
matrix for the full composition (A, B, C, D, E) is
54
Chapter 2 The simplex sample space
A B c D E
A 1,00000 0,94865 -0,97117 0,26291 -0,25602
B 0,94865 1,00000 -0,99656 0,16410 -0,15593
C -0,97117 -0,99656 1,00000 -0,19412 0,18519
D 0,26291 0,16410 -0,19412 1,00000 -0,99765
E -0,25602 -0,15593 0,18519 -0,99765 1,00000
2.7.1 Introduction
In most of our applications we shall be assuming that there is a sufficiently general
parametric model which is the most complex we would consider as capable of
explaining, or useful in explaining, the experienced pattern of variability. We are
hesitant, however, to believe that the complexity of the model with its many
parameters is really necessary and so postulate a number of hypotheses which provide
a simpler explanation of the variability than the model. These hypotheses place
constraints on the parameters of the model or equivalently allow a reparametrisation
of the situation in terms of fewer parameters than in the model. We can then usually
show the hypotheses of interest and their relations of implication with respect to each
other and the model in a diagrammatic form in a lattice. The idea is most simply
conveyed by a simple example.
2.7.2 Example
Suppose that our data set consists of the measurements of some characteristic of a
sediment, such as specific gravity or, in a compositional problem, logratio of sand to
clay components, at different depths in a lake bed. Suppose that our aim is to explore
the nature of the dependence, if any, of characteristic y on depth u, and that we are
prepared to assume that the most complex possible dependence is with expected
55
Chapter 2 The simplex sample space
There is much to be said for having a clear picture of the lattice of hypotheses of
interest before attempting any statistic analysis of data and indeed before embarking
on any experimental or observational exercise.
56
Chapter 2 The simplex sample space
Fig. 2.7.a Lattice of hypotheses within the model with expected characteristic of the form
α + βu + γu 2 + log u .
57
Chapter 2 The simplex sample space
is ideal for our view of hypothesis testing within a lattice under the simplicity
postulate. In moving from a lower level to a higher level we are seeking a mandate to
complicate the explanation, to introduce further parameters. The rejection of a
hypothesis gives us a positive reassurance that we have reasonable grounds for
moving to this more complicated explanation.
Our lattice testing procedure can then be expressed in terms of the following rules.
1. In every test of a hypothesis within the lattice, regard the model as the
alternative hypothesis.
2. Start the testing procedure at the lowest level, by testing each hypothesis at
that level within the model.
3. Move from one level to the next higher level only if all hypotheses at the
lower level are rejected.
4. Stop testing at the level at which the first non-rejection of a hypothesis occurs.
All non-rejected hypotheses at that level are acceptable as ‘working models’
on which further analysis such as estimation and prediction may be based.
∧ ∧
Lh ( X ) = L{θ h ( X )| X ) and Lm ( X ) = L{θ m ( X )| X )
denote the maximised likelihood under the hypothesis (h) and the model (m),
respectively. The generalised likelihood ratio test statistic is then
R ( X ) = Lm ( X ) / Lh ( X ) ,
58
Chapter 2 The simplex sample space
and the larger this is the more critical of the hypothesis h we shall be. When the exact
distribution of this test statistic under the hypothesis h is not known, we shall make
use of the Wilks (1938) asymptotic approximation under the hypothesis h which
palaces c constraints on the parameters, the test statistic
Q ( X ) = 2 log{R( X )}
is distributed approximately as χ 2 (c ) .
For the staying in the simplex approach compositional regression uses the power and
perturbation operations in the following way: for a composition x regressing on a real
concomitant u we would set
x = ξ ⊕ (u ⊗ β ) ⊕ p ,
where ξ, β , p are all compositions, ξ playing the role of ‘constant’, β the role of
‘regression coefficient’ and p the role of the ‘error term’. The relation to the
transformation version is simply seen since
59
Chapter 2 The simplex sample space
alr ( x ) = α + uγ + error.
Although the staying- in-the-simplex and the transformation technique lead to the
same inferences a main difference will lie in the nature of the interpretation. In the
staying- in-the-simplex approach, for example, the definition of residual will be
∧ ∧
x Θ x$ , where xˆ = ξ ⊕ (u ⊗ β ) . We shall see in the next chapter through an example
how all these ideas fit into place.
∧ ∧ ∧
f ( x| data ) ∝ ( x1 . . x D ) −1[1 + {alr( x) − µ }[( N − 1)(1 + N −1 ) Σ ] − 1{alr ( x ) − µ } N /2
for compositional data matrix X. For large data sets there is little difference between
estimative and predictive fitted distributions, but for moderate compositional data sets
the difference can be substantial. The fact that geological sets often have N small (a
60
Chapter 2 The simplex sample space
few rock specimens) and D large (ten or more major oxides) should recommend the
use of the predictive distribution in applications to compositional geology.
and this is easily evaluated in terms of standard incomplete beta functions; for details
see Aitchison (1986, Section 7.10). Atypicality indices lie between 0 and 1, with near-
zero corresponding to a composition near the center of the distribution and near 1
corresponding to an extremely atypical composition lying in a region of very low
density. Atypicality indices are therefore useful in detecting possible outliers or
anomalous compositions. For inspection of a given data set it is advisable to use the
now standard jackknife or leaving-one-out technique to avoid resubstitution bias in
assessing the atypicality index of any composition in the data set. Again atypicality
indices for such a procedures are readily computable.
61
62
Chapter 3 From theory to practice: some simple apllications
A general question that we asked in Example 1 of Section 1.1 was whether any
differences could be detected between the hongite and kongite compositional
experience. After an alr logratio transformation of the compositional vectors we are
then faced with two multivariate normal samples with questions about equality of
mean vectors and covaria nce matrices. We have already obtained the estimates for
hongite in Section 2.6. The corresponding estimations for kongite are as follows.
The kongite centre is [0.486 0.201 0.114 0.105 0.094], again quite different from
the arithmetic mean [0.438 0.214 0.165 0.097 0.086].
∧
Σ= 0.1131 0.2352 -0.2008 0.0961
0.2352 0.6554 -0.7231 0.1061
-0.2008 -0.7231 1.0504 0.0911
0.0961 0.1061 0.0911 0.1951
∧
Γ= 0.0646 0.1807 -0.2441 -0.0014 0.0002
0.1807 0.5949 -0.7724 0.0026 -0.0058
0.2441 -0.7724 1.0123 -0.0012 0.0054
-0.0014 0.0026 -0.0012 0.0487 -0.0487
0.0002 -0.0058 0.0054 -0.0487 0.0489
∧
Τ= 0 0.2981 1.5652 0.1161 0.1131
0.2981 0 3.1520 0.6384 0.6554
1.5652 3.1520 0 1.0634 1.0504
0.1161 0.6384 1.0634 0 0.1951
0.1131 0.6554 1.0504 0.1951 0
Following the lattice strategy we can set out the model of two completely different
distributions and the hypotheses within that model in a self explanatory lattice
diagram (Figure 3.1). We are now within the structure of standard multivariate
63
Chapter 3 From theory to practice: some simple apllications
analysis apart from the constraints of the simplicity postulate in the order and nature
of the hypothesis testing within the model. To simplify matters here we use the
asymptotic forms of the generalised likelihood ratio test statistics, the Q of Section
2.7, to be compared against appropriate chi-squared percentiles.. The computational
procedures are uninteresting and can be found in Aitchison (1986, Section 7.5). The
only unusual feature is the computation for the hypothesis µ1 = µ 2 with different
covariance matrices, commonly referred to as the Fisher-Behrens problem.
Model
L (π1 , Σ1 )
5
L5 ( µ2 , Σ 2 )
No of parameters 28
µ1 = µ 2 Σ1 = Σ 2
No of parameters 24 Level 2 No of parameters 18
Test statistic 160.8 Test statistic 10.7
Level 1
µ1 = µ 2
Σ1 = Σ 2
Test statistic 46.7
Fig. 3.1 Lattice of hypotheses for comparison of hongite and kongite compositions
64
Chapter 3 From theory to practice: some simple apllications
The sequence of tests are then as follows. The hypothesis at level 1, that the hongite
and kongite distributions are identical compares the value of the Q-statistic 46.7
against the 95 percentile of χ 2 (14) , namely 23.7, and so we reject this hypothesis and
move up to testing the hypotheses at level 2. The hypotheses that the mean vectors are
equal, allowing different covariance matrices, has a Q-statistic value of 160.8, to be
compared with the 95 percentile of χ 2 (34 ) , namely 36.4, and so again this
hypothesis has to be rejected. Finally the hypothesis that the covariance matrices are
equal but that the mean vectors are different has a Q-statistic value of 10.7 to be
compared with the 95 percentile of χ 2 (6) , namely 12.6. Thus we cannot reject this
hypothesis and so would conclude that a reasonable working model would assume
equal covariance structure for hongite and kongite but with different mean vectors.
Along the lines of Section 2.9 we could apply the leaving-one-out technique to
compute the atypicality indices of the hongite and kongite sets. For example, for the
hongite set theses are:
Atypicality
Speciment A B C D E
index
1 48,8 31,7 3,8 6,4 9,3 0,7122
2 48,2 23,8 9,0 9,2 9,8 0,0171
3 37,0 9,1 34,2 9,5 10,2 0,1990
4 50,9 23,8 7,2 10,1 8,0 0,0318
5 44,2 38,3 2,9 7,7 6,9 0,7086
6 52,3 26,2 4,2 12,5 4,8 0,8284
7 44,6 33,0 4,6 12,2 5,6 0,8822
8 34,6 5,2 42,9 9,6 7,7 0,9689
9 41,2 11,7 26,7 9,6 10,8 0,1000
10 42,6 46,6 0,7 5,6 4,5 0,9873
11 49,9 19,5 11,4 9,5 9,7 0,1678
12 45,2 37,3 2,7 5,5 9,3 0,7012
13 32,7 8,5 38,9 8,0 11,9 0,8148
14 41,4 12,9 23,4 15,8 6,5 0,7405
15 46,2 17,5 15,8 8,3 12,2 0,2679
16 32,3 7,3 40,9 12,9 6,6 0,7508
17 43,2 44,3 1,0 7,8 3,7 0,7298
18 49,5 32,3 3,1 8,7 6,3 0,1454
19 42,3 15,8 20,4 8,3 13,2 0,5135
20 44,6 11,5 23,8 11,6 8,5 0,3501
21 45,8 16,6 16,8 12,0 8,8 0,0990
22 49,9 25,0 6,8 10,9 7,4 0,0739
23 48,6 34,0 2,5 9,4 5,5 0,8282
24 45,5 16,6 17,6 9,6 10,7 0,2800
25 45,9 24,9 9,7 9,8 9,7 0,2864
65
Chapter 3 From theory to practice: some simple apllications
The hongite specimens with atypicality index in excess of 0.95 are specimens no 8
(0.97) and no 10 (0.99). From inspection of the components we can see that for no 8
the atypicality probably arises from the relatively low value of the B component, and
for no 10 the relatively low value of the C component. Clearly we should place these
in a possible outlier category and refer the question of possible reasons to the
geologist.
What remains is to answer the question as to whether the new composition [44.0,
20.4, 13.9, 9.1, 12.6] can be regarded as typical of the hongite experience. Here a
straightforward application of the new case atypicality computation of Section 2.9
produces an atypicality index of 0.997, raising substantial doubt as to the hongiteness
of this new specimen.
We asked the question in Example 2 of Section 1.1 whether the Arctic lake sediments
of Table 1.1.2 were dependent of depth and, if so, what is the nature of the
dependence. Again the alr transformation of the 3-part sediment compositions
produces two-dimensional vectors in real space and we can then consider a
straightforward multivariate (bivariate) regression on some function of depth. In this
respect we follow the two-dimensional counterpart of the lattice used as an
introductory example in Section 2.7. This is shown in Fig 3.2.a in which the various
hypotheses on the functional form of the regressand and their rela tionships to each
other are detailed. The tests of these hypotheses within the model (based on ratios of
determinants of residual matrices) are a familiar part of unconstrained multivariate
analysis and need not be dwelt on here; details can be found in Aitchison (1986,
Sections 7.6-7.9).
66
Chapter 3 From theory to practice: some simple apllications
Fig. 3.2.a Lattice of hypotheses for investigation of the dependence of Arctic lake sediments on water
depth, showing residual determinants |Rh | and significance probabilities Ph .
The working model in alr logratio terms can thus be expressed in the following terms:
67
Chapter 3 From theory to practice: some simple apllications
The working model here clearly conforms to theories that as depth increases sand
gives way to silt and more so to clay with these differential effects decreasing with
depth. It would be interesting to compare the rates of change associated with such
processes for different locations.
Here we can show the regression line within the [sand silt clay] ternary diagram as
in Figure 3.2.b. The fit is obviously reasonably convincing.
68
Chapter 3 From theory to practice: some simple apllications
Sand
Direction of
increasing depth
Silt Clay
Fig. 3.2.b Arctic sediments and regression line of sediment on logarithm of depth.
Residual analysis can obviously be carried out either in terms of the transformed
regression or in a stay- in-the simplex format. Since the latter is less familiar we
demonstrate it briefly. This simply involves computation and investigation of the
residual compositions, namely composition Θ fitted composition . The residual
compositions here are:
69
Chapter 3 From theory to practice: some simple apllications
These should be spread around the centre of a ternary diagram, as in Figure 3.2.c. The
question of outliers among these residuals obviously arises. We report that there are
two sediment compositions -S12 and S7- with residual atypicality indices of 0.9998
and 0.9990, respectively.
sand
Arithmetic Mean
Geometric Mean
silt clay
Fig. 3.2.c Residuals of the Arctic sediments fitted by the regression line of sediment on logarithm of
depth
70
Chapter 3 From theory to practice: some simple apllications
In the literature of consumer demand analysis there have been only a few attempts to
incorporate compositional analysis directly into the analysis of household budgets.
This technique has many advantages and provides opportunities for new forms of
investigation. Suppose the w is a record of household expenditure on D mutually
exclusive and exhaustive commodity groups so that t = w1 + . . . + w D is total
expenditure and x = C( w) is the proportional pattern of allocation to the groups.
β i = εi − ε D (i = 1, . . ., D − 1)
71
Chapter 3 From theory to practice: some simple apllications
reasonable assumption that, for given total expenditure t, the pattern x is independent
of income s, leading to the above concentration on p( x| t ) .
As a simple start to our analysis of the household budgets of Table 1.1.3 let us first
apply tests of compositional invariance separately to the 20 single male households
and to the 20 single female households. Estimation and testing follows standard
unconstrained multivariate analysis, with the result that we reject the hypothesis of
compositional invariance for both single male and single female households. Thus for
each set there is strong evidence against the hypothesis of compositional invariance:
in other words, the patterns of expenditures do appear to depend on total expenditure.
From the relations hip above we see that although the D ‘income’ elasticities are not
determined by the D – 1 regression coefficients they can at least be placed in order of
magnitude. The commodity groups arranged in increasing order of elasticity, that is,
in conventional economic terminology from necessity to increasing luxury groups are
(for each gender):
The fact that the ordering is the same for males and females raises the question of
whether the dependence of pattern on total expenditure is really different for males
and females. This suggests that it might have been more fruitful to consider
hypotheses expressed in terms of the parameters of the model
Note that the separate compositional invariance hypotheses tested above are the
hypotheses βM = 0 and β F = 0 at level 2. All the hypotheses of the lattice can be
tested within the standard framework of multivariate linear modelling. We omit the
72
Chapter 3 From theory to practice: some simple apllications
details here but show on the lattice the significance probabilities associated with each
hypothesis, noting that we can move up the lattice by rejection until level 3 where we
would fail to reject the hypothesis α = α F .
The data of Table 1.1.4 are of a before- and after-nature. Each cow has had milk
composition determined at the beginning and at the end of the trial and so we have
essentially, in standard statistical analysis terms, paired comparisons. The major
difference is that we require to use a measure of difference appropriate to
compositional change and we have seen this to be perturbation. Thus for each cow we
record the set of perturbations below.
73
Chapter 3 From theory to practice: some simple apllications
74
Chapter 3 From theory to practice: some simple apllications
We can address the problems that we face here in three stages by posing three
questions.
Question 2. Is there similar evidence of a change in the treatment group? Here the Q-
statistic value is even larger, 75.6, again to be compared against the same percentile
value, and so we ha ve real evidence of change, with the centre of the treatment group
perturbations being
Question 3. The remaining question is to ask whether there are differences between
the control and treatment group perturbations and this question can be answered by
using a separate sample lattice identical to that for the hongite-kongite comparison of
Section 3.1. The three Q-statistics in the same order as for the previous example are
153.7, 45.6 and 212.0 to be compared against 95 percentiles of the chi-squared
distribution at 20, 15 and 5 degrees of freedom, all giving significant differences.
Thus there is strong evidence of differences between control and treatment changes.
75
Chapter 3 From theory to practice: some simple apllications
A good indication of what the nature of this change is can be obtained by computing
the perturbation difference between the control and treatment perturbation centres,
namely
Thus we can see that relatively there is enhancement of protein, carbohydrate and
calcium, presumably a successful nutritional result.
3.5.1 Introduction
For compositional problems in which the analysis depends in an assumption of
distributional form tests can be applied to assess the multivariate normality of the
transformed logratio vectors. For this purpose there is a whole battery of such tests:
univariate marginal tests, bivariate angle tests, multivaria te radius tests, with different
forms of test statistics: Anderson-Darling, Cramer-von Mises, Watson, with
accompanying useful graphical plots. All of these are examined in great detail in
Aitchison (1986) and need not divert us here from the main task of presenting
principles and practice of compositional data analysis. There has been discussion of
whether the choice of divisor in the alr transformation is crucial to the result. So here
we present a simple alternative avoiding this problem and which may also provide
some measure of the degree of success of the normality assumption.
76
Chapter 3 From theory to practice: some simple apllications
expressible in terms of Π = diag (π ) and Β as Γ = clr (Β)Π clr ( Β) T , where clr denotes
the operation of forming centred logratios from the rows of Β . Similarly a logistic-
Student distribution is obtained with the same centre and covariance matrix and with
ν degrees of freedom when u follows a St D−1 (ν ,0, I D −1 ) distribution. Further a
logistic skew normal distribution for x is obtained if u follows a multivariate skew
normal distribution with density function φ D−1 (u | 0, I D−1 ) Φ( uγ T ).
We note at this point that this characterisation in terms of the u-distribution is useful
for simulation purposes since it requires only simple simulation algorithms for
standardised distributions. Simulated data is here useful for testing the effectiveness
of the distributional form tests.
The first step is to estimate the parameters in the above characterisation. This is easily
done from the standard singular value decomposition of the doubly centred matrix Z
constructed form the log X. Suppose that the standard singular value decomposition of
Z is Z = UPV T , where U and V have zero-sum orthonormal columns and
P = diag ( p1 , . . . , p D−1 ) , where p1 , . . . , p D −1 are the singular values in descending
order of magnitude. Then it is easy to see that our estimates of the parameters ξ, Π , Β
clr −1 (V T ) . Also, and importantly for our distributional form testing here the u
77
Chapter 3 From theory to practice: some simple apllications
normality. In this procedure the first column of U$ represents the first order
approximations with a proportion p1 explained, the first two columns of U$ the second
order approximation with a proportion p2 explained, and so on. Thus in terms of the
marginal, bivariate and radius tests as described in Aitchison (1986, Section 7.3) the
sequence of testing proceeds as follows.
78
Chapter 3 From theory to practice: some simple apllications
79
Chapter 3 From theory to practice: some simple apllications
Whether grain-size data is considered as grouped ordinal data and some class of
univariate distributions is used to characterize each such ‘histogram’ or each
histogram is considered a compositional vector is certainly an open question. In
situations where the objective is to compare a number of weight_x_diameter profiles,
until a satisfactory class of distributions giving good fits to the histogram emerges, the
treatment of such data as compositional is certainly viable, with possibilities of
inferring the nature of an underlying process through the study of possible differential
perturbation processes.
80
Chapter 4 More complex compositional problems
with eigenvalues 38.26, 2.186, 0.142, 0.004. The measure of total variability is 1.69
and the first principal logcontrast ‘explains’ 94.2 percent of this variability, the second
bringing this to 99.6 percent. Thus we would be justified in regarding the variability
81
Chapter 4 More complex compositional problems
For the record we give the staying- in-the-simplex version of logcontrast principal
component analysis. This is by way of the simplicial singular value decomposition. It
could reasonably be argued that the major statistical tool in the analysis of
multivariate data associated with a metric vector space such as R D , R+D , S D must be
the associated singular value decomposition. For a N × D compositional data matrix
X with nth composition x n this, as we have already seen in Chapter 2, takes the form
x n = ξ ⊕ ( un1π1 ⊗ β1 ) ⊕ . . . ⊕ (u nR π R ⊗ βR ) .
It is interesting to apply this to a simple compositional data set such as hongite. The
details of the process of estimation will be taken up in the next section. Here we
simply record the results.
∧
β 1 0.222 0.329 0.189 0189
. 0.177
∧
β 2 0.169 0198
. 0179
. 0.089 0.364
∧ = 0.395
β 3 0103
. 0.143 0167
. 0.192 ,
∧ 0.125 0125
. 0.125 0.310 0.314
β
4
82
Chapter 4 More complex compositional problems
Specimen u1 u2 u3 u4
1 0,1529 0,2871 0,1325 -0,2833
2 0,0177 0,1155 -0,0512 0,0403
3 -0,2512 0,0646 0,0109 0,0705
4 0,0493 -0,0258 0,1437 0,0351
5 0,2046 0,0752 -0,2816 -0,2748
6 0,1314 -0,3607 0,2511 -0,2053
7 0,1359 -0,2561 -0,4477 -0,0110
8 -0,3336 -0,1047 0,5093 0,0235
9 -0,1928 0,0984 0,0224 0,0122
10 0,4027 0,0494 0,1957 0,4844
11 -0,0298 0,0803 0,1656 -0,1737
12 0,2084 0,3751 -0,0332 -0,1205
13 -0,2796 0,2203 -0,1642 0,3033
14 -0,1612 -0,3733 -0,2201 0,0101
15 -0,0858 0,2479 0,0340 -0,1349
16 -0,2951 -0,2955 -0,1939 -0,0771
17 0,3560 -0,2078 -0,0156 -0,1994
18 0,1847 -0,0418 0,1284 -0,0841
19 -0,1311 0,2812 -0,1400 0,1719
20 -0,1750 -0,1090 0,2042 -0,1500
21 -0,0952 -0,0859 -0,0731 0,1594
22 0,0614 -0,0949 0,0350 0,1608
23 0,2174 -0,1375 0,1050 0,4186
24 -0,1035 0,1131 -0,0370 -0,2569
25 0,0114 0,0848 -0,2802 0,0808
The connection between this and the logcontrast principal components is simply that
the eigenvalues correspond to the squares of the π ’s, and that the a-coefficients of the
logcontrast approach are the clr transforms of the β ’s. The interpretation remains the
same.
The biplot (Gabriel, 1971, 1981) is a well established graphical aid in other branches
of statistical analysis. Its adaptation for compositional data is simple and can prove a
useful exploratory and expository tool. For a compositional data matrix X the biplot is
based on a singular value decomposition of the doubly centered logratio matrix Z =
[zri], where
N
z ri = log{ x ri / g ( xr )} − N
−1
∑ log{ x
ri / g ( x r )}..
r =1
83
Chapter 4 More complex compositional problems
Links, rays and covariance structure. The links and rays provide information on the
covariance structure of the compositional data set.
|ij| 2 ≈ var{log( xi / x j )} ,
| Oi | 2 ≈ var[log{ x i / g ( x)}] ,
84
Chapter 4 More complex compositional problems
It is tempting to imagine that this last relation can be used to replace discredited
corr(x i, x j) as a measure of the dependence between two components. Unfortunately
this measure does not have subcompositional coherence.
A particular case of this is when the two links are at right angles so implying that
cos(iMk ) ≈ 0 and there is zero correlation of the two logratios. This is useful in
investigation of subcompositions for possible independence.
Case markers and recovery of data. Such markers have the easily established property
that Ocn . ji represents the departure of log(x i /x j) for case cn from the average of this
logratio over all the cases. Let P and Pn in Figure 4.3.b denote the projections of the
85
Chapter 4 More complex compositional problems
center O and the compositional marker cc on the possibly extended link ji . Then
Ocn . ji = ± |PPn | | ji |,
where the positive sign is taken if the directions of PPn and ji are the same, otherwise
the negative sign is taken. A simple interpretation can be obtained as follows.
Consider the extended line ji as divided into positive and negative parts by the point
P, the positive part being in the direction of ji from P. If Pn falls on the positive
(negative) side of this line then the logratio of log(x ni /x nj) of the nth composition
exceeds (falls short of) the average value of this logratio over all cases and the further
Pn is from P the greater is this excedance (shortfall); if Pn coincides with P then the
compositional logratio coincides with the average.
A similar form of interpretation can be obtained from the fact that Ocn .Oi represents
the departure of the centered logratio log{x ni /g(xn )} of the nth composition from the
average of this centered logratio over all replicates.
It must be clear from the above aspects of interpretation that the fundamental
elements of a compositional biplot are the links, not the rays as in the case of variation
86
Chapter 4 More complex compositional problems
diagrams for unconstrained multivariate data. The complete set of links, by specifying
all the relative variances, determines the compositional covariance structure and
provides direct information about subcompositional variability and independence. It is
also obvious that interpretation of the relative variation diagram is concerned with its
internal geometry and would, for example, be unaffected by any rotation or indeed
mirror- imaging of the diagram.
Another fundamental difference between the practice of biplots for unconstrained and
compositional data is in the use of data scaling. For unconstrained data if there are
substantial differences in the variances of the components, biplot approximation may
concentrate its effort on capturing the nature of the variability of the most variable
components and fail to provide any picture of the pattern of variability within the less
variable components. Since such differences in variances may simply arise because of
scales of measurement a common technique in suc h biplot applications is to apply
some form of individual scaling to the components of the unconstrained vectors prior
to application of the singular value decomposition. No such individual scaling is
necessary for compositional data when the analysis involves logratio transformations.
Indeed, since for any set of constants (c1 , . . . , cD), we have
it is obvious that the covariance structure and therefore the compositional biplot are
unchanged by any differential scaling or perturbation of the compositions. This, of
course, is simply an aspect of the perturbation invariance of measures of dispersion
for compositional data. Only the centering process is affected by such differential
scaling. Moreover any attempt at differential scaling of the logratios of the
components would be equivalent to applying differential power transformations to the
components of the compositions, a distortion which would prevent any compositional
interpretatio n from the resulting diagram.
87
Chapter 4 More complex compositional problems
In the MN blood group system there are three genotypes, namely MN, MM, NN, and
the proportions of these genotypes within a population provide a blood group
composition for that population. Table 4.4.1 shows these compositions for 24 native
populations; the data are reconstructed from Figure 12 of Gower (1987). Let us
suppose that we know nothing about gene tic theory and decide to explore this data set
by the construction of a relative variation biplot (Fig. 4.4.a) as described in Section
4.3. For such a 3-part compositional data set the biplot retains all the variability and
provides an exact representation of the pattern of variability. The approximate
collinearity of the vertices MN, MM, NN indicates that the variability is mainly one-
dimensional and suggests a logcontrast principal component analysis to determine the
form of the constant logcontrast. Suc h a principal component analysis (Aitchison,
1986, Section 8.3) yields the following eigenvalues and logcontrasts
19
= Outliers
The near-constant logcontrast arises from the near-zero second eigenvalue. Moreover
the fact that the coefficient 0.816 is approximately twice the coefficients 0.406 and
0.411 suggests that we can obtain a substantial simplification to our interpretation if
88
Chapter 4 More complex compositional problems
We can obtain an estimate 1.348 of the constant from the average value of the
logcontrast over the sample of 24 compositions. Moreover the fact that this is
approximately log 4 encourages the following conjecture,
2
MN = 4MM × NN.
Thus through examination of the relative variation biplot and its clear indication of
the need for a logcontrast principal component analysis we have been led to the
rediscovery of the fundamental Hardy-Weinberg equilibrium curve.
89
Chapter 4 More complex compositional problems
Pr. Components:
MM NN
Fig. 4.4.b The ternary plot of the MN blood group compositions and the Hardy-Weinberg ‘curve’.
Table 4.5.1 reports a compositional data set which will be new to everyone and so no
preconceived ideas will dictate our analysis. It consists of 20 6-part mineral
compositions of goilite rocks from a site on the edge of Loch Goil near Carrick
Castle. I am told that this is an interesting site so let us see what we can discover
90
Chapter 4 More complex compositional problems
about it.
Inspection of the variation array of Table 4.5.2 provides little insight into the nature of
variability of the goilite compositions of Table 4.5.1. In contrast, the relative variation
biplot of Figure 4.5.a, retaining 98.2 per cent of the total compositional variability,
allows easy identification of a number of characteristics. For simplicity in our
interpretation we shall use only the initial letters to identify the mineral parts. First,
we see that the de link is by far the longest indicating the greatest relative variation in
the ratios of components is between d and e. Secondly, the near coincidence of the
vertices a and c implies that the a and c are in almost constant proportion with the
approximate relationship of a/c = 0.55 easily obtained from Table 4.5.1 or from the
estimate -0.605 for E{log(a/c)} in the variation array of Table 4.5.2. Note that in the
ternary diagram of the abc subcomposition in Figure 4.5.b the representative
compositional points lie roughly on a ray through the vertex b. Applying the
approximate 95 percent range formula and noting that
and coefficients of variation for log( e/f) and log(a/e) are -0.716 and -0.214 we obtain
the ranges
0.073 < e / f < 1.59; 0.42 < a / c < 0.71.
Cumulative proportion 1
explained: 14
0,89
0,98 8 6
Broomite
0,99 13
1 22 20 3
1 7
Dhuite 2 Eckite
Fyneite 11
4 19 Carronite
Arkaigite
17 12 16
10
21 9
15
5 18
= Outliers
91
Chapter 4 More complex compositional problems
Arkaigite
Geometric Mean
Broomite Carronite
Fig. 4.5.b Ternary diagram for (arkaigite, broomite, carronite) subcompositions showing the near
proportionality of arkaigite to carronite
Thirdly and most strikingly we see the near-orthogonality of the ab (or cb) link and
the links de, df and ef. We can immediately infer that the ratios d/e, d/f and e/f are
independent of the ratio of a/b or c/b. Another way of expressing this feature is to
state that the subcompositions [c,d,e] and [a,b] are independent. A formal test of this
hypothesis of subcompositional independence (Aitchison, 1986, Section 10.3) results
in a significance probability 0.27 confirming our conclusion. Fourthly, the collinearity
of the three mineral links de, df and ef and the consequent one-dimensionality of the
pattern of variability of this (d, e, f)-subcomposition, confirmed by the corresponding
subcompositional ternary diagram of Figure 4.5.c, implies some relationship between
the proportions of the minerals d, e and f. Direct investigation by logcontrast principal
component analysis leads to the following eigenvalues and corresponding logcontrast
principal components:
92
Chapter 4 More complex compositional problems
Dhuite
Geometric Mean
Eckite Fyneite
Fig. 4.5.c Ternary diagram of the (dhuite, eckite, fyneite) subcompositions showing the one-
dimensional pattern of variability of this subcomposition
The near-constant logcontrast arises from the near-zero second eigenvalue. Moreover
the fact that the coefficients are roughly in the ratios of -2 : -1 : 3 suggests that we
can make a substantial simplification to our interpretation if we consider the constant
logcontrast
where the constant value is estimated from the sample average of the logcontrast. This
can be simply converted into the approximate relationship;
3
e / f = 0.85 × (f / d)
As a final comment here we note that any subcomposition can be viewed as a set of
logcontrasts (Aitchison, 1984) and so are included in any logcontrast principal
component analysis for stud y of the dimensionalty of the pattern of compositional
variability.
93
Chapter 4 More complex compositional problems
Inspection of the variation array of Table 2 provided little insight into the nature of
variability of the colour compositions of Table 1.1.5. In contrast, the relative variation
biplot of Figure 4.6.a, retaining 98.2 per cent of the total compositional variability,
allows easy identification of a number of characteristics. First, we see that the red-
yellow link is by far the longest indicating that the greatest relative variation in the
pictures is between red and yellow. Secondly, the near coincidence of the vertices
black and other implies that the artist uses black and the non-primary colour in almost
constant proportion with the approximate relationship of other/black = 1.85 easily
estimated from Table 1.1.5 or from the estimate 0.605 for E{log(other/black)} in the
variation array of Table 2. Thirdly only two compositions, those of paintings 14 and
22, have atypicality indices 0.999 and 0.953 greater than 0.95. From the position of
the marker for composition 14 in Figure 4.6.a it is clear that this atypicality is
probably due to a combination of its unusually high ratios of yellow to blue with that
of white to black, facts easily confirmed from Table 1.1.5. Composition 22 is atypical
because of its high blue to yellow, white to black and white to other ratios.
21 9
15
5
18
= Outliers
Fourthly and most strikingly we see the near-orthogonality of the black-white link (or
other-white link) and the links blue-red, blue-yellow, red-yellow associated with the
94
Chapter 4 More complex compositional problems
primary colours. Thus, we can immediately infer that the ratios in which the artist
uses the primary colours are independent of the ratio of black to white, or other to
white. Another way of expressing this feature is that the subcompositions (blue, red,
yellow) and (black, white) are independent; a formal test of this hypothesis of
subcompositional independence (Aitchison, 1986, Section 10.3) results in a
significance probability 0.27 confirming our conclusion. Fifthly, the collinearity of
the three primary colour links and the consequent one-dimensionality of the pattern of
variability of the primary subcomposition, confirmed by the corresponding
subcompositional triangular diagram of Figure 4.6.b implies some relationship
between the proportions of the primary colours used. Investigation along the lines of
the previous example leads to an approximate relationship
3
or, in terms of ratios of colour use, red/yellow ∝ (yellow/blue) . Whether this
suggested 'cubic rule' is worth further investigation as an artistic principle is
questionable, but such relationships can play an important role in compositional
analysis (Aitchison, 1998).
blue
red yellow
95
Chapter 4 More complex compositional problems
Fig. 4.7 Conditional biplot showing the dependence of the mineral composition on the major
oxide compositions for tektite compositions
96
Chapter 4 More complex compositional problems
chemical association between minerals and major oxides, we see that, apart from
SiO 2 , each of the other seven major oxides is associated with only one of the minerals,
for example MgO is contained only in enstatite. In the biplot diagram each of these
seven major oxide vertices is close to its corresponding mineral vertex. This means
that the link associated with any two of these major oxides is nearly parallel to the
link of the corresponding minerals and so the mineral logratios are all highly
correlated with the corresponding major oxide logratios. It is in this sense that the
conditional biplot identifies the chemical relationships. Moreover even SiO 2 , which is
a constituent of all eight minerals is nevertheless primarily identified with quartz
which is simply its oxide self.
All of this seems splendid until the quality of the approximation is investigated. The
proportion of the covariance matrix G which is retained by the biplot is only 0.204.
The reason is not too difficult to detect. The singular value decomposition has
singular values 1.00, 1.00, 1.00. 0.999, 0.994. 0.868, 0,060 and it would require a
fourth order approximation and a four-dimensional biplot to raise the quality to a
reasonable 0.911 proportion retained. The reason for this disappointing quality is
easily determined. It lies in the fact that within the constraints of compositional data
each mineral is almost independently related to its major oxide, in the sense that each
mineral logratio is almost perfectly linearly related to the corresponding major-oxide
ratio. An analogous situation with unconstrained data would be the assemblage of
97
Chapter 4 More complex compositional problems
trace{Γ ( s )} / trace{Γ ( x )}
as the proportion of the total variation explained by the subcomposition. In such forms
of analysis it should be noted that a (1, . . . , C–1)-subcomposition is a set of C–1
particular logcontrasts and so the variability explained by a C-part subcomposition
can also be compared with that achieved by the first C–1 principal logcontrasts.
We can illustrate this simply for the hongite experience of Table 1.1.1a. For example
for 3-part subcompositions we have the 10 possible subcompositions in ascending
order of variability (where 1=A. . . . , 5=E):
98
Chapter 4 More complex compositional problems
Proportion of
Subcompositions variability
explained
A D E 0.08
A B D 0.17
A B E 0.20
B D E 0.27
C D E 0.44
A C E 0.51
A C D 0.53
B C E 0.90
B C D 0.91
A B C 0.94
99
Chapter 4 More complex compositional problems
between two types of limestone. Thomas and Aitchison (1998) show that of the 17-
part (major-oxide, trace element) composition a simple major-oxide subcomposition
(CaO,Fe2 O3 ,MgO) provides excellent discrimination, equal to that of the full
composition. Figures 4.9.a and 4.9.b show the separation in logratio and ternary
diagram space, respectively.
Dufftown
Inchory
1-
0-
log(Fe2O3/MgO)
-1 -
-2 -
-3 -
Fig. 4.9.a Scattergram of logratios log(CaO / MgO) and log(Fe2O3/ MgO) for Scottish limestones
CaO
Dufftown
Inchory
Fe2O3 MgO
Fig. 4.9.b Ternary diagram of ‘centre perturbed’ (CaO, Fe2O3, MgO) subcompositions of Scottish
limestones
100
Chapter 4 More complex compositional problems
The simplest model for such investigations is clearly when the expected response is a
logcontrast of the ingredients and it is clear from the discussion of the previous
section how investigation of subcompositional hypotheses would proceed. It is,
however, possible to have a more general model involving second power terms in
logratios, together with a hierarchy of hypotheses of inactivity of parts, of partition
additivity , completely additive. For full details on the motivation for such definitions,
for the practical meaning of the hypotheses and for implementation of a testing lattice,
see Aitchison and Bacon-Shone (1984) and Aitchison (1986, Sections 12.4-5).
Because of the constant sum constraint, equivalently because of the nature of the
simplex sample space, independence hypotheses must clearly take radically different
forms from those associated with R D . For example, the analogue of complete
independence of components in unconstrained space is for compositional data
complete subcompositional independence, in which any subset of non-overlapping
subcompositions is independent. These, of course can be specified in terms of
associated logratios and in fact result in a particular parameterisation of the
101
Chapter 4 More complex compositional problems
covariance structure. Tests of such hypotheses are readily available; see, for example,
Aitchison (1986, Chapter 10).
We use the time budgets of Table 1.1.6 to provide a very simple example, and
examine the hypothesis that the work and leisure subcompositions are independent.
This is almost clear in the biplot of Figure 4.11, in which the links of the working
parts are roughly at right angles to the links of the leisure parts, indicating lack of
correlation. The formal test involves testing whether the correlations between work
logrations and leisure logratios are all zero. This is easily assessed and results in a
significance probability of 0.56, so that we cannot reject the hypothesis of
independence of work and leisure parts of the statistician’s day.
4 17
= Outliers
102
Chapter 5 Compositional processes
5.1 Introduction
Most scientists are interested in the nature of the process which has led to the data
they observe, not least geologists in their search for explanations of how our planet
has developed geologically. Unfortunately they are seldom in the fortunate position of
observing a closed system where fundamental principles such as conservation of mass
and energy apply. Commonly the only data available take the form of compositional
data providing information only on relative magnitudes of the constituents of the
specimens. In some disciplines there is a wide variety of terminology associated with
such realised or hypothetical compositional processes. For example, geological
langua ge contains many terms to describe a whole variety of envisaged geochemical
processes, such as denudation, diagenesis, erosion, gravity transport, metasomatism,
metamorphism, orogenesis, polymetamorphism, sedimentation, transportation,
weathering. Often the data for the study of such possible processes consist of variable
compositional vectors, such as major oxide compositions, major and minor element
compositions, granulometric weight by diameter profiles such as (sand, silt, clay)
sediments or palaeontological compositions such as foraminifera abundances. It is our
purpose here to study a variety of ways in which statistical analysis of the variability
in such data sets may be directed towards quantification of such processes and also,
where there may be rival hypotheses as to the nature of the geological process, the
extent to which the nature of the variability may be used to distinguish between the
hypotheses.
Many of the terms used by geologists to describe the processes they study appear to
envisage some kind of differential change in the components of the composition –
denudation, erosion, sedimentation, metamorphism, weathering. Since differential
103
Chapter 5 Compositional processes
t0
i
x (t) = x (t0 ) ⊕ H (t ) ⊗ β ,
104
Chapter 5 Compositional processes
t0
With actual compositional data the regression either in logratio terms or in staying in
the simplex mode is easily accomplished. The important feature here is the possibility
of alternative approaches to interpretation.
We continue the example used by Aitchison and Thomas (1998) to illustrate various
ways of describing the process by which the variability of Arctic lake sediments may
depend on depth. The previous study arrived at logratio regression equations
log{sand(t) / clay(t)} = 9.70 -2.74 log t; log{silt(t) / clay(t)} = 4.80 - 1.10 log t,
x ( t + dt ) = x ( t ) ⊕ 13 [1 − 146
. / t ), 1 + 0,18 / t , 1 + 1.28 / t ].
All provide the same interpretation of the process: as depth increases sand gives way
to silt and more so to clay with differential effects decreasing with depth. Which
characterisation of the process is chosen may well depend on personal choice.
105
Chapter 5 Compositional processes
Given a compositional data set X = [ x1 , ;...; x N ] forming some possible process but
with no obvious driving variable such as time, temperature or pressure, it is of interest
to explore the possibility that there may be some unknown process at work. A suitable
tool for such an investigation is the simplicial singular value decomposition
(Aitchison et al, 2002). With each x n expressible in power-perturbation form:
where ξ is the centre of the data set. Here the hope is that the singular values are
decreasing so rapidly that the variability will be described by a low order truncation of
the power-perturbation combination.
Suppose that for the Arctic lake sediments we were unaware of the possibility of
depth as a process variable. Then the application of the singular value decomposition
gives a representation
x n = ξ ⊕ ( un1s1 ⊗ β1 ) ⊕ ( un2 s2 ⊗ β2 ),
where
s1 = 9,51, s2 = 1..85;ξ = [0.1780.5640.258];
β1 = [0.1360.3040.560],β 2 = [ 0.1960.6290.175].
The second order approximation explains 96.3 of the total variability. This should
encourage the search for a possible driving variable. If depth is considered and plotted
against the resulting u’s a log- like scatter of points is obtained confirming the nature
of the earlier regression analysis.
106
Chapter 5 Compositional processes
description null( Β⊥ ) . This is convenient if the objective is to produce some law- like
description of the process. This was the situation in Aitchison and Thomas (1998) in
the study of olivines. There, for example for kimberlitess, the range space approach
would have resulted in
0. 089 0. 011
Fe Mg
= 1.13.
Si Si
107
Chapter 5 Compositional processes
x = π1ξ1 + . . . + π C ξC
The additive nature of such modelling does not mean that basic principles of
compositional data analysis are thereby neglected. For example an approach to the
so-called endmember problem where a set of say C endmember compositions
ξ1 , . . . , ξC is sought such that each composition x n (n = 1, . . . , N) of the data set can
∑ ∆ (x 2
n ,ξn )
n =1
108
Chapter 5 Compositional processes
∑∆ 2
(ξ b , ξ c ).
b< c
All that are required to implement such a procedure are good algorithms for the
minimisation of functions over a product of simplices. Our own, still at the testing
stage, are based on an iterative search program where each step involves perturbations
of the attained position, For the hongite data set for example on the supposition that
there are three endme mbers these turn out to be
Each of the processes – differential perturbation and convex linear mixing – will
result in fitted compositions, say x nP and x nC , for each of the observed compositions
x n . The goodness of fit G P and G C of each of these processes may then be reasonably
judged in terms some such measures as
N N
G = ∑ ∆ ( x n , xn ),G = ∑ ∆ ( x n , xn ).
P 2 P C 2 C
n =1 n =1
For the hongite data set we can compare these goodness of fit measures at various
orders of approximation:
109
Chapter 5 Compositional processes
It is fairly clear that for this data set the differential perturbation model has the edge
over the convex linear model. This is in concurrence with the known method by
which the data set was originally simulated.
110
Postlude Pockets of resistance and confusion
Postlude
The describers
As long as we are just describing a compositional data set we can use any
characteristics. In describing compositional data we can use arithmetic means,
covariance matrices of raw components and indeed any linear methods such as
principal components of the raw components. After all we are simply describing the
data set in summary form, not analyzing it (Le Maitre, 1982).
The openers
The fact that most compositions are recorded by first arriving experimentally at an
'open vector' of quantities of the D parts constituting some whole and then forming a
'closed vector', the composition, seems to have led to a particular form of wishful
thinking. All will be resolved if we can reopen the closed vector in some ideal way
and then perform some statistical analysis on the open vectors to reveal the inner
secrets of the compositions. The notion that there is some magic powder which can be
sprinkled on closed data to make them open and unconstrained dies hard. Most
recently Whitten (1995) takes as closed vectors major-oxide compositions of rocks
111
Postlude Pockets of resistance and confusion
The pathologists
A study of the compositional literature suggests that much of compositional data
112
Postlude Pockets of resistance and confusion
analysis in the period 1965-85 was directed at trying to find some inspiration from
calculation of crude correlations and other linear methods. Those who were aware that
things go wrong with crude correlations attempted to describe the nature of the
disease instead of trying to find a cure. Thus we have many papers with titles such as
‘An effect of closure on the structure of principal component’ (Chayes and
Trochimczyk, 1978) and ‘The effect of closure on the measure of similarity between
samples’ (Butler, 1979).
The non-transformists
Despite his warning about the spuriousness of correlations of crude proportions,
Pearson would have been unhappy about the solution through logratio
transformations. He had bitter arguments (Pearson, 1905, 1906) with some of the
rediscoverers (for example, Kapteyn, 1903 ) of the lognormal distribution. This lay in
his distrust of transformations: what can possibly be the meaning of the logarithm of
weight? I had hoped that we were now sufficiently convinced, particularly in geology,
that the lognormal distribution has a central role to play in many geological
applications. But the mention of a logratio of components still brings forth that same
resistance. What is the meaning of such a logratio is a question posed by Fisher in the
discussion of Aitchison (1982) and even more recently by Whitten (1995). We hope
that the analogy with the lognormal distribution and the comments earlier that every
piece of compositional statistical analysis can be carried out within the simplex may
mean that this resistance will soon collapse.
The sphericists
There have been various attempts to escape from the unit simplex to what are thought
to be simpler or more familiar sample spaces. One popular idea (Atkinson and
Stephens in the discussion of Aitchison (1982), and Stephens(1982)) is to move from
the unit simplex S D to the positive orthant of the unit hypersphere by the
distributions on the hypersphere. There are two insuperable difficulties about such a
transformation. First, the transformation is only onto part of the hypersphere and so
established distributional theory, associated as it is with the whole hypersphere, does
113
Postlude Pockets of resistance and confusion
not apply. There is clearly no way round this since the simplex and hypersphere are
topologically different: there is no way of transforming a triangle to the surface of a
two-dimensional sphere. As serious a difficulty is the impossibility of representing the
fundamental operation of perturbation on the simplex as something tractable on the
hypersphere. This is not surprising since the fundamental algebraic operation on the
hypersphere is rotation and this bears no relationship to the structure of perturbation.
The additional step of Stanley (1990) in transforming z to spherical polar coordinates
further complicates such issues. Although the angles involved are scale invariant
functions of the composition their relationship to the composition is bewilderingly
complicated. Moreover there would be no subcompositional coherence since in terms
of our previous discussion scientist B would be transforming onto a hypersphere of
lower dimension with impossibly complicated relationships between the angles used
by scientist A and B.
Conclusion
The only sensible conclusion, it seems to me, is to reiterate my advice to my students.
Recognize your sample space for what it is. Pay attention to its properties and follow
through any logical necessities arising from these properties. The solution here to the
apparent awkwardness of the sample space is not so difficult. The difficulty is facing
up to reality and not imagining that there is some esoteric panacea.
114
References
References
AITCHISON, J.(1981a). A new approach to null correlations of proportions. J. Math. Geol. 13, 175-
189.
AITCHISON, J. (1981b). Distributions on the simplex for the analysis of neutrality, in Statistical
Distributions in Scientific Work (Taillie, C., Patil, G.P. and Baldessari, B.,eds), Vol 4, pp.147-156.
Dordrecht, Holland: D. Reidel Publishing Company.
AITCHISON, J. (1981c). Some distribution theory related to the analysis of subjective performance in
inferential tasks, in Statistical Distributions in Scientific Work (Taillie, C., Patil, G.P. and
Baldessari, B., eds), Vol 5, pp.363-385. Dordrecht, Holland: D. Reidel Publishing Company.
AITCHISON, (1982). The statistical analysis of compositional data (with discussion). J. R. Statist. Soc.
B 44, 139-177.
AITCHISON, J.(1983). Principal component analysis of compositional data. Biometrika 70, 57-65.
AITCHISON, J. (1984a). The statistical analysis of geochemical compositions. J. Math. Geol. 16, 531-
64.
AITCHISON, J. (1984b). Reducing the dimensionality of compositional data sets. J. Math. Geol. 16,
617-36.
AITCHISON, J. (1985). A general class of distributions on the simplex. J. R. Statist. Soc. B 47, 136-
146.
AITCHISON, J. (1986). The Statistical Analysis of Compositional Data. London: Chapman and Hall.
AITCHISON, J. (1989a). Letter to the editor. Reply to "Interpreting and testing compositional data" by
Alex Woronow, Karen M. Love, and John C. Butler. J. Math. Geol. 21, 65-71.
AITCHISON, J. (1989b). Letter to the Editor. Measures of location of compositional data sets. J. Math.
Geol. 21, 787-790.
AITCHISON, J. (1990a). Letter to the Editor. Comment on "Measures of Variability for Geological
Data" by D. F. Watson and G. M. Philip, J. Math. Geol. 22, 223-6.
AITCHISON, J. (1991a). Letter to the Editor. Delusions of uniqueness and ineluctability. J. Math.
Geol. 23, 275-277.
AITCHISON, J. (1991b). A plea for precision in Mathematical Geology. J. Math, Geol. 23, 1081-1084.
AITCHISON, J. (1992a). The triangle in statistics. Chapter 8 in The Art of Statistical Science. A
Tribute to G. S. Watson (ed. K. V. Mardia), pp 89-104. New York: Wiley
AITCHISON, J. (1992b). On criteria for measures of compositional differences. J. Math. Geol. 24,
365-380.
AITCHISON, J. (1993). Principles of compositional data analysis. In Multivariate Analysis and its
Applications (eds. T.W. Anderson. I. Olkin and K.T. Fang), p.73-81. Hayward, California:
Institute of Mathematical Statistics.
115
References
AITCHISON, J. (1997). The one-hour course in compositional data analysis or compositional data
analysis is easy. In Proceedings of the Third Annual Conference of the International Association
for Mathematical Geology (ed. Vera Pawlowsky Glahn). 3-35. Barcelona: CIMNE
AITCHISON, J. (1999a). Logratios and natural laws in compositional data analysis. J. Math. Geol. 31,
563-89.
AITCHISON, J. (2002), Simplicial inference. In Algebraic Methods in Statistics and Probability, eds
M. A. G. Viana and D. St. P. Richards, 1-22. Contemporary Mathematics Series 287. Providence,
Rhode Island: American Mathematical Society.
AITCHISON, J. and BROWN, J.A.C. (1957). The Lognormal Distribution. Cambridge University
Press.
AITCHISON, J. and GREENACRE, M. (2002) Biplots for compositional data. Applied Statistics, 51,
num. 4, pp. 375-392.
AITCHSION, J. and LAUDER, I.J. (1985). Kernel density estimation for compositional data. Applied
Statistics 34,129-137.
AITCHISON, J. and SHEN, S. M. (1980). Logistic-normal distributions: some properties and uses.
Biometrika 67, 261-272.
AITCHISON, J. and SHEN, S.M. (1984). Measurement error in compositional data. J. Math. Geol. 16,
637-50.
AITCHISON J. and THOMAS, C. W. (1998) Differential perturbation processes: a tool for the study of
compositional processes, in Buccianti A., Nardi, G. and Potenza, R., eds., Proceedings of IAMG'98
- The Fourth Annual Conference of the International Association for Mathematical Geology: De
Frede Editore, Napoli (I), p. 499-504.
116
References
BUTLER, J. C. (1979). The effect of closure on the measure of similarity between samples: J. Math.
Geol. 11, 73-84.
CHAYES, F. (1960) On correlation between variables of constant sum: J. Geophys. Res. 65, 4185-
4193.
CHAYES, F. (1962) Numerical correlation and petrographic variation: J. Geology. 70, 440-552.
CHAYES, F. (1972). Effect of the proportion transformation on central tendency. J. Math. Geol. 4,
269-70.
CHAYES, F. and KRUSKAL, W. (1966) An approximate statistical test for correlation between
proportions: J. Geology, 74, 692-702.
CHAYES, F. and TROCHIMCZYk, J. (1978) The effect of closure on the structure of principlal
components: J. Math. Geol. 10, 323-333.
DARROCH, J. N. and JAMES, J. R. (1974). F-independence and null correlations of bounded sum
positive variables. J. R. Statist. Soc. B 36, 247-52.
DARROCH, J. N. and RATCLIFF, D. (1970). Null correlations for proportions II. J. Math. Geol. 2,
307-12..
DARROCH, J. N. and RATCLIFF, D. (1978). No association of proportions.. J. Math. Geol. 10, 361-
8..
GABRIEL, K. R. (1981). Biplot display of multivariate matrices for inspection of data and diagnosis.
In: V. Barnett, Ed., Interpreting Multivariate Data, Wiley, New York, 147-173.
GOWER, J. C. (1987). Introduction to ordination techniques, in Legendre, P. and Legendre, L., eds.,
Developments in Numerical Ecology: Springer-Verlag, Berlin, p. 3-64.
KAPTEYN, J. C. (1903). Skew Frequency Curves in Biology and Statistics: Astronomical Laboratory,
Groningen, Noordhoff.
117
References
KRUMBEIN, C. (1962). Open and closed number systems: stratigraphic mapping: Bull. Amer. Assoc.
Petrol. Geologists, 46, 322-37.
McALISTER, D. (1879). The law of the geometric mean: Proc. Roy. Soc. 29, 367-
MOSIMANN, J. E. (1963). On the compound negative binomial distribution and correlations among
inversely sampled pollen counts. Biometrika 50, 47-54.
PAWLOWSKY, V., OLEA, R. A., and DAVIS, J. C. (1995). Estimation of regionalized compositions:
a comparison of three methods: J. Math. Geol. 27, 105-48.
PEARSON, K. (1905). Das Fehlergetz und seine erallgemeinerungen durch Fechner und Pearson. A
rejoinder: Biometrika. 4, 169-212..
PEARSON, K. (1906). Skew frequency curves. A rejoinder to Professor Kapteyn: Biometrika.5, 168-
71.
REHDER, U. and ZIER, S. (2001). Comment on “Logratio analysis and compositional distance by
Aitchison et al. (2000)”: J. Math. Geol. 32.
RENNER, R.M. (1993) The resolution of a compositional data set into mixtures of fixed source
components. Applied Statistics. 42, 615-311.
STANLEY, C. R. (1990). Descriptive statistics for N-dimensional closed arrays: a spherical coordinate
approach, J. Math. Geol. 22, 933-56.
STEPHENS, M.A. (1982) Use of the von Mises distribution to analyze continuous proportions,
Biometrika 69, 197-203.
118
References
THOMAS, C. W. and AITCHISON, J. (1998). The use of logratios in subcompositional analysis and
geochemical discrimination of metamorphosed limestones from the northeast and central Scottish
Highlands. In: A. Buccianti, G. Nardi and R. Potenza, Eds., Proceedings of IAMG98, The Fourth
Annual Conference of the International Association for Mathematical Geology, De Frede, Naples,
549-554.
WATSON, D. F. and PHILIP, G. M. (1989). Measures of variability for geological data: J. Math.
Geol.. 21, 233-54.
WHITTEN, E. H. T. (1995). Open and closed compositional data in petrology: J. Math. Geol. 27, .789-
806.
WORONOW, A. (1997a). The elusive benefits of logratios. In: V. Pawlowsky-Glahn, Ed., Proceedings
of IAMG97, The Third Annual Conference of the International Association for Mathematical
Geology, CIMNE, Barcelona, 97-101.
WORONOW, A. (1997b). Regression and discrimination analysis using raw compositional data - is it
really a problem? In: V. Pawlowsky-Glahn, Ed., Proceedings of IAMG97, The Third Annual
Conference of the International Association for Mathematical Geology, CIMNE, Barcelona, 157-
162.
ZIER, U. and REHDER, S. (1998). Grain -size analysis –a closed data proble. In A. Buccianti, Nardi,
G. and Potenza, R., eds., Proceedings of IAMG'98 - The Fourth Annual Conference of the
International Association for Mathematical Geology: De Frede Editore, Napoli, p. 555-8..
119
120
Appendix Tables
Appendix Tables
___________________________________________
Specimen Percentages by weight of minerals
no A B C D E
___________________________________________
121
Appendix Tables
_____________________________________________
122
Appendix Tables
Table 1.1.2 (sand, silt, clay) compositions (percentages by weight) and water depth
(m) of 39 Arctic lake sediments
_____________________________________________
Sediment Percentages by weight Water
no sand silt clay depth
_____________________________________________
1 77.5 19.5 3.0 10.4
2 71.9 24.9 3.2 11.7
3 50.7 36.1 13.2 12.8
4 52.2 40.9 6.6 13.0
5 70.0 26.5 3.5 15.7
6 66.5 32.2 1.3 16.3
7 43.1 55.3 1.6 18.0
8 53.4 36.8 9.8 18.7
9 15.5 54.4 30.1 20.7
10 31.7 41.5 26.8 22.1
123
Appendix Tables
_________________________________________________________________
124
Appendix Tables
Table 1.1.4 Dietary compositions of the milk of 60, thirty in the control group and 30
in the treatment group (pr = protein, mf = milk fat, ch = carbohydrate)
125
Appendix Tables
126
Appendix Tables
127
Appendix Tables
128
Appendix Tables
________________________________________________________
Painting Proportions of total area assigned to colours
No black white blue red yellow other
________________________________________________________
129
Appendix Tables
______________________________________________
130
Appendix Tables
Pollutant
a b c d
131
Appendix Tables
a b c d e f
______________________________________________________________
1 0.125 0.353 0.266 0.163 0.031 0.181
2 0.143 0.224 0.313 0.111 0.051 0.159
3 0.147 0.231 0.303 0.058 0.129 0.133
4 0.164 0.209 0.282 0.120 0.047 0.178
5 0.197 0.151 0.299 0.132 0.033 0.188
6 0.157 0.256 0.246 0.072 0.116 0.153
7 0.153 0.232 0.282 0.101 0.062 0.170
8 0.115 0.249 0.259 0.176 0.025 0.176
9 0.178 0.167 0.347 0.048 0.143 0.118
10 0.164 0.183 0.281 0.158 0.027 0.186
11 0.175 0.211 0.283 0.070 0.104 0.157
12 0.168 0.192 0.305 0.120 0.044 0.171
13 0.155 0.251 0.257 0.091 0.085 0.161
14 0.126 0.273 0.269 0.045 0.156 0.131
15 0.199 0.170 0.318 0.080 0.076 0.158
16 0.163 0.196 0.335 0.107 0.054 0.144
17 0.136 0.185 0.304 0.162 0.020 0.193
18 0.184 0.152 0.350 0.110 0.039 0.165
19 0.169 0.207 0.300 0.111 0.057 0.156
20 0.146 0.240 0.250 0.141 0.038 0.184
21 0.200 0.172 0.313 0.059 0.120 0.136
22 0.135 0.225 0.217 0.217 0.019 0.187
______________________________________________________________
Column j
a b c d e f
_________________________________________
a 0 0.307 0.129 0.502 0.617 0.225
b -0.275 0 0.270 0.465 0.646 0.221
Row i c -0.605 -0.330 0 0.486 0.628 0.213
d 0.432 0.706 1.037 0 1.071 0.314
e 1.047 1.322 1.652 0.615 0 0.769
f -0.027 0.247 0.578 -0.459 -1.074 0
Estimates below the diagonal are of E(log(xj /xi ) and above the diagonal of var{log( xi / x j )}
132
Appendix Tables
Minerals compositions
Case qu or al an en ma il ap
______________________________________________________
1 62.02 10.99 10.15 2.58 1.15 0.040 1.23 0.010
2 61.13 11.76 11.59 2.30 1.05 0.160 1.25 0.050
3 59.17 13.25 12.94 2.58 0.92 0.260 1.14 0.070
4 58.93 11.23 11.50 2.66 1.47 0.070 1.31 0.020
5 53.79 14.36 15.56 3.74 1.74 0.070 1.12 0.050
6 52.54 14.30 15.22 4.37 1.24 0.090 1.18 0.140
7 55.20 13.17 14.46 2.96 1.82 0.330 1.41 0.090
8 52.78 14.06 13.45 3.76 1.47 0.320 1.41 0.090
9 57.90 10.69 10.74 2.86 1.39 0.010 1.58 0.020
10 54.19 12.05 13.53 3.06 1.34 0.330 1.52 0.090
11 53.22 13.12 13.79 3.34 1.84 0.390 1.41 0.120
12 55.38 11.99 12.69 2.20 1.44 0.480 1.46 0.120
13 51.24 14.30 14.55 3.59 1.64 0.290 1.39 0.120
14 50.15 14.18 15.56 3.59 2.37 0.300 1.35 0.120
15 50.97 13.06 14.97 3.53 2.37 0.460 1.48 0.140
16 52.39 13.29 13.62 3.34 1.67 0.510 1.42 0.120
17 54.92 10.87 12.01 2.35 1.52 0.250 1.71 0.120
18 54.01 10.87 12.69 3.01 1.07 0.190 1.63 0.090
19 51.99 11.40 12.18 2.83 1.87 0.450 1.65 0.070
20 52.95 9.63 12.09 1.58 1.74 0.460 1.88 0.170
21 52.79 9.45 10.83 1.97 1.94 0.390 1.99 0.090
______________________________________________________
qu: quartz or: orthoclase al: albite an: anorthite
en: enstatite ma: magnetite il: ilmenite ap: apatite
133
Appendix Tables
_________________________________________________________________
__________________________________________________________________
134