Volume 39(2)
Journal
of the
Arizona-Nevada Academy of Science
Special Issue: Psychology – Analysis and Measurement
Golden Anniversary Meeting
University of Arizona,Tucson, AZ
April 7, 8, 9, 2006
CONTENTS
Foreword ............................................................................................................................................. iii
Assortative Mating in the Jewel Wasp: 1. Female Matching of Eye-Color Genotype, Not Host-Feeding
Phenotype
Aurelio José Figueredo and Rebecca M. S. Sage ........................................................................... 51
Assortative Mating in the Jewel Wasp: 2. Sequential Canonical Analysis as an Exploratory Form of Path
Analysis
Aurelio José Figueredo, and Richard L. Gorsuch .......................................................................... 59
Differential Parental Investment in the Southwestern United States
Melinda F. Davis, Cordelia B. Guggenheim, Aurelio José Figueredo, and Catherine J. Locke ... 65
Sons Or Daughters: A Cross-Cultural Study Of Sex Ratio Biasing And Differential Parental Investment
Cordelia B. Guggenheim, Melinda F. Davis, and Aurelio José Figueredo ................................... 73
Sexual Risk Behavior Among Kenyan University Students
Mary B. Adam and Mike Mutungi ................................................................................................ 91
Likelihood-Evidential Support And Bayesian Analysis On A Prospective Cohort Of Children And
Adolescents With Mild Scoliosis Under Chiropractic Management
J. Michael Menke, Gregory Plaugher, Christina A. Carrari, Roger R. Coleman, Luca Vannetiello,
and Trent R. Bachman, .................................................................................................................. 99
2007
Published by the
ARIZONA-NEVADA ACADEMY OF SCIENCE
i
ARIZONA-NEVADA ACADEMY OF SCIENCE
OFFICERS FOR 2007-2008
CARLETON “BUCK” JONES, Midwestern University
CARLETON “BUCK” JONES, Midwestern University
INGRID NOVODVORSKY, University of Arizona
ELIZABETH HULL, Midwestern University
WILLIAM PERRY BAKER, University of Arizona
KAREN CONZELMAN, Glendale Community College
WILLIAM PERRY BAKER, University of Arizona
MICHAEL W. DIEHL, Desert Archaeology, Inc., Tucson
ROBERT REAVIS, Glendale Community College
AREGAI TECLE, Northern Arizona University
J-PETRINA MCCARTY-PUHL, Robert McQueen High School, Reno
President
President-Elect
Membership Secretary
Recording Secretary
Permanent Secretary
Treasurer
Editor
Director/Southern Arizona
Director/Central Arizona
Director/Northern Arizona
Director/Nevada
ASSOCIATE EDITORS
AREGAI TECLE, Northern Arizona University
JIM DEVOS, Arizona Game and Fish Department
GORDON JOHNSON, Northern Arizona University
LINDA SMITH, Glendale Community College
HENRI GRISSINO-MAYER, Valdosta State University
ROBERT REAVIS, Glendale Community College
CINDY D. ZISNER, Arizona State University
Hydrology/Geography
Conservation/Biology
Education/General Science
Anthropology
Geology
Advertising Editor
Production Editor
Subscription rates for institutions and libraries within the United States is $35.00 per year;
institutions and libraries outside the United States subscribe at the rate of $45.00 per year. To subscribe
please address correspondence to: Ingrid Novodvorsky, Membership Secretary - ANAS, Physics
Department, 1118 E 4th St, University of Arizona, Tucson, AZ 85721.
Individual memberships to the Academy (which includes a subscription to the Journal) are $35.00
per year within the United States and $45 per year for foreign members. Student membership is $15.00
per year (requires proof of student status). Family and patron memberships are also available. For
membership information, please see our web site at http://www.geo.arizona.edu/anas or contact the
membership secretary at the address listed above.
Single copies of the Journal are $6.00; special issues are $10. For a list of previous issues see our
web site at http://www.geo.arizona.edu/anas/publicatons.html.
EDITORIAL POLICY
The Journal of the Arizona-Nevada Academy of Science (ISSN 1533-6085) is published principally
by and for the members of the Arizona-Nevada Academy of Science. It is the intention of the Editorial
Board that the Journal shall serve all members, and publications are not restricted to formal, original
scientific papers. The Editorial Board will decide the material to be included in each issue and will use
its prerogative to edit all published items.
The Arizona-Nevada Academy of Science is a nonprofit organization, and publication costs for its
journal are subsidized in part by page charges. Page charges for ANAS members are $15 per page up to
eight pages and $30 per page for each additional page. Page charges for nonmembers are $30 per page.
Contributors without grant support may request waiver of page charges.
ii
FOREWORD
There have now been four psychology sessions at the ANAS annual meetings, and this issue of JANAS
is an introduction to psychological research in Arizona for ANAS members. This issue has several themes;
measurement, data analysis, and evolutionary psychology. The first four articles demonstrate an analytical approach that is intended for studies with multiple, correlated dependent variables that can be implemented using common statistical packages. The first two articles are likely to be of interest to biologists
and science educators, because they use this novel approach to tease out the genetic and developmental
factors influencing assortative mating in insects. While there is some overlap between Jewel Wasp 1 and
Jewel Wasp 2, they are designed for different readers. Jewel Wasp 1 addresses the study hypotheses, and
Jewel Wasp 2 focuses on the data analytic approach. The next two articles in this issue apply the same
analytic approach to examine the differential allocation of parental investment in humans according to the
sex of the offspring; in a sample of Arizona babies, and in a secondary analysis of a multinational dataset.
The availability of high quality datasets will undoubtedly be increasingly useful for researchers and educators. The final articles are health related, examining sexual risk behavior in university students, and
demonstrating an application of Bayesian techniques to the measurement of scoliosis.
The authors are all members of the Evaluation Group for the Analysis of Data (EGAD), led by Lee
Sechrest, in the Psychology Department at the University of Arizona, where A. J. Figueredo is also the
Director of the Graduate Program in Ethology and Evolutionary Psychology.
Contributions to this issue were primarily selected from presentations at the Psychology Session at the
50th Anniversary meeting of the Arizona Nevada Academy of Sciences, held in Tucson in 2006. The most
relevant contributions from that session are included in this publication. All submissions received external
peer reviews.
Melinda Davis, Chair, Psychology Section, Owen Davis, Past-President and Michael Menke, editors
iii
iv
ASSORTATIVE MATING IN THE JEWEL WASP 1 ! FIGUEREDO AND SAGE
51
ASSORTATIVE MATING IN THE JEWEL WASP: 1. FEMALE MATCHING OF
EYE-COLOR GENOTYPE, NOT HOST-FEEDING PHENOTYPE
AURELIO JOSÉ FIGUEREDO and REBECCA M. S. SAGE, Department of Psychology, University of Arizona,
Tucson, AZ 85721
ABSTRACT
An experiment was performed to test the influence of two factors on assortative mating in the Jewel wasp (Nasonia
vitripennis): (1) inherited eye-color genotype, and (2) conditioned foraging phenotype. To determine the female
choice of sires, genetically marked wasps were used of two recessive mutant eye-color genotypes: white or red. Each
female (of independently crossed eye-color and juvenile host) was given a choice of two males, one of each eye-color,
counterbalanced for juvenile host: blowfly (Sarcophaga bullata) or housefly (Musca domestica) pupae. Blowflyreared females produced more total offspring, indicating higher fertility. Red-eyed females produced more offspring
sired by red-eyed males, indicating assortative mating by eye color. Blowfly-reared females, however, did not produce
more offspring sired by blowfly-reared males, indicating no assortative mating by juvenile host.
INTRODUCTION
The Jewel wasp (Nasonia vitripennis) is a beneficial parasitoid insect which may feed on a variety of
species of two-winged flies (Diptera), such as blowflies
(Sarcophaga bullata) and houseflies (Musca domestica).
Like certain other insects, the Jewel wasp is behaviorally predisposed in adulthood to feed its own offspring
(by selective oviposition) on the particular host species
that it fed on as a juvenile, or larva (Figueredo 1987,
1989). For such insects, it has been proposed that such
behavioral feeding traditions may lead to mutual reproductive isolation between divergent feeding types, and
thus to sympatric speciation, which is the splitting of
one parent species into two or more daughter species
without the benefit of geographical isolation (e.g., Bush
1969, Futuyma and Mayer 1980, Berlocher and Feder
2002). In fact, host-specific reproductive isolation has
been specifically proposed as a potential mechanism for
sympatric speciation in the Jewel wasp (Smith and
Cornell 1979).
In the absence of geographical barriers, reproductive isolation can only occur by some form of assortative mating. Assortative mating is the tendency for
phenotypically similar individuals to mate. This tendency may be based on either one or several phenotypic
variations and may include both physical characteristics and behavioral characteristics. Assortative mating
can be distinguished from inbreeding in that inbreeding
involves mating with individuals which are genetically
similar at all or most variable loci whereas assortative
mating minimally involves similarity only at those loci
affecting variation in the traits for which the assortment
occurs. In nature, most assortative mating is positive,
favoring similarity over dissimilarity (Partridge 1983).
Although positive assortment can theoretically lead to
sympatric speciation (Bush 1969, Tauber and Tauber
1977a, 1977b), this potential consequence is believed
to be epiphenomenal and not is hypothesized to represent its evolved adaptive function.
Assortative mating may be passive or active. Passive assortative mating can occur as a result of any
persistent habitat preferences among individuals with
different host-plant feeding experience that might cause
them to breed in the same habitats in which they
developed (Papaj and Prokopy 1988, Feder et al. 1994),
as in the case of the Apple Maggot fly (Rhagoletis
pomonella). This habitat preference might be mediated
by differential attraction to volatile host-specific
semiochemicals, as in the case of the Apple Maggot Fly
(Forbes et al. 2005), the Apple fruit moth (Argyresthia
conjugella) (Bengtsson et al. 2006), and the pea aphid
(Acyrthosiphon pisum) (Del Campo et al. 2003). Segregation of host-specific phenotypes can also occur in
time as well as in space, as in the case of the European
Corn Borer (Ostrinia nubilalis Hübner), in which moths
feeding on different host plants emerge at different
times (Bethenod et al. 2005, Malausaet et al. 2005), or
in the case of Enchenopa treehoppers, in which eggs
hatch at different times on different host plants and
therefore have asynchronous life histories and temporal
mating windows (Wood and Keese 1990). In the latter
case, cohorts of treehoppers that were experimentally
synchronized in the laboratory showed no behavioral
tendencies towards assortative mating by host plant of
origin.
On the other hand, active assortative mating can
occur when individuals seek out sexual partners that are
genetically similar to themselves by phenotype matching (Rushton 1989, Rushton and Bons 2005). For
example, behavioral male preferences have been shown
to produce positive assortative mating based on genetically distinct female pheromone strains in the Cabbage
Looper moth (Trichoplusia ni) (Zhu, Chastain, Spohn,
FIGUEREDO, A. J., AND R. M. S. SAGE. 2007. ASSORTATIVE MATING IN THE JEWEL WASP: 1.
FEMALE MATCHING OF EYE-COLOR GENOTYPE, NOT HOST-FEEDING PHENOTYPE. JOURNAL OF
THE ARIZONA-NEVADA ACADEMY OF SCIENCE 39(2):51-57.
ASSORTATIVE MATING IN THE JEWEL WASP 1 ! FIGUEREDO AND SAGE
and Haynes 1997). In addition, active assortative mating
may also be based on traits that are not necessarily
genetic in origin. For example, behaviorally-mediated
assortative mating based on adult size has been shown in
water striders (Heteroptera: Gerridae) (Rowe and
Arnqvist 1996) and in meloid beetles (Lytta magister)
(Snead and Alcock 1985). Furthermore, active assortative mating may be based upon acquired characteristics
that are host-specific and based upon feeding experience. For example, in the case of the cactophilic fruit fly
(Drosophila mojavensis), assortative mating is based on
the distinct cuticular hydrocarbon compositions acquired
as adults by larvae raised on different hosts, producing
behaviorally-mediated reproductive isolation (Etges
1992, Brazner and Etges 1993, Etges and Ahrens 2001).
The development of host-specific cuticular hydrocarbons has also been documented in the case of the bethylid
wasp (Cephalonomia tarsalis Ashmead) (Howard 1998).
Thus, assortatively mating individuals are sometimes
able to discriminate potential sexual partners based on
host-specific chemosensory cues.
In the case of parasitoid feeding traditions, we need
to pay close attention to the proximate behavioral mechanisms presumably involved. Because of the observed
plasticity in foraging behavior, the putative mechanism
or behavioral predisposition for assortative mating are
not likely to be under direct genetic control because there
might be no simple correspondence between an
individual’s genotype and its host-feeding phenotype.
For the same reason, philopatric patterns of dispersal
(Shields 1982), the tendency to remain in the natal area
through adulthood, might be similarly ineffective mechanisms for assortative mating. The major candidate for a
proximate behavioral mechanism therefore appears to be
phenotype matching, as has been documented in the
monarch butterfly (Norton-Griffiths 1968).
There are several theoretical considerations to be
taken into account when proposing assortative mating
hypotheses for the Jewel wasp. There are ecological
factors which might mitigate against assortative mating
for host specificity. For example, the population densities of different species of hosts may vary significantly
and behaviorally-inflexible daughters might then suffer,
and the foraging opportunity costs of not finding a
preferred host might exceed the benefits of increased
offspring fecundity associated with ovipositing on the
preferred host. For the Jewel wasp, it has been found that
adult foraging experience may modify the behavioral
effects of juvenile feeding experience to track the relative local abundance of alternative host species (Figueredo
1987, 1989). Thus, it was concluded that sympatric
speciation based on host-specific assortative mating was
an unlikely result of such a lifelong learning mechanism.
On the other hand, the influence of juvenile feeding
experience on adult mate choice, per se, was not determined. Because adult experience might modify foraging
behavior without altering any sexual preferences im-
52
printed earlier, reproductive isolation by differential
juvenile feeding experience — affecting mate choice
directly, rather than indirectly through diet — was still
technically possible. It remained possible that Jewel
wasps might mate assortatively based on juvenile feeding experience. Alternatively, because blowfly-reared
females were found to be significantly more fecund
(Figueredo 1987, 1989), one might instead hypothesize
a possible species-typical disposition for mate choice to
favor males that were blowfly-reared to maintain that
host predisposition. Finally, there is also the possibility
of assortative mating for other phenotypic characteristics reflecting genetic differences that are unrelated to
feeding experience.
This paper reports the results of an experiment
which was designed to test the effects of juvenile feeding
experience on adult mate choice in the Jewel wasp. The
principal hypothesis was that adult female Jewel wasps,
when given a choice of males as sires for their offspring,
would assortatively mate with males reared on the same
juvenile host as themselves. This hypothesis was tested
in the context of an alternative hypothesis that adult
female Jewel wasps would assortatively mate, given the
same choice conditions, on the basis of an unrelated and
purely genetically determined phenotypic trait, such as
eye color. The latter finding would instead support
assortative mating based on genetic similarity (e.g.,
Rushton 1989, Rushton and Bons 2005).
METHODS
Subjects
A total of 85 adult female and 170 adult male Jewel
wasps (Nasonia vitripennis) were used in this mate
choice experiment. To accurately determine female
choice of male wasps as sires for their offspring, genetically marked male and female wasps were used. Two
recessive mutant genotypes were used, one for white
eyes and one for red eyes (technically called “oyster”
eyes and “scarlet” eyes), which, when crossed, produce
hybrids with brown eyes (called the “wild” phenotype).
Female wasps of both eye-color genotypes were randomly assigned to oviposit on pupae of either the
carrion blowfly (Sarcophaga bullata) or the common
housefly (Musca domestica). These initial ovipositing
exposures were carried out in 32x102 mm plastic culture vials stoppered with foam plugs. All host fly pupae
used were three to five days old as pupae and were used
directly after one day of preconditioning in the incubator at LD 12:12, 25C +/- 2C, and 80%RH +/- 10%RH.
Both the male and female offspring of the originally
ovipositing female Jewel wasps were used as experimental subjects for this study. Cultures of all three
species, including both mutant eye-color genotypes of
Jewel wasp, were originally obtained from Carolina
Biological Supply Company.
53
ASSORTATIVE MATING IN THE JEWEL WASP 1 ! FIGUEREDO AND SAGE
The experimental offspring were reared to maturity
in the incubator at the photoperiods, temperatures, and
humidities specified for host fly pupae. To obtain virgin
females for this study, male and female subjects had to be
separated from each other while they were still within the
pupal stage. This was necessary because the males typically eclose a day or two earlier than their sisters, and are
known to sometimes successfully inseminate female
pupae. Separation of the wasp pupae was performed by
carefully dissecting the pupal cases of the host flies that
they had developed in and then extracting and sexing the
individual wasp pupae under a microscope. This was a
very delicate procedure which sometimes damaged the
soft bodies of the immature wasp subjects. Moreover,
housefly pupae are generally much smaller than blowfly
pupae, and female Jewel wasps reared on housefly hosts
are also generally smaller than those reared on blowfly
hosts. The necessary handling therefore caused a certain
amount of differential mortality between subjects reared
on different juvenile host species. After eclosion, all
adult wasps were separately housed in 15x85 mm borosilicate glass tubes stoppered with cotton plugs and fed
on honey until the controlled mating and ovipositing
exposures to eliminate any effects of prior feeding experience on pupae other than the larval rearing host.
PROCEDURE
All mating exposures were conducted for a standard
period of 24 hours in separate 15x85 mm borosilicate
glass tubes stoppered with cotton plugs under the controlled environmental conditions specified above. Adult
male and female wasps were refrigerated at 6C for about
6 minutes each to facilitate handling without chemical
anaesthesia and then transferred into these tubes by
camel’s hair brush. After the mating exposures, separate
ovipositing exposures were conducted for another standard period of 24 hours in separate 32x102 mm plastic
culture vials stoppered with foam plugs under the same
controlled environmental conditions. Each adult female
wasp was separately transferred by camel’s hair brush
into an individual culture vial, each of which contained
one blowfly pupa for oviposition. All offspring of the
experimental females were reared to maturity in these
vials and then counted and classified as described below.
Experimental design
Within each test tube, every adult female was given
a choice of two adult males for mating, one of each eyecolor genotype, which were counterbalanced for the
particular host species, either blowfly or housefly, that
they had been reared on as larvae. The basic idea was that
one of these males would be blowfly-reared and that the
other would be housefly-reared, but that their eventual
offspring could only be distinguished from each other by
the particular eye-color genotype that was randomly
associated with each individual sire. Because the different mutant eye-color genotypes used might themselves
influence female mate choice, however, the experiment
was ultimately designed to test the relative influence of
two separate factors on assortative mating in the Jewel
wasp: (1) inherited eye-color genotype, and (2) conditioned foraging phenotype. Each adult male wasp in this
study therefore had both an eye-color genotype and an
independently assigned rearing-host phenotype. Females
of independently crossed eye-color genotype and juvenile feeding experience were also used.
Data encoding
The critical information in this study was provided
by the paternity of the female offspring produced by the
experimental matings, but had to be statistically controlled for various related and causally prior reproductive outcomes. Because these outcomes represented the
effects of various interdependent reproductive decisions
made by the ovipositing females, they were not amenable to experimental control. Therefore, several dependent variables were recorded for hierarchical statistical
analysis.
Because female Jewel wasps reared on housefly
hosts are significantly less fertile than those reared on
blowfly hosts (Figueredo 1987), we could expect significantly different total numbers of offspring (TOT), regardless of paternity, from subjects reared on different
juvenile hosts. Second, a variable number of Jewel wasp
offspring remain immature in a dormant condition, called
diapause, for up to six months past the normal fourteen
days (and, thus, the end of our experiment). The total
number of such dormant immature offspring (TIO) is
known to vary with certain environmental factors, such
as ambient temperatures and photoperiods, but might
also be affected in some unknown way by the local
mating conditions experienced by the mother. Because
these dormant larvae had not yet developed eyes of either
color, it was not possible to determine their paternity
without costly genetic analysis, perhaps producing
nonrandomly missing data. Third, male Jewel wasps are
always produced asexually by the mother (by parthenogenesis, or “virgin birth”) and, therefore, have no biological father at all. Thus, a female Jewel wasp that does
not mate with either of the males provided might produce
a clutch of all male offspring. The total number of male
offspring (TMO) could not be simply ignored in this
study because it might have indicated female rejection of
both males, and perhaps constituted more nonrandomly
missing data. These first three dependent variables, although of lesser theoretical interest, were modeled hierarchically as causally prior to the last two to be able to
statistically control any effect that they might have on the
remaining two dependent variables.
The two dependent variables of principal interest
were the total number of female offspring sired by red-
ASSORTATIVE MATING IN THE JEWEL WASP 1 ! FIGUEREDO AND SAGE
eyed, rather than white-eyed, males (TRFO), and the
total number of female offspring sired by blowfly-reared,
rather than housefly-reared, males (TBFO). Of course,
the specified eye colors and juvenile hosts referred to are
those of the fathers, rather than those of the daughters,
because all genetic hybrids had brown eyes, and all
posttest offspring were reared on the larger blowfly hosts
for convenience (given the greater number of offspring
typically produced therein). Because paternal genotypes
and juvenile hosts were also slightly correlated, by the
postoperative mortality described above, eye color was
modeled hierarchically as causally prior to juvenile host
in these two dependent variables (TRFO and TBFO).
For the same reasons, eye color was modeled hierarchically as causally prior to juvenile host in the two
corresponding independent variables (FG and FJ). Consistently in direction with TRFO and TBFO, female eyecolor genotype (FG) was coded 1 for red-eyed females
and 0 for white-eyed females; female juvenile rearing
host (FJ) was coded 1 for blowfly-reared females and 0
for housefly-reared females.
Statistical Analyses
Although the experiment seemed superficially quite
simple, there were several nontrivial complications. Due
to the differential mortality by rearing host alluded to
above, we wound up with somewhat more blowflyreared than housefly-reared female subjects. This also
produced a slight, though nonsignificant, negative correlation between the independent variables, female genotype (FG) and female juvenile host (FJ), due to the
random subject mortality across the two eye colors. Such
correlated independent variables are easily modeled by
hierarchical regression. To be conservative in our testing
of the effects of female host, however, we assigned
causal priority to female genotype. Moreover, we did not
hypothesize an interaction between these two factors.
The multiple dependent variables, however, presented problems that could not be handled by unaided
Multiple Regression/Correlation (MRC) techniques
(Cohen and Cohen 1983). Sequential canonical analysis
(SEQCA) was selected as the optimal analytical model
for the present data because it partitions the covariance
among multiple dependent variables sequentially, as a
hierarchical regression does, while maintaining their
separate identity (Gorsuch 1991, Gorsuch and Figueredo
1991). This method isolates the direct effects of the
independent variables, or interventions, sequentially on
each of the dependent variables or outcomes, controlling
for all indirect effects through the prior dependent variables or outcomes. The only theoretical guidance required is a tentative specification of the causal order
between the dependent variables (Figueredo and Gorsuch
2007). Because the two model predictor variables were
binary in form, this SEQCA can also be conceptualized
as mathematically equivalent to a “Step-Down” (i.e.,
54
hierarchical) Multivariate Analysis of Variance (MANOVA).
This statistical method has been used previously in the
study of oviposition in parasitoid wasps (Henneman et
al. 1995). A full theoretical consideration of the merits
and limitations of this statistical method in comparison
with related methods is provided in Figueredo and
Gorsuch (2007).
RESULTS
Descriptive Statistics
A total of 1072 offspring were produced by the
experimental wasps. Of these, 15 were diapausing larvae
at the end of the experiment and 313 were adult males.
Both of these offspring categories were of indeterminate
paternity and the diapausing larvae were also of undetermined sex. Of the remaining 744 adult female offspring,
256 were sired by red-eyed males and 488 were sired by
white-eyed males.
Inferential Statistics
Sequential canonical analysis was performed using
UNIMULT (Gorsuch 1991). All that was required for
running this multivariate model was the specification of
two hierarchies of causal priority, one for the two independent variables and another for the five dependent
variables. The theoretical justifications for these hierarchies of causal priority were detailed above. The PillaiBartlett “V” Statistic (PBV) for the whole model was .21,
F(10,158)= 1.86 (p=.05), which provided an overall
protected test of statistical significance. Table 1 displays
the overall tests of the proportions of the variance of each
of the five dependent variables accounted for by the
linear combinations of the two independent variables.
The SEQCA or “Step-Down” MANOVA also provided hierarchical significance tests for the separate
effects of each of the two independent variables on each
of the five dependent variables, as shown in Table 2.
Tabl e 1. Mult iple Correlat ions and Associat ed Test s
of Signif icance f or Each Dependent Variable in t he
Mult iv ariat e Model.
Variable
Effect Size
PBV
F(2,158)
p
TOT
R = . 25
0. 0 6
2. 6 8
0. 0 7
TIO
R = .14
0. 0 3
1.31
0. 3
TMO
R = . 08
0. 0 2
0. 6 5
n.s.
TRFO
R = .32
0. 1 5
7. 0 6
< 0.005
TBFO
R = .12
0. 0 2
0. 7 3
n.s.
ASSORTATIVE MATING IN THE JEWEL WASP 1 ! FIGUEREDO AND SAGE
55
Tabl e 2. Semipart ial Correlat ions and Associat ed
Test s of Signif icance f or Each Independent
Variable in t he Mult iv ariat e Model.
Variable
Effect
Size
PBV
F(1,158)
p
Tabl e 3. St andardized Regression C oef f icient s f or
E a c h In d e p e n d e n t Va r ia b le in t h e Mu lt iv a r ia t e
Model.
TOT
=
- .105*FG
+ .217*FJ
TIO
=
- .144*FG
+ .004*FJ
TMO =
- .080*FG
+ .016*FJ
TRFO =
.300*FG
- .092*FJ
TBFO =
.121*FG
+ .012*FJ
Dependent variable: TOT
FG
r =- .13
0.02
1.43
0.2
FJ
r = .22
0.05
3.93
0.05
Dependent variable: TIO
FG
r =- .14
0.03
2.62
0.1
FJ
r = .00
0
0
n.s.
Dependent variable: TMO
FG
r =- .08
0.02
1.25
0.3
FJ
r = .02
0
0.05
n.s.
Dependent variable: TRFO
FG
r = .31
0.14
13
<.001
FJ
r =- .09
0.01
1.12
0.3
Dependent variable: TBFO
FG
r = .12
0.02
1.44
0.2
FJ
r = .01
0
0.02
n.s.
The standardized regression coefficients (“betaweights”) obtained by SEQCA are shown in Table 3.
Thus, controlling for all the prior dependent variables, there were only two statistically significant effects
of the independent variables on the dependent variables.
These significant effects were: (1) the predicted effect of
female juvenile rearing host (FJ) on total number of
offspring (TOT), and (2) the effect of female genotype
(FG) on total number of female offspring sired by redeyed males (TRFO).
These findings indicate that: (1) blowfly-reared
experimental females produced more offspring in general than did housefly-reared females (beta=.217), and
(2) red-eyed experimental females produced more female offspring sired by red-eyed males than did whiteeyed females (beta=.300). The former is a previously
documented effect of rearing host (Figueredo 1987,
1989) and the latter constitutes evidence for positive
assortative mating for eye color. The mean number of
offspring in all categories combined produced by blowfly-reared females was 14.52 and that produced by
housefly-reared females was only 6.81, both regardless
of eye-color genotype. The mean number of female
offspring sired by red-eyed males produced by red-eyed
females was 4.46 and that produced by white-eyed
females was only 1.14, both regardless of juvenile rearing host.
When these means are statistically adjusted for the
effects of all prior dependent variables, they are 4.90 and
1.36, respectively, which represent an even larger difference.
The most important negative result of this analysis
is that female juvenile rearing host (FJ) had no significant effect on the total number of female offspring sired
by blowfly-reared males (TBFO). This latter result represents empirical evidence against the hypothesis that
there is positive assortative mating for juvenile rearing
host in the Jewel wasp. When these means are statistically adjusted for the effects of all prior dependent
variables, they are 4.09 and 4.15, respectively, which
would hardly indicate a difference of any consequence
even had it reached statistical significance.
DISCUSSION
Our study revealed no evidence of assortative mating by juvenile feeding experience in the Jewel wasp. It
is therefore unlikely that differential juvenile feeding
experience can lead to sympatric speciation in this particular species by either of the mechanisms that have
been proposed. The requisite reproductive isolation can
apparently not be produced by differential juvenile feeding experience either indirectly by permanently differentiated foraging behavior or directly by differentially
imprinted sexual preferences. Instead, assortative mating was shown to be greatly influenced by genetic
markers, such as eye color.
Like many other species, the Jewel wasp does
assortatively mate. Although the mutant eye-color genotypes used in this experiment were artificially selected
laboratory strains, assortment by eye color was probably
indicative of a preference for the greater set of genetic
similarities that presumably existed between the members of the same recessive mutant strains. The selective
breeding of these two recessive mutant strains was likely
to have resulted in homozygosity on a variety of other
ASSORTATIVE MATING IN THE JEWEL WASP 1 ! FIGUEREDO AND SAGE
traits as well as on eye color. It is therefore possible that
eye color, per se, was not even directly perceived or used
as markers by the wasps as proximate cue for mate
discrimination, but that the relevant genetic information
regarding relatedness was carried instead by other indicators of phenotypic similarity, such as olfactory cues.
Assortative mating is thought to function in the wild to
maintain specific homozygous genotypic adaptations
and reduce the costs of mating (Bateson 1983). Assortative mating tracks local selective forces and thus allows
local populations to become genetically better adapted to
local circumstances in which surviving genotypes have
been successful (see also Kawecki 1994 for other genetic
benefits). Why, then, do Jewel wasps not assortatively
mate by host-feeding experience? Should not similar
benefits accrue?
The advantage of specialization to a single host
species is that the parasitoid can maximize its adaptations to that particular host’s physiology and lifecycle
and thus reduce its competition with generalists. The
disadvantage is that complete dependence on a single
fluctuating host population may indirectly result in the
occasional extinction of local parasitoid populations.
The most likely reason for the absence of either natural
or sexual selection for more restricted host specificity in
the Jewel wasp might therefore reside in the nature of the
host fly life histories. Because the host flies feed on
ephemeral resources, such as animal carcasses, fly populations are locally unstable. As colonizers, the first pioneering fly species to find a favorable microenvironment
temporarily becomes the locally predominant species.
When these ephemeral resources are exhausted, however, the local population of that host species then
precipitously crashes. Proverbially, they breed like flies
and they die like flies (Figueredo 1987, 1989). Any
parasitoid specializing on such a host species will experience large shifts in host abundance. If there were any
genetic variance in the restrictedness of host selection by
the parasitoid, this fluctuating food supply could result in
the elimination of any more restricted feeding genotypes
and the survival of any less restricted feeding genotypes
which were able to utilize any of a variety of fly species
that colonized the local area.
As demonstrated in adult foraging behavior
(Figueredo 1987, 1989), Jewel wasps display a lifelong
conditional feeding strategy based on the relative abundances of alternative host species. The host species on
which an individual wasp was reared influences but does
not permanently limit its foraging options as an adult. A
more restricted host preference, whether environmental
or genetic in origin, would force the wasp to search until
a host of a specific type is found, even in cases where
acceptable substitutes were available in the immediate
vicinity. Assortative mating for host-feeding phenotype
would therefore impose an additional foraging cost on
the offspring that is evidently suboptimal in the adult. In
the absence of lifelong host fidelity in the adult (which
56
could lead to some microgeographic segregation and,
thus, to passive assortative mating), the most plausible
mechanism remaining for sympatric speciation was thus
active assortative mating for prior host-feeding experience. This hypothesis was disconfirmed. Although Jewel
wasp females were found to be mating assortatively
based on manipulated markers of genetic relatedness
(presumably indicated by phenotypic similarity), they
were not found to be mating assortatively based on hostfeeding phenotype. We can only conclude that both of
the plausible mechanisms proposed for reproductive
isolation leading to sympatric speciation, passive assortative mating by permanently differentiated foraging
behavior and active assortative mating by differentially
imprinted sexual preferences, are absent in the Jewel
wasp. Behaviorally flexible parasitoid species like the
Jewel wasp are thus likely to remain polyphagous.
LITERATURE CITED
BATESON, P. 1983. Mate choice. Cambridge University
Press, London, UK.
BENGTSSON, M., G. JAASTAD, G. KNUDSEN, S. KOBRO, A. C.
BÄCKMAN, E. PETTERSSON, AND P. W ITZGALL. 2006.
Plant volatiles mediate attraction to host and non-host
plant in apple fruit moth, Argyresthia conjugella.
Entomologia Experimentalis et Applicata 118(1):77–
85.
BERLOCHER, S. H., AND J. L. FEDER. 2002. Sympatric speciation in phytophagous insects: Moving beyond controversy? Annual Review of Entomology 47:773-815.
BETHENOD, M. T., Y. THOMAS, F. ROUSSET, B. FRÉROT, L.
PÉLOZUELO, G. GENESTIER, AND D. BOURGUET. 2005.
Genetic isolation between two sympatric host plant
races of the European corn borer, Ostrinia nubilalis
Hübner. II: assortative mating and host-plant preferences for oviposition. Heredity 94: 264–270.
BRAZNER, J. C., AND W. J. ETGES. 1993. Premating isolation
is determined by larval rearing substrates in cactophilic
Drosophila mojavensis. II. Effects of larval substrates
on time to copulation, mate choice and mating propensity. Evolutionary Ecology 7:605–624.
BUSH, G. L. 1969. Mating behavior, host specificity and the
ecological significance of sibling species in frugivorous flies of the genus Rhagoletis (Diptera: Tephritidae.
American Naturalist 103:669-672.
COHEN, J., AND P. COHEN. 1983. Applied multiple regression/
correlation analysis for the behavioral sciences.
Lawrence Erlbaum, Hillsdale, NJ.
DEL CAMPO, M. L., S. VIA, AND M. C. CAILLAUD. 2003.
Recognition of host-specific chemical stimulants in
two sympatric host races of the pea aphid Acyrthosiphon
pisum. Ecological Entomology 28(4):405-412.
ETGES, W. J. 1992. Premating isolation is determined by
larval substrates in cactophilic Drosophila mojavensis.
Evolution 46:1945–1950.
ETGES, W. J., AND M. A. AHRENS. 2001. Premating isolation
is determined by larval-rearing substrates in cactophilic
Drosophila mojavensis. V. Deep geographic variation
in epicuticular hydrocarbons among isolated populations. American Naturalist 158:585–598.
57
ASSORTATIVE MATING IN THE JEWEL WASP 1 ! FIGUEREDO AND SAGE
FEDER, J. L., S. B. OPP, B. WLAZLO, K. REYNOLDS, W. GO, AND
S. SPISAK. 1994. Host fidelity is an effective premating
barrier between sympatric races of the apple maggot fly.
Proceedings of the National Academy of Sciences 91:79907994.
FIGUEREDO, A. J. 1987. The statistical measurement, developmental mechanisms, and adaptive ecological functions of
conditioned host selection in the parasitoid Jewel wasp.
Dissertation, University of California, Riverside, CA.
FIGUEREDO, A. J. 1989. Host-cue information processing by
foraging Jewel wasps: Response-bias tracking of expected host-species encounter rates, not modified
profitabilities. Psychobiology 17:435-444.
FIGUEREDO, A. J. AND R. GORSUCH. 2007. Assortative mating in
the jewel wasp: 2. Sequential canonical analysis as an
exploratory form of path analysis. Journal of the Arizona
Nevada Academy of Sciences, 39:59-64.
FORBES, A. A., J. FISHER, AND J. L. FEDER. 2005. Habitat
avoidance: Overlooking an important aspect of hostspecific mating and sympatric speciation? Evolution
59(7):1552–1559.
FUTUYMA, D. J., AND G. C. MAYER. 1980. Non-allopatric speciation in animals. Systematic Zoology 29(3):254-271.
GORSUCH, R. L., and A. J. FIGUEREDO. 1991. Sequential canonical analysis as an exploratory form of path analysis. Paper.
American Evaluation Association Conference, Chicago,
Illinois.
GORSUCH, R. L. 1991. UniMult: For univariate and multivariate data analysis. UniMult, Altadena, CA.
HENNEMAN, M. L., D. R. PAPAJ, A. J. FIGUEREDO, AND L. E. M.
VETT. 1995. Egg-laying experience and acceptance of
parasitized hosts by the parasitoid, Leptopilina heterotoma
(Hymenoptera: Eucoilidae). Journal of Insect Behavior
8:331-342.
HOWARD, R. W. 1998. Ontogenetic, reproductive, and nutritional effects on the cuticular hydrocarbons of the hostspecific ectoparasitoid Cephalonomia tarsalis (Hymenoptera: Bethylidae). Annals of the Entomological
Society of America 91(1):101-112.
KAWECKI, T. J. 1994. Accumulation of deleterious mutations
and the evolutionary cost of being a generalist. The
American Naturalist 144(5):833-838.
MALAUSA, T., M. T. BETHENOD, A. BONTEMPS, D. BOURGUET, J.
M. CORNUET, AND S. PONSARD. 2005. Assortative mating in
sympatric host races of the European corn borer. Science
308(5719):258-260.
NORTON-GRIFFITHS, M. 1968. The feeding behavior of the
Oystercatcher, Haematatopus ostralegus. D. Phil. Thesis.
University of Oxford.
PAPAJ, D. R., AND R. J. PROKOPY. 1988. The effect of prior adult
experience on components of habitat preference in the
apple maggot fly (Rhagoletis pomonella). Oecologia
76(4):538-543
PARTRIDGE, L. 1983. Non-random mating and offspring fitness.
Pp. 227-256 in P. Bateson ed., Mate choice. Cambridge
University Press, London, UK.
ROWE, L., AND G. ARNQVIST. 1996. Analysis of the causal
components of assortative mating in water striders. Behavioral Ecology and Sociobiology 38:279–286.
RUSHTON, J. P. 1989. Genetic similarity, human altruism, and
group selection. Behavioral and Brain Sciences 12:503559.
RUSHTON, J. P., AND T. A. BONS. 2005. Mate Choice and
Friendship in Twins: Evidence for Genetic Similarity.
Psychological Science 16:555-559.
SHIELDS, W. M. 1982. Philopatry, inbreeding, and the evolution
of sex. State University of New York Press, Albany, NY.
SMITH, M. A., AND H. V. CORNELL. 1979. Hopkins host-selection
in Nasonia vitripennis and its implications for sympatric
speciation. Animal Behavior 27:365-370.
SNEAD, J. S., AND J. ALCOCK. 1985. Aggregation formation and
assortative mating in two meloid beetles. Evolution
39(5):1123-1131.
TAUBER, C. A., AND M. J. TAUBER. 1977a. Sympatric speciation
based on allelic changes at three loci: Evidence from
natural populations in two habitats. Science 197:12981299.
TAUBER, C. A., AND M. J. TAUBER. 1977b. A genetic model for
sympatric speciation through habitat diversification and
seasonal isolation. Nature 268:702-705.
WOOD, T. K., AND M. C. KEESE. 1990. Host-plant-induced
assortative mating in Enchenopa treehoppers. Evolution
44(3):619-628.
ZHU, J., B. B. CHASTAIN, B. G. SPOHN, AND K. F. HAYNES. 1997.
Assortative mating in two pheromone strains of the cabbage looper moth, Trichoplusia ni. Journal of Insect
Behavior 10(6):805-817.
ASSORTATIVE MATING IN THE JEWEL WASP 1 ! FIGUEREDO AND SAGE
58
ASSORTATIVE MATING IN THE JEWEL WASP 2 ! FIGUEREDO AND GORSUCH
59
ASSORTATIVE MATING IN THE JEWEL WASP: 2. SEQUENTIAL
CANONICAL ANALYSIS AS AN
EXPLORATORY FORM OF PATH ANALYSIS
AURELIO JOSÉ FIGUEREDO, Department of Psychology, University of Arizona, Tucson, AZ 85721; and
RICHARD L. GORSUCH, Graduate School of Psychology, Fuller Theological Seminary, Pasadena, CA
91182
ABSTRACT
This paper discusses the conceptual and mathematical relationships between three statistical models: 1)
traditional, or “Simultaneous”, Canonical Analysis, 2) hierarchical, or “Sequential”, Canonical Analysis, and 3)
Structural Equations Modeling, or confirmatory path analysis. The advantages of Sequential over Simultaneous
Canonical Analysis are reviewed. The relationships between Sequential Canonical Analysis and Path Analysis are
explored. The need for a legitimate exploratory form of path analysis, analogous to existing exploratory forms of both
multiple regression and factor analysis, is discussed. A logical extension of Sequential Canonical Analysis is proposed
as adequately serving the function of an exploratory path analysis. Empirical data from psychological research is used
to illustrate and qualitatively compare and contrast the results of these three approaches.
INTRODUCTION
It is often the case that the multiple outcomes of an
experimental treatment or manipulation need to be
assessed. Moreover, these dependent variables are often selected to measure the impact of the treatment on
several conceptually distinct outcomes, rather than converge upon a single construct. Thus, multivariate data
reduction methods, such as common factor modeling,
may not be appropriate. On the other hand, because the
intervention may exert a common causal influence on
these outcomes, the dependent variables are likely to be
at least spuriously correlated with each other. In addition, the multiple outcomes may also subsequently
exert various causal influences on each other. Thus,
separate causal analyses for each of these dependent
variables may also not be appropriate.
Fortunately, there exist several statistical procedures for the analysis of multiple correlated dependent
variables. One of these is structural equations modeling
(SEM), or confirmatory path analysis, in which the
hypothesized causal network between outcomes can be
fully specified, estimated, and tested (Bentler 1989).
This method, however, requires the guidance of a
strong causal theory which is often not available. Another multivariate method is simultaneous canonical
analysis (SIMCA), which requires little theory. This
method, however, produces empirically-derived linear
composites of dependent variables, or “canonical variates”, which are often difficult to interpret pragmatically in terms of the concrete outcomes that we are
interested in. The linear composites are statistically
defined, but are dependent upon the exact data set and
so may not be replicated if the next study uses a slightly
different set of variables.
A third alternative is sequential canonical analysis
(SEQCA), which combines some of the advantages of
both (Gorsuch 1991, Gorsuch and Figueredo 1991).
Rather than combining all dependent variables into
uninterpretable linear composites, as a simultaneous
regression does for independent variables, it partitions
their covariance sequentially, as a hierarchical regression does, while maintaining their separate identity.
This method isolates the direct effects of the independent variables, or interventions, sequentially on each of
the dependent variables, or outcomes, controlling for
all indirect effects through the prior dependent variables, or outcomes. Because the only theoretical guidance required is a tentative specification of a meaningful
order among the dependent variables, we propose that
this sequential method can be developed into an exploratory form of path analysis.
The purpose of this paper is to explore the properties of sequential canonical analysis both in relation to
those of confirmatory path analysis and simultaneous
canonical analysis, and as a potential model for exploratory path analysis. To do this, we will use an example
from insect psychobiology which has the advantage of
clearly illustrating these principles with very concrete,
“brass tacks”, numbers.
ASSORTATIVE MATING IN THE
JEWEL WASP
The natural history of the Jewel wasp and the
psychobiological rationale for this study is described in
Figueredo and Sage (2007). Basically, this experiment
was designed to test the effects of juvenile feeding
experience on adult mate choice in the Jewel wasp. To
FIGUEREDO, A. J., AND R. GORSUCH. 2007. ASSORTATIVE MATING IN THE JEWEL WASP: 2.
SEQUENTIAL CANONICAL ANALYSIS AS AN EXPLORATORY FORM OF PATH ANALYSIS.
JOURNAL OF THE ARIZONA-NEVADA ACADEMY OF SCIENCE 39(2):59-64.
ASSORTATIVE MATING IN THE JEWEL WASP 2 ! FIGUEREDO AND GORSUCH
accurately determine the female choice of male wasps as
sires for their offspring, genetically marked wasps were
used. Two recessive mutant genotypes were used, one
for white eyes and one for red eyes, which, when crossed,
produce hybrids with brown eyes. Each female was
given a choice of two males, one of each eye-color
genotype (which were counterbalanced for the particular
prey species, either housefly or blowfly, that they had
been fed on as larvae). Females of independently crossed
eye-color genotype and juvenile feeding experience were
also used. Because these different genotypes might themselves influence mate choice, the experiment was designed to test the relative influence of two factors, 1)
inherited eye-color genotype, and 2) conditioned foraging phenotype, on assortative mating in the Jewel wasp.
We wound up with a slight, though nonsignificant,
negative correlation between the independent variables,
female genotype (FG) and female juvenile host (FJ), due
to postoperative differential subject mortality. Such correlated independent variables are easily modeled by
hierarchical partitioning of variance. To be conservative
in our testing of the effects of female host, we assigned
causal priority to female genotype. Moreover, we did not
hypothesize an interaction between these two factors.
The dependent variables, however, presented problems that could not be handled by unaided multiple
regression. First, female Jewel wasps that feed on housefly hosts are significantly less fertile than those that feed
on blowfly hosts (Figueredo 1987, 1989). This means
that we could expect significantly different total numbers of offspring (TOT), regardless of paternity, from
subjects reared on different juvenile hosts. Second, a
variable number of Jewel wasp offspring remain immature in a dormant condition, called diapause, for up to six
months past the normal fourteen days (and, thus, the end
of our experiment). The total number of such dormant
immature offspring (TIO) is known to vary with certain
environmental factors, such as ambient temperatures and
photoperiods, but might also be affected in some unknown way by the local mating conditions experienced
by the mother. Because these dormant larvae had not yet
developed eyes of either color, it was not possible to
determine their paternity, perhaps producing nonrandom missing data. Third, male Jewel wasps are always
produced asexually by the mother and, therefore, have
no biological father at all. Thus, a female Jewel wasp that
does not mate with either of the males provided might
produce a clutch of all male offspring. The total number
of male offspring (TMO) could not be simply ignored in
this study because it might have indicated female rejection of both males, and perhaps constituted more nonrandom missing data.
Finally, the critical information in this study was
provided by the paternity of the female offspring, but had
to be controlled for all of the above causally prior
offspring outcomes. Thus, the two dependent variables
of principal interest were the total number of female
60
offspring sired by red-eyed, rather than white-eyed, males
(TRFO), and the total number of female offspring sired
by blowfly-reared, rather than housefly-reared, males
(TBFO). In the interests of brevity, these female offspring were, respectively, referred to as “red-sired female
offspring”, “white-sired female offspring”, “blowflysired female offspring”, and “housefly-sired female offspring”. Of course, the specified eye colors and juvenile
hosts referred to are those of the fathers, rather than those
of the daughters, because all genetic hybrids had brown
eyes, and all posttest offspring were reared on the larger
blowfly hosts for convenience. Because paternal genotypes and juvenile hosts were also slightly correlated, by
the postoperative mortality noted above, eye color was
modeled as causally prior to juvenile host in these two
dependent variables, for the same reasons that they were
in the two corresponding independent variables.
ALTERNATIVE METHODS OF ANALYSIS
The question then becomes, precisely how do we
accomplish the requisite feats of multivariate statistical
control? To expedite our discussion, we report the results
of the most uninformative statistical model first, namely,
simultaneous canonical analysis (SIMCA). This analysis
was performed using PROC CANCORR in SAS (SAS
Institute 1989). SIMCA constructed two pairs of canonical variates, V1 and W1, and V2 and W2, to represent our
manifest variables. Pillai’s Trace for the whole model
was .21, F(10,158)= 1.856, p= .055. Furthermore, V1
was correlated to W1 by a canonical correlation of .42,
which was statistically significant (F(10,156)= 1.89, p=
.05); V2, however, was correlated to W2 by a canonical
correlation of .17, which was statistically nonsignificant
(F(4,79)= 0.61, p= .66). Presumably, we could therefore
proceed to interpret V1 and W1, and forget about V2 and
W2. Fortunately, V1 and W1 were defined by the following linear equations, using the standardized canonical
coefficients, as seen in Table 1 above.
Using SIMCA, we were also helpfully provided
with the resulting indirect correlations between W1 and
the independent manifest variables (FG and FJ) and
between V1 and the dependent manifest variables (TOT
through TBFO), as well as all the corresponding estimates for the nonsignificant pair of canonical variates,
V2 and W2, in both their raw and standardized forms.
Unfortunately, in spite of this apparent wealth of information, we still had some trouble understanding what any
of this meant for the results of our experiment. How did
some linear combination of female genotype and female
juvenile host somehow produce some other linear combination of the total numbers of various types of offspring?
A more informative approach was structural equations modeling (SEM), or confirmatory path analysis.
This analysis was done using EQS (Bentler 1989). Be-
ASSORTATIVE MATING IN THE JEWEL WASP 2 ! FIGUEREDO AND GORSUCH
61
Tabl e 1. Linear equat ions f or dependent canonical v ariat es V1 and W1.
V1 =
W1 =
- 1.015*TOT
- 0.082*TIO
+ .046*TMO
+ .919*TRFO
0.864*FG
- 0. 41 3 * F J
- . 1 05 * F G
+.218*FJ
- .117*FG
+.004*FJ
- .054*FG
+.012*FJ
+ .263*FG
- .090*FJ
+ .123*FG
+.004*FJ
+ 0.274*TBFO
Tabl e 2. St andardized pat h coef f icient s f or sat urat ed st ruct ural model.
TOT =
TIO =
.579*TOT
TMO =
.760*TOT
- .022*TIO
TRFO =
.856*TOT
- .255*TIO
- .241*TMO
TBFO =
.753*TOT
- .103*TIO
- .307*TMO
- .083TRFO
Tabl e 3. St andardized pat h coef f icient s f or rest rict ed model wit h non-signif icant pat hways omit t ed.
TOT =
+.230*FJ
TIO =
.595*TOT
TMO =
.756*TOT
TRFO =
.646*TOT
TBFO =
.630*TOT
- .247*TIO
+.281*FG
- .296*TMO
cause we had no strong causal theory to specify the
structural relations between the multiple dependent variables, a fully saturated structural model was run, freely
estimating every causal pathway possible between every
dependent variable, specifying only the presumed causal
priority. This method has also been referred to as the
cascade model in cognitive psychology (e.g., Mouyi
2006, Demetriou et al. 2002). This specification produced a saturated structural model perfectly reproducing
the data, with a residual chi-squared of zero, based on
zero degrees of freedom. The standardized path coefficients are summarized in Table 2 above.
Using the statistical tests reported in the EQS output,
we tested each of these path coefficients against zero. We
then eliminated nonsignificant causal pathways to develop a more restricted model in which all the remaining
pathways were statistically significant. Not surprisingly,
this model was statistically acceptable: the chi-squared
was 11.06 based on 14 degrees of freedom (p = .68). The
standardized path coefficients for this model are described in Table 3.
Such brazenly exploratory use of SEM, however, is
technically inappropriate. The probability of the model
chi-squared reported, for example, no longer represented a prior probability under the null hypothesis, due
to the many empirical model respecifications. Another
disadvantage was that SEM did not provide for performing hierarchical tests of significance, except through
such nested model comparisons, which required running
multiple causal models. Of course, such results could be
obtained by running a series of separate hierarchical
regressions, as specified by the structural equations
written above. This piecemeal method had the disadvantage of requiring a separate regression model for each of
the dependent variables and, thus, still providing no valid
overall protective test of significance. This alternative
procedure was performed first using PROC GLM in SAS
(SAS Institute 1989), and then using UNIMULT (Gorsuch
1991, Gorsuch and Figueredo 1991), and, in spite of the
algorithmic differences in parameter estimation, both
produced nearly identical results to each other. Furthermore, this alternative procedure has the practical advan-
ASSORTATIVE MATING IN THE JEWEL WASP 2 ! FIGUEREDO AND GORSUCH
tage of not requiring any specialized software to perform. Any standard software package (such as SAS or
SPSS) that can estimate hierarchical multiple regressions (using Type I sums of squares) can be used.
In spite of all these limitations, at least SEM provided us with some clue as to what is going on. Controlling for all significant prior dependent variables, it
appeared that there were only two significant causal
effects of our two independent variables. As indicated by
the positive effect of FJ on TOT, blowfly-reared females
produced more total offspring than housefly-reared females, indicating their generally higher fertility, as was
expected. As indicated by the positive effect of FG on
TRFO, red-eyed females produced more red-sired female offspring than white-eyed females, indicating assortative mating by eye color. Contrary to the principal
experimental hypothesis, however, blowfly-reared females apparently produced no more blowfly-sired female offspring than housefly-reared females, indicating
no assortative mating by juvenile host. This was indicated by the lack of a significant causal effect of FJ on
TBFO.
The third multivariate method used on this data was
sequential canonical analysis (SEQCA). This analysis
was performed using UNIMULT (Gorsuch 1991). All
that was required for running this model was the specification of two hierarchies of causal priority, one for the
two independent variables and another for the five dependent variables. The Pillai-Bartlett V for the whole
model was .21, F(10,158)= 1.86, p= .05, which was the
same, within rounding error, as the corresponding SIMCA
results. In addition, we also obtained overall tests of the
proportions of the variance of each of the five dependent
variables accounted for by the linear combinations of the
two independent variables, which are shown in Table 4.
SEQCA also provided the following hierarchical
significance tests for the separate effects of each of the
two independent variables on each of the five dependent
variables and these are shown in Table 5.
Tabl e 4. Mult iple Correlat ions and Associat ed Test s
of Signif icance f or Each Dependent Variable in t he
Mult iv ariat e Model.
Variable
Effect Size
PBV
F(2,158)
p
TOT
R = . 25
0. 0 6
2.68
0.07
TIO
R = .14
0.03
1.31
0. 3
TMO
R = . 08
0.02
0.65
n.s.
TRFO
R = .32
0. 1 5
7.06
< 0.005
TBFO
R = .12
0. 0 2
0.73
n.s.
62
Tabl e 5. Semipart ial Correlat ions and Associat ed
Test s of Signif icance f or Each Independent
Variable in t he Mult iv ariat e Model.
Variable
Effect Size
PBV
F(1,158)
p
Dependent variable: TOT
FG
r =- .13
0.02
1 . 43
0.2
FJ
r = .22
0.05
3.93
0.05
Dependent variable: TIO
FG
r =- .14
0.03
2.62
0. 1
FJ
r = .00
0
0
n.s.
Dependent variable: TMO
FG
r =- .08
0.02
1.25
0.3
FJ
r = .02
0
0. 05
n.s.
Dependent variable: TRFO
FG
r = .31
0. 1 4
13
<.001
FJ
r =- .09
0.01
1.12
0.3
Dependent variable: TBFO
FG
r = .12
0.02
1.44
0. 2
FJ
r = .01
0
0.02
n.s.
Note that the two statistically significant effects of
the independent variables on the dependent variables
were the same as those identified by SEM. Controlling
for all the prior dependent variables, the standardized
regression coefficients obtained by SEQCA were also
very similar to those produced by SEM, but tended to
become somewhat higher as one stepped sequentially
down the causal hierarchy of dependent variables. These
slightly higher parameter estimates were a systematic
property of the current UNIMULT implementation of
SEQCA, not an artifact of slightly different estimation
algorithms. Recall that when the separate multiple regressions for the structural equations model were performed by UNIMULT, nearly identical parameter
estimates were obtained. The SEQCA estimates were
systematically higher because UNIMULT currently uses
what Cohen and Cohen (1983) referred to as the “partial”
rather than the “semipartial” residual correlations used
by SEM. In the current version of UNIMULT, SEQCA
residualized the denominator, or total variance to be
explained (including the error), as well as the numerator,
or portion of variance actually explained, of each of the
sequential dependent variables on all the prior ones. This
affected significance testing, as well as parameter estimation, by increasing the power of the tests. For com-
ASSORTATIVE MATING IN THE JEWEL WASP 2 ! FIGUEREDO AND GORSUCH
63
Tabl e 6. St andardized Regression Coef f icient s f or
E a c h In d e p e n d e n t Va r ia b le in t h e Mu lt iv a r ia t e
Model.
TOT
=
- . 1 05 * F G
+ .217*FJ
TIO
=
- . 1 44* F G
+ .004*FJ
TMO =
- . 08 0* F G
+ .016*FJ
TRFO =
. 3 00* F G
- .092*FJ
TBFO =
. 1 21 * F G
+ .012*FJ
parison with those obtained by SEM, the SEQCA standardized estimates are reported in Table 6.
The UNIMULT implementation of SEQCA does
not explicitly provide estimates of the sequential effects
of the dependent variables upon each other. Doing so
would provide an alternative implementation of SEQCA
that would be fully equivalent to an exploratory path
analysis by explicitly including estimates of sequential
effects between dependent variables. Such a model would
restore the status of hierarchically residualized correlations as the “semipartials” (Cohen and Cohen 1983)
typically estimated in SEM. This new model would have
the added advantages of both hierarchical partitioning of
variance and protective overall tests of significance,
such as the Pillai-Bartlett V statistic, so important to
exploratory data analysis. It would also help legitimize
an alternative path analytic model for empirically-assisted theory development, instead of perpetuating the
widespread abuse of available structural models that are
clearly intended exclusively for theory confirmation. As
it stands, as illustrated by our Jewel wasp example,
SEQCA represents a useful diagnostic tool for isolating
and identifying the direct effects of independent variables on a multiplicity of correlated dependent variables,
if not for estimating the magnitudes of these effects in the
conventional way. In either case, parameter estimation
should never be based on the initially saturated model
used for data exploration because the probable inclusion
of nonsignificant variables substantially reduces the
efficiency of estimation.
Superior estimates can be obtained by running the
various final structural equations, as respecified by protected and hierarchical significance testing, as separate
multiple regressions after the initial exploration of the
data, as was briefly described above.
One potential disadvantage of SEQCA in relation to
the alternative methods is that it requires the specialized
UNIMULT software (Gorsuch 1991) to perform. Armed
with the basic theory behind SEQCA, however, one may
use the more commonly available SAS software (SAS
Institute 1989), or even SPSS, to perform a series of
hierarchical regressions in which multiple dependent
criterion variables are analyzed sequentially according
to a hypothesized causal order. These dependent criterion variables can be entered sequentially into a system
of multiple regression equations with each hierarchically
prior criterion variable entered as the first predictor for
the next, as we did in the SEM cascade model presented
above. Each successive dependent variable can be predicted from an initial set of ordered predictor variables,
each time entering the immediately preceding dependent
variable hierarchically as the first predictor, then entering all the ordered predictors from the previous regression equation. Thus, each successive regression enters
all of the preceding dependent variables in reverse causal
order to statistically control for any indirect effects that
might be transmitted through them. Within this analytical scheme, as with SEQCA, the estimated effect of each
predictor is limited to its direct effect on each of the
successive dependent variables. The general format for
this system of hierarchical multiple regressions is as
shown in Table 7 below.
Tabl e 7. G eneral f orm at f or m ult iple dependent
crit erion v ariables analyzed sequent ially according
t o a hypot hesized causal order.
β 1X1
+β 2X2
+β 3X3
β 4Y4
+β 1X1
+β 2X2
+β 3X3
+β 4Y4
+β 1X1
+β 2X2
+β 3X3
Y4=
Y5=
Y6=
β 5Y5
Where X1, X2 and X3 are the ordered predictor
variables and Y4, Y5, and Y6 are the ordered criterion
variables, numbered consecutively after the predictors to
avoid confusion among the subscripts. What is lost by
using this method, as opposed to SEQCA, is the protective overall test of significance. What is gained by this
method, as opposed to SEQCA, is obtaining estimates of
the sequential effects among dependent variables. Thus,
this is superior to merely including all the prior dependent variables as “covariates”. What is gained as opposed SEM is the ability to perform hierarchical
partitioning of variance and hypothesis testing, as in
SEQCA. In addition, one avoids compromising the purely
confirmatory nature of the SEM tests of whole-model
goodness-of-fit.
SUBSTANTIVE AND METHODOLOGICAL
CONCLUSIONS
To finish our story, our study revealed no evidence
of assortative mating by juvenile feeding experience in
the Jewel wasp. It is therefore unlikely that differential
juvenile feeding experience can lead to sympatric speciation in this particular species by either of the mecha-
ASSORTATIVE MATING IN THE JEWEL WASP 2 ! FIGUEREDO AND GORSUCH
nisms that have been proposed. The requisite reproductive isolation can apparently not be produced by differential juvenile feeding experience either indirectly by
permanently differentiated foraging behavior or directly
by differentially imprinted sexual preferences. Instead,
assortative mating was shown to be greatly influenced by
genetic markers, such as eye color. Converging evidence
for these conclusions was variously obtained by hierarchical multiple regression, conventional path analysis,
and sequential canonical analysis. Finally, the methodological implications of these findings for research are as
follows. It was shown how the separate direct effects of
the experimental manipulations on a set of seemingly
hopelessly interdependent outcomes could be readily
discriminated by sequential canonical analysis, producing results very similar to those obtainable by confirmatory path analysis without requiring the stronger
theoretical assumptions of that model. This was done
directly on the variables of practical interest, without
altering the basic nature of the research question by
constructing either inappropriate common factors or
uninterpretable canonical variates. The statistical results
were readily interpretable and directly relevant to the
experimental hypotheses that motivated the study.
LITERATURE CITED
BENTLER, P. M. 1989. EQS, Structural Equations, Program
Manual, Program Version 3.0. BMDP Statistical Software, Inc., Los Angeles, CA.
COHEN, J., AND P. COHEN. 1983. Applied multiple regression/
correlation analysis for the behavioral sciences. Lawrence
Erlbaum, Hillsdale, NJ.
DEMETRIOU, A., C. CHRISTOU, G. SPANOUDIS, AND M. PLATSIDOU.
2002. The development of mental processing: Efficiency,
working memory, and thinking. Monographs of the Society of Research in Child Development 67(1, Serial No.
268):1-154.
FIGUEREDO, A. J. 1987. The statistical measurement, developmental mechanisms, and adaptive ecological functions of
conditioned host selection in the parasitoid Jewel wasp.
Dissertation, University of California, Riverside, CA.
FIGUEREDO, A. J. 1989. Host-cue information processing by
foraging Jewel wasps: Response-bias tracking of expected host-species encounter rates, not modified
profitabilities. Psychobiology 17:435-444.
FIGUEREDO, A. J., AND R. S. S. SAGE. 2007. Assortative mating
in the Jewel wasp: 1. Female matching of eye-color
genotype, not host-feeding phenotype. Journal of the
Arizona-Nevada Academy of Science, 39:51-57.
GORSUCH, R. L. 1991. UniMult: For univariate and multivariate data analysis. UniMult, Altadena, CA.
GORSUCH, R. L., AND A. J. FIGUEREDO. 1991. Sequential canonical analysis as an exploratory form of path analysis. Paper.
American Evaluation Association Conference, Chicago,
Illinois.
MOUYI, A. 2006. Untangling the cognitive processes web. Paper.
Seventh Annual Conference of the International Society for
Intelligence Research, San Francisco, California.
64
SAS INSTITUTE, INC. 1989. SAS Language and Procedures:
Usage, Version 6, First Edition. SAS Institute, Cary, NC.
Differential Parental Investment! Davis Et Al.
65
Differential Parental Investment in
the Southwestern United States
MELINDA F. DAVIS, CORDELIA B. GUGGENHEIM, AURELIO JOSÉ FIGUEREDO, & CATHERINE J. LOCKE*,
University of Arizona, PO Box 245073 Tucson, Arizona 85724-5073
*Deceased
ABSTRACT
The Trivers-Willard model (1973) predicts differential parental investment in children by sex and income;
wealthier families will invest more in boys, while poorer families will invest more in girls. We investigated the TW
Hypothesis in a sample of 103 six month old Tucson babies and their mothers. Hierarchical multiple regression
equations were used entering baby’s age, baby’s sex, mother’s age, male paternal commitment, a dichotomous
poverty measure, per capita income, and four interaction terms, baby’s sex by 1) mother’s age 2) mother’s education,
3) male paternal commitment, 4) poverty, and 5) per capita income. We included three dependent variables in
successive regression equations; mother’s attitudes towards ideal baby size for boy versus girl babies, weeks breast
fed and the baby’s weight at six months. These variables measure attitude, behavior, and physical outcomes. Poverty
was a significant predictor of differential preference in ideal body size for boys versus girls; poor mothers preferred
bigger baby girls. There was no evidence of differential preference in breast feeding. Education was also a significant
predictor; but in the opposite direction than predicted by the TWH. Mothers with higher levels of education had
heavier baby girls. Within Hispanics only, poverty was a significant predictor of sex-biased weight; poor mothers
had heavier baby girls. This effect was not seen in Caucasians. These results provide mixed evidence for the TriversWillard model in a resource rich environment for humans.
INTRODUCTION
'In species with a long period of parental investment after birth of young, one might expect biases in
parental behavior toward offspring of different sex,
according to parental condition; parents in better condition would be expected to show a bias toward male
offspring.' (Trivers and Willard, 1973)
The Trivers-Willard Hypothesis (TWH) proposes
sex-biased resource allocation by parents in order to
maximize the reproductive success of their offspring.
The hypothesis is salient in species where variations in
parental condition (either prenatally or after birth) contribute to the differential reproductive success by their
offspring. When reproductive variance is higher for
males than for females (an intrinsic function of polygyny), mothers in optimal condition (defined here by
plentiful resources) will be more likely to invest in male
offspring, while mothers in poor condition (defined by
scarce resources), will be more likely to invest in female
offspring. The TWH suggests that parents will differentially invest in the sex with the greatest chance for
reproductive success.
Since 1973, the Trivers-Willard Hypothesis (TWH)
has been the focus of hundreds of plant and animal
studies that have examined both the sex ratio at birth and
differential parental investment throughout the life cycle.
The results of these studies have been mixed, as have
been the result of several meta-analyses.
Brown and Silk (2002) examined sex ratios in nonhuman primates, and did not find support for the TW
Hypothesis. “Our analyses indicate that the data are
distributed much as we would expect by chance and that
maternal rank is not associated consistently with biased
birth sex ratios in this data set on nonhuman primates.”
In a subsequent meta-analysis, Schino (2004) replicated Brown and Silk’s overall results, but also found
support for the TW Hypothesis in conditions with high
resource availability and low sexual dimorphism.
Cameron (2004) performed a meta-analysis of 381
studies examining sex ratios in mammals (excluding
humans); only a third provided support for the TW
Hypothesis. However, there was considerable heterogeneity in the dataset. When the data were examined
within indicator of maternal condition and timing, there
was a consistent effect of maternal body condition
around the time of conception and the TW Hypothesis
was supported nearly 90% of the time. Sheldon and
West (2004) found the same results in a meta-analysis
of ungulate mammals.
Within humans, there is evidence for sex-biased
infanticide and differential mortality. However, the
evidence for differential parental investment is not
uniform; with half of the studies showing evidence
consistent with the TW hypothesis and the other half
rejecting it. This ambiguity may be due to the heterogeneity of indicators used in human studies, or to the
absence of a TW effect.
Unfortunately, the number of studies testing the
TW Hypothesis in humans remains limited, particularly
for resource rich environments, and existing work in the
United States has provided conflicting results. In a
DAVIS, M. F., C. B. GUGGENHEIM, A. J. FIGUEREDO, AND C. J. LOCKE. 2007. DIFFERENTIAL PARENTAL
INVESTMENT IN THE SOUTHWESTERN UNITED STATES. JOURNAL OF THE ARIZONA-NEVADA ACADEMY OF
SCIENCE 39(2)65-72.
Differential Parental Investment ! Davis Et Al.
heterogeneous set of studies using national samples in
the United States, Gaulin and Robbins (1991), Kanazawa
(2005), and Hopcroft (2005) all found evidence for the
TW Hypothesiss, while others (Freese and Powell 1999,
Keller et al. 2001) found no indication of sex-biased
investment.
It is our intention to investigate the Trivers-Willard
Hypothesis in a resource-rich sample in the Southwestern United States. Our measures of parental investment
are infant weight at six months, weeks breastfed, and
discrepancies in the mother’s preferred infant size for
boys and girls at six months. These outcomes are on a
continuum from attitude, behavior, to physical outcomes
and each is intended to measure the underlying construct
of parental investment. Our approach follows that of
Donald Campbell’s work on acquired behavioral dispositions. Rather than viewing behavior as different from
attitude, Campbell viewed them as a continuum, noting
that “. . there has been a stubborn confusion of the fact
that verbal behaviors and overt behaviors have different
situational thresholds with the fact of consistency. . .
From the dispositional perspective, the supposed absence of relationship between attitudes and behavior
disappears” (1963). In this framework, an attitude can be
considered a behavior that is not yet observable because
it has not passed the threshold into action. Further along
the continuum are rare and infrequent behaviors, those
that occur with fair regularity, and finally behaviors that
are ingrained habits.
Discrepancies in perception of the best baby size for
baby boys and girls is an indicator of the mother’s
attitude that she may not be consciously aware of. It is
difficult, if not impossible to get candid answers to the
question “Do you think that girl babies should be smaller
than boy babies?” A common measure of attitudes
regarding body size are the figural drawings pioneered
by Stunkard et al. (1983). In this approach, respondents
rate silhouettes of body size that correspond to standard
height and weight percentiles. Discrepancy scores between ideal and current body size (current – desired) are
used as measures of body dissatisfaction. In this study,
discrepancy scores are used as a covert measure of
maternal investment.
In a summary of breast feeding studies in humans,
Quinlan et al. (2005) found inconsistent results across
countries irrespective of resource level. Likewise,
Margulis et al. (1993) reported no support for the TW
hypothesis in a sample of Hutterites, while Gaulin and
Robbins (1991) found evidence for the TW Hypothesis
in length of breastfeeding in the United States.
We tested the TW Hypothesis in a sample of six
month old babies, using several indicators of parental
investment; maternal preference for baby size, months
breastfed, and infant weight at 6 months. Differential
preference for baby size is a measure of attitude; breast
feeding is a measure of behavior, and infant weight is a
physical outcome. Measures of parental condition in-
66
clude maternal age, education, male parental commitment, poverty, and per capita household income. Poverty and per capita household income are direct measures
of resources, as is male parental commitment (measured
by presence of a spouse or boyfriend in the household).
The age of the mother and her education are proxies for
rank and increased resources. We hypothesized that 1)
mothers with greater resources will prefer larger baby
boys than girls and mothers with few resources will
prefer larger baby girls. 2) Mothers with greater resources will breastfeed boys longer, and those with few
resources will breastfeed girls longer than boys. 3)
Mothers with greater resources will have heavier baby
boys, and mothers with few resources will have heavier
baby girls.
METHODS
Subjects
The women were drawn from a number of pediatric
clinics throughout a metropolitan area in Arizona, in
order to include women from a wide range of backgrounds. Eligible subjects were mothers of infants
between five and eight months old who were attending a
pediatric clinic for their child’s six-month well baby visit
and spoke English. Exclusion critera included attendance at the clinic for a sick child visit, and children
brought to clinic by someone other than their mother.
Mother-infant pairs were approached in pediatric waiting rooms, the sample was stratified by income and
ethnicity, and 110 women completed the interview. The
child’s weight was recorded from the medical records
after the visit. Weight was not collected for six of the
babies, and these mother-child pairs were not included in
the analyses. Two mother-child pairs were not included
in the analyses; one child was adopted, and the other
child had a birth defect that resulted in a very low weight.
Approximately half of the babies were male, and their
ages ranged from 23 to 37 weeks, and their weight ranged
from 4554 to 10700 grams. The mothers’ ages ranged
from 17 to 39 years (Table 1).
Procedure
Potential subjects were identified by the clinic staff
and were approached by the research staff member and
invited to participate in the study. The purpose of the
study was explained, and written consent obtained from
women who agreed to participate in the study. The
mothers were interviewed while at the pediatric clinic for
their baby’s 6 month well baby exam. The interview
elicited the mothers’ opinions on body size for themselves and their child, current feeding practices, and
demographic questions. The interviews took approximately twenty minutes to complete. In order to compensate the mother for her time and effort, each mother was
given a toy for her baby. Actual weight was recorded
Differential Parental Investment! Davis Et Al.
67
Tabl e 1. Demographics
Mother’s age
Mother’s education
Household size
Total income
Baby’s age
Baby’s weight
Discrepancy score
Weeks breastfed
Poverty
Mother’s ethnicity
Baby’s sex
Male paternal
commitment
Mean (SD)
27.2 (6.0)
13.6 (2.6)
4.3 (1.5)
$28,214 (16,037)
28.5 weeks (4.0)
7966 gr. (1079)
0.0 (0.70)
11.72 (10.17)
N (%)
< $15,000
27 (26.2)
= $15,000
76 (73.8)
Anglo
61 (59.2)
Hispanic
42 (40.8)
Male
52 (50.5)
Female
51 (49.5)
Does not live with
23 (22.3)
partner
Lives with boyfriend
11 (10.7)
Lives with spouse
69 (67.0)
from the child’s medical record. The study was approved
by the Human Subjects Com-mittee of the University of
Arizona.
Body Image
The measures for infant body size followed the figural
approach developed by Stunkard, Sørensen and Schulsinger
(1983). Figure rating scales have been widely used in body
image research to measure attitudes and perceptions, and
discrepancy scores between ideal and current body size
(current – desired) have often been employed as measures
of body dissatisfaction (Sørensen et al. 1983, Fallon and
Rozin 1985, Beebe et al. 1999, McArthur et al. 2005).
Seven line drawings for infants were developed based
on photographs of actual infants at different points on the
weight-for-length chart. The babies were between six and
eight months old ± 2 weeks. The middle drawing (#4)
represents infants at the 50th percentile on this chart, e.g.
their weight is average for all infants of a similar length.
Drawings #3 and #5 represent the 25th and 75th percentiles,
drawings #2 and #6 the 10th and 90th percentiles, and
drawings #1 and #7 correspond to <10th and > 90th percentiles (Figure 1). The figural approach has demonstrated
reliability and validity (Sørensen et al. 1983, Stunkard
2000, Bulik et al. 2001).
In this study, mothers were shown drawings of infants
and were told that these were six month olds of the same sex
as the mother’s own child. The mothers were asked to
indicate the ideal body size, as well as the range of acceptable body sizes. Mothers were then asked to imagine that
the drawings represented six month old infants of the
opposite sex, and were asked to rate the drawings on the
same criteria. To prevent any systematic bias resulting from
the order presentation, the infant drawings were presented
in a prespecified random order.
Discrepancy scores
The difference between current body size and desired
body size has been frequently used as a measure of body
dissatisfaction. Our measure of sex-biased parental investment was the mother’s discrepancy between ideal body size
for males minus the ideal body size for females. Overall, the
mothers endorsed the silhouette 4 (the 50th percentile) for
baby boys, and silhouette 4 for girls, with an average
discrepancy score of 0.0. However, nearly half of the
women chose either a heavier boy picture or a heavier girl
picture.
Parental condition
Measures of parental resources included the
mother’s age, her years of education, male parental
Figural Drawings for six month old babies
Figure 1. Figural stimuli for 6 month old babies. The numbers 1-7 represent the following points on
the weigh- for-length charts: 1 < 10%, 2 = 10%, 3 = 25%, 4 = 40%, 5 = 75%, 6 = 90%, 7 > 90%.
Differential Parental Investment ! Davis Et Al.
commitment (scored 1 for no partner, 2 for boyfriend,
and 3 for spouse), poverty (<$15,000 and > $15,000) and
per capita household income. The mother’s ethnicity
(Hispanic or Caucasian) and the baby’s sex were also
collected during the interview. Multivariate imputation
was used for the few cases with missing predictors.
Parental investment
Measures included maternal preference in baby
body size (discrepancy scores), the number of weeks the
mother breastfed her baby (adjusted to 24 weeks for
babies who were still being breastfed), and the baby’s
weight as recorded by the medical staff.
68
statistically for any indirect effects of the predictors
through the causally prior dependent variables.
In this analysis, we followed a theoretically
pre-specified order for the indicators of parental
invest-ment: Maternal preference for baby body
size was assumed to precede the length of breast
feeding, given the causal priority of attitude over
behavior. In turn, weeks breastfed was assumed to
be causally prior to the baby’s actual weight at six
months. The causal order for the dependent variables is illustrated in Figure 2.
Hypotheses
There are three specific hypotheses for this study. 1)
Mothers with greater resources will prefer larger baby
boys than girls; mothers with fewer resources will prefer
larger baby girls. Greater resources include increased
maternal age, education, male parental commitment, and
income. 2) Mothers with greater resources will breastfeed
boys longer than girls; mothers with fewer resources will
breastfeed girls longer than boys. 3) Mothers with
greater resources will have heavier baby boys than girls;
mothers with few resources will have heavier baby girls.
Data Analysis
A series of sequential hierarchical regressions was
executed in which the multiple dependent variables were
analyzed according to a prespecified causal order. Because the dependent criterion variables were expected to
influence each other, they were entered sequentially in a
causal order in a series of multiple regression equations
with each hierarchically prior criterion variable entered
as the first predictor for the next. Each successive
dependent variable was therefore predicted from the
initial set of ordered predictor variables, and the immediately preceding dependent variable was entered hierarchically as the first predictor, followed by all of the
ordered predictors from the previous regression equation. Each successive regression therefore entered all of
the preceding dependent variables in reverse causal
order to statistically control for the indirect effects that
might be transmitted through them. Within this analytical approach, the estimated effect of each predictor was
therefore limited to its direct effect on each of the
successive dependent variables. The general format for
these hierarchical multiple regressions is as follows:
Y1 =
Y2 =
Y3 =
Y1 +
Y2 + Y1 +
X1 +
X1 +
X1 +
X2 +
X2 +
X2 +
X3
X3
X3
This procedure is conceptually equivalent to a
sequential canonical analysis (Gorsuch and Figueredo
1991, Figueredo and Gorsuch, 2007), which controls
Figure 2. Theoretical and empirical model for
parental investment.
The independent variables and their order of entry for the dependent variables were 1) mother’s age,
2) mother’s education, 3) male parental commitment,
4) poverty (above or below $15,000), and 5) per capita
household income. Each of these is an indicator of
maternal condition, or socioeconomic resources.
Ethnicity was entered next to examine possible differences in feeding practices in Hispanics versus Caucasians. The baby’s sex and interaction terms with sex
were entered last.
The interaction terms test the TW hypothesis for
weight and for breastfeeding. A significant sex by
condition interaction in the appropriate direction is
regarded as support for the hypothesis; “wealthy”
mothers will invest more in sons and poor mothers
will invest more in daughters. For preference, the
dependent variable is already a measure of differential parental investment, and the main effects test
the TW hypothesis.
69
Differential Parental Investment! Davis Et Al.
RESULTS
Tabl e 3. Hypot hesis Test s f or Predict ors of Breast
f eeding.
The relationships between the three measures of
parental investment and the five measures of maternal
condition were examined for colinearity. Correlations
between the maternal condition measures ranged from
.26 to .59. Correlations between the measures of parental
investment ranged from -.13 to .15 and were not significant.
Preference
50th
On average, the mothers chose picture #4 (the
percentile) for the ideal baby boy and #4 for the ideal
baby girl; resulting in an average discrepancy score of
zero and a standard deviation of .7. While the average
was zero, there was considerable variation; nearly half of
the mothers chose either a bigger baby boy or baby girl
picture. Table 3 shows the results of a hierarchical
regression examining predictors of differential preference (best baby boy size minus best baby girl size).
Because preference is already a difference score, the
main effects test the TW Hypothesis.
Predictor
Preference
Mother’s age
F(1,102)
2.10
p
0.151
19.07 <0.001
Mother’s education
4.91
0.029
Male parental commitment
1.23
0.271
Poverty
0.99
0.332
Per capita household income
0.44
0.510
Hispanic ethnicity
1.19
0.278
Baby’s sex
0.12
0.725
Ethnicity * poverty
5 . 04
0.027
Sex * preference
0.15
0.698
Sex * mother’s age
0.02
0.881
Sex * mother’s education
7.16
0.009
Interactions
T a b l e 2 . H y p o t h e s i s T e s t s f o r P re d i c t o r s o f
P ref erence.
Sex * male parental commitment
0.17
0.682
Sex * poverty
0.01
0.944
Predictor
Sex * per capita household income
0.01
0.938
F(1,102)
p
Mother’s age
0.92
0. 3 3 9
Sex * ethnicity
0.07
0.794
Mother’s education
0. 8 0
0.372
Sex * ethnicity * poverty
4.62
0.034
Male parental commitment
1.00
0.321
Pove rty
9.68
0.003
Per capita household income
0.08
0.773
Hispanic ethnicity
0.21
0.645
Baby’s sex
4.38
0.039
Of the five indicators of parental condition, one was
significant and is bolded. Mothers whose incomes were
below the poverty level preferred larger baby girls, with
a standardized beta weight (β) of 0.28. This provides
support for the TW Hypothesis. The last two main
effects do not test the TW Hypothesis and are included
for consistency because they are used in subsequent
analyses. There was no difference in body size by
ethnicity. Mothers of girls preferred larger baby
girls, and mothers of boys preferred larger baby boys
(β= -0.21).
Breast feeding
Two indicators of maternal condition were associated with longer breast feeding (Table 3). Older mothers
breastfed their babies longer (β=0.32), and mothers with
more education breastfed longer (β= 0.21). For each
additional year of mother’s age, breastfeeding increased
by .5 of a week; each additional year of education
increased breastfeeding by .8 of a week. These differences provide evidence for the validity of education and
age as indicators of maternal condition. There were no
interactions by sex, and therefore no support for the TW
Hypothesis for this outcome.
Weight
The hypothesis tests for predictors of babies’ weight
are in Table 4. Only one main effect was significant, baby
boys were heavier than baby girls (β= 0.57). This term
adjusts for normal sex differences in weight, and the first
two interaction terms; sex * breastfeeding and sex *
preference are included to control for the effect of previous dependent variables. Mothers of boys who preferred
larger boys had larger boys, similarly, mothers of girls
who pre-ferred larger girls had larger girls (β= 0.16). This
effect may provide evidence for the link between weight
preference and the actual weight of the baby. Or, mothers
may simply prefer the size of the child that they have.
Differential Parental Investment ! Davis Et Al.
The remainder of the interaction terms with sex test
the TW Hypothesis for weight; only one test was significant; mothers with more education had heavier baby
girls (β= -0.40). For each additional year of the mother’s
education, the baby girls were 61 grams heavier. This
effect does not provide support for the TW Hypothesis
because it was in the opposite direction.
Inspection of the data indicated a more complex
relationship between income and weight that appear-ed
to be associated with ethnicity. We therefore tested the
interaction between ethnicity, poverty, and child’s sex.
Hispanic mothers in poverty had larger baby girls than
Hispanic mothers with greater resources; in fact, their
daughters were larger than baby boys born to Hispanic
mothers in poverty (β= 0.11). This was not seen in
Caucasian mothers.
Power analysis
With a sample size of 103 and 16 predictors in a
multiple regression analysis, this study had the power
to detect a small effect size (R2 =.02) 10% of the time,
and a medium effect size (R2=.15) 60% of the time. A
standardized effect size of .22 would be detected
approximately 80% of the time.
DISCUSSION
We examined the TW Hypothesis in a resource rich
metropolitan environment in the southwestern United
States using three indicators of parental investment;
maternal attitude, behavior, and baby’s weight. This
study was specifically designed to elicit information
regarding feeding practices in mothers of young children. In these analyses, we included five measures of
parental condition; mother’s age, education, male parental commitment, a dichotomous measure of poverty, and
per capita household income. Poverty appeared to be the
only maternal condition that was associated with a TW
effect in this resource rich environment, and the effect
appeared to be more pronounced in Hispanic mothers.
Mothers showed a differential preference for baby
body size by poverty and sex. This effect was not
observed for income, but only when the dichotomous
variable for poverty (<15,000) was used. Poorer mothers
preferred heavier girl babies, and mothers with more
resources preferred heavier boy babies. This is consistent with the TW Hypothesis. For breast feeding, there
was no evidence of differential parental investment.
This is consistent with Gaulin and Robbins (1999) who
found no effect for breast feeding in a United States
sample. For weight, mothers with more education had
heavier girl babies, which was an effect that was in the
opposite direction.
Interestingly, we observed a differential patternby
ethnicity; a TW effect was observed only in poor Hispanic mothers and not in poor Caucasian mothers. This
result is puzzling. Why would a TW effect be active in
70
Hispanics, but not in Caucasians? We speculate that
the difference could be due to the study protocol. The
sample was stratified by income and considerable effort was taken to ensure the same percentages of Hispanics and Anglos were included in each income bracket.
This resulted in an oversampling of affluent Hispanics
in the study, when compared to the US Census (2005).
The use of discrepancy scores as a subtle measure
of body satisfaction is a common use of figural drawings, and rating others with these drawings is also fairly
typical. This is an example of a disguised method
(Campbell 1950, Davis 2001), and is useful in a setting
where the individual either may not be consciously
aware of their belief, or may not be willing to share it.
There was no effect of parental condition on
breastfeeding in this study, and the results are variable
in other countries (Quinlan et al. 2005). Breast feeding
information is problematic as a measure of parental
investment, and variations in duration may be attributable to cultural, socioeconomic, or reporting error. An
additional issue concerns a mother’s inability to provide breast milk.
Were the measures we chose for parental condition
valid in this sample and setting? In order to evaluate the
effectiveness of the measures of parental condition we
examined all main effects and interactions. Associations with parental investment, either for all children
(main effect), or for some children (an interaction effect
by sex) provide support for the value of each measure
of condition. The study showed significant main effects for mothers’ education and age, and significant
interaction effects for poverty, providing evidence for
the validity of three indicators of parental condition in
a resource rich setting. However, only poverty was
associated in a Trivers Willard effect.
Results from existing investigations of the TW
Hypothesis are highly variable, and a meta-analytic
approach will undoubtedly be required to tease out the
effects of parental condition in humans. The benefits of
meta-analysis in this area of research have been demonstrated in ungulates (Sheldon and West 2004), nonhuman primates (Shino 2004) and mammals (Cameron
2004). In each case, subtle effects were found based on
either type of maternal condition or timing. To our
knowledge, a meta-analysis in humans has not been
attempted, and we await the results of such an endeavor
with great interest.
Contrary to the TW Hypothesis, we found that the
weights of baby girls at six months increased in tandem
with the education of their mothers. Guggenheim
(2004) also found a number of regional effects that
were not consistent with the TW Hypothesis. Overall,
this study provides ambiguous results for the TWH. It
may well be that timing effects for the TWH in humans
are consistent with other mammals, and any effect of
the Trivers Willard Hypothesis takes place well before
birth.
71
Other approaches, such as Life History Theory or
Local Resource Competition may offer additional explanatory power. We recommend the use of multiple
working hypotheses (Chamberlin 1890) in future studies.
There were several limitations to this study. The
mothers were approached during their child’s six month
well baby visit and the sample was therefore restricted to
mothers who were already investing resources in their
children; in essence, the data were collected at the
watering hole. The study was powered to detect a
standardized effect size of .22 approximately 80% of the
time. Such a sample size would detect some, but not all
of the hypothesized differences.
These analyses provide a novel test of the TriversWillard Hypothesis in a resource-rich environment. We
examined the effect of the TW Hypothesis on weight,
breastfeeding, and differential preference; and found
mixed support for the hypothesis in humans. Only
poverty was associated with sex-biased maternal investment in both the mother’s preference in baby size and for
actual weight.
ACKNOWLEDGEMENTS
This research was supported by the US Department of Health and Human Services, Administration on
Developmental Disabilities - 90DD0315. Our appreciation to Anne Wright who reviewed an earlier version
of this manuscript, Barbara Altimari for project management, and Britt Robertson for parent interviews.
LITERATURE CITED
BEEBE, D. W, G. N. HOLMBECK, AND C. GRZESKIEWICZ. 1999.
Normative and Psychometric Data on the Body Image
Assessment-Revised. Journal of Personality Assessment
73:374-394.
BROWN, G. R., AND J. B. SILK. 2002. Reconsidering the null
hypothesis: Is maternal rank associated with birth sex
ratios in primate groups? Proceedings of the National
Academy of Sciences 99:11252–11255.
BULIK, C. M., T. D. WADE, A. C. HEATH, N. G. MARTIN, A. J.
STUNKARD, AND L. J. EAVES. 2001. Relating body mass
index to figural stimuli: Population-based normative data
for Caucasians. International Journal of Obesity & Related Metabolic Disorders: Journal of the International
Association for the Study of Obesity 25:1517-1524.
CAMERON, E. Z. 2004. Facultative adjustment of mammalian
sex ratios in support of the Trivers-Willard hypothesis:
Evidence for a mechanism. Proceedings of the Royal
Society of London, Ser. B 271:1723–1728.
CAMPBELL, D. T. 1950. The indirect assessment of social
attitudes. Psychological Bulletin 47:15-38.
CAMPBELL, D. T. 1963. Social attitudes and other acquired
behavioral dispositions. Pp. 94-172 in S. Koch, ed., Psychology: A study of a science, Volume 6: Investigations of
Man as Socius. McGraw-Hill, New York
CHAMBERLIN, T. C. 1890. The method of multiple working
hypotheses. Science (old series) 15:92.
DAVIS, M. F. 2001. Method Variance in the Social Sciences.
Dissertation, University of Arizona, Tucson.
Differential Parental Investment! Davis Et Al.
FALLON, A. E., AND P. ROZIN. 1985. Sex differences in perceptions of desirable body shape. Journal of Abnormal Psychology. 94:102-105.
FIGUEREDO, A. J. AND R. GORSUCH. 2007. Assortative mating in
the jewel wasp: 2. Sequential canonical analysis as an
exploratory form of path analysis. Journal of the Arizona
Nevada Academy of Sciences. 39(2):59-64.
FREESE, J., AND B. POWELL. 1999. Sociobiology, status, and
parental investment in sons and daughters: Testing the
Trivers-Willard Hypothesis. American Journal of Sociology 106:1704-1743.
GAULIN, S. J. C., AND C. J. ROBBINS. 1991. Trivers-Willard effect
in contemporary North American society. American
Journal of Physical Anthropology 85:61-69.
GORSUCH, R. L., AND A. J. FIGUEREDO. (1991). Sequential
canonical analysis as an exploratory form of path analysis.
Paper presented to the American Evaluation Association
Conference, Chicago, IL.
GUGGENHEIM, C. B. 2004. Lost daughters and fragile sons: Patterns of differential parental investment across thirty-five
countries. Dissertation. University of Arizona, Tucson.
HOPCROFT, R. L. 2005. Parental Status and Differential Investment in Sons and Daughters: Trivers-Willard Revisited.
Social Forces 83:1111-1136.
KANAZAWA, S. 2005. Big and tall parents have more sons:
Further generalizations from the Trivers-Willard Hypothesis. Journal of Theoretical Biology 235:583-590.
KELLER, M.C., R. M. NESSE, AND S. HOFFERTH. 2001. The
Trivers–Willard hypothesis of parental investment: No
effect in the contemporary United States. Evolution and
Human Behavior 22:343-366.
MARGULIS, S. W., J. ALTMANN, AND C. OBER. 1993. Sex-biased
lactation duration in a human population and its reproductive
costs. Behavioral Ecology and Sociobiology 32:41-45.
MCARTHUR, L. H., D. HOLBERT, AND M. PEÑA. 2005. An exploration of the attitudinal and perceptual dimensions of body
image among male and female adolescents from six Latin
American cities. Adolescence 40:801-816.
QUINLAN, R. J., M. B. QUINLAN, AND M. B. FLINN. 2005. Local
resource enhancement and sex-biased breastfeeding in a
Caribbean community. Current Anthropology 46:471-480.
SCHINO, G. 2004. Birth sex ratio and social rank: Consistency
and variability within and between primate groups. Behavioral Ecology 15:850-856.
SHELDON, B. C., AND S. A. WEST. 2004. Maternal dominance,
maternal condition, and offspring sex ratio in ungulate
mammals. American Naturalist 163:40-54.
SØRENSEN, T., A. STUNKARD, T. TEASDALE, AND M. HIGGINS.
1983. The accuracy of the reports of weight: Children’s
recall of their parents’ weights 15 y earlier. International
Journal of Obesity 7:155-122.
STUNKARD, A. 2000. Old and new scales for the assessment of
body image. Perceptual and Motor Skills 90:(3 Pt 1): 930.
STUNKARD, A., T. SØRENSEN, AND F. SCHULSINGER. 1983. Use of
the adoption registry for the study of obesity and thinness.
Pp. 115-120 in S. Kety, L. Roland, R. Sidman, and S.
Matthysse, eds., The Genetics and Neurological and
Psychiatric Disorders. New York, Raven Press.
TRIVERS, R. L., AND D. E. WILLARD. 1973. Natural selection of
parental ability to vary the sex ratio of offspring. Science,
179:90-92.
U.S. CENSUS BUREAU, HOUSING AND HOUSEHOLD ECONOMIC STATISTICS DIVISION. 2005. Historical Income Inequality Tables.
http://www.census.gov/ hhes/www/income/histinc/
ineqtoc.html Accessed 18 Dec 2006.
Differential Parental Investment ! Davis Et Al.
72
SONS OR DAUGHTERS ! GUGGENHEIM ET AL.
73
SONS OR DAUGHTERS: A CROSS-CULTURAL STUDY OF SEX
RATIO BIASING AND DIFFERENTIAL PARENTAL INVESTMENT
CORDELIA B. GUGGENHEIM, MELINDA F. DAVIS, AND AURELIO JOSÉ FIGUEREDO, UNIVERSITY OF ARIZONA,
TUCSON, AZ 85721
ABSTRACT
Survivorship of children is dependent upon numerous variables, including the role that preferential treatment may
play in biasing the birth and survival of sons and daughters across cultures. This study draws upon an evolutionary
approach by examining a theory referred to as the “Trivers-Willard hypothesis” concerning condition-dependent sex
allocation and differential parental investment. Previous research on humans concerning this hypothesis tends to be
restricted to one cultural group and thereby limited in sample size. For this study, nationally representative household
survey data collected by the Demographic and Health Surveys (DHS+) program across 35 countries was used to test
biological, resource-oriented, and behavioral aspects affecting maternal condition, sex allocation, and parental
investment in humans. The units of analysis for this study were the mothers and their lastborn child (N = 128,039
woman-child pairs). A series of hierarchical regressions were executed to empirically investigate the TW hypothesis
in humans. Scales were developed for maternal socioeconomic resources (MSR), maternal biological condition
(MBC), prenatal care for the lastborn child (PCL), and health-seeking for the lastborn child (HSL). MSR was measured
by relative household economic status, woman’s and partner’s education, and residence in an urban/rural setting. MBC
was defined by body mass index, pregnancy status and duration, and breast-feeding status. PCL was an index for type
of prenatal care received, number of prenatal visits, and assistance during delivery of the lastborn child. Lastly, HSL
measured indicators of treatment for diarrhea and immunizations received by the lastborn child. Across the 35
countries, the analyses did not support the Trivers-Willard hypothesis. However, there is evidence of regional and
country level differences.
INTRODUCTION
“Once you’ve learned to think of a herring gull as
an equal, the rest is easy.” – William Drury (Trivers
2002, p. 57)
While there are academic endeavors that can wrap
themselves neatly around a circle of concepts, sex ratio
research is certainly not one of them. Explorations of
what drives any sexual species to produce one of merely
two potential outcomes – a daughter or a son – has
garnered remarkable attention for several centuries.
Indeed, sex determination or adjustment has been attributed to timing of ovulation, maternal hormones,
parental chromosomes, sexual positions, phases of the
moon, mating opportunities, available nutritional resources, environmental stressors (e.g., drought, famine,
flood), temperature changes (viz., across reptilian species), presence of kin and non-kin, the probability of
infant survivorship, and even stochastic variation
(Andersson 1994, Brewis 1993, Brown and Silk 2002,
Cameron 2004, Ciofi and Swingland 1997, CluttonBrock and Iason 1986, Grant 1998, Hamilton 1967,
Hardy 1997, Hardy 2002, Hrdy 1987, Hrdy 1988,
James 1987). Sex ratio differences – what causes them
and what maintains them – have been studied in a wide
spectrum of species, ranging from spinach to whales
(e.g., Freeman et al. 1994, Wiley and Clapham 1993).
What is noteworthy is the extent to which research
has attempted to account for such variations in elements
of gestation, survivorship, and parenting style. Such
research has crossed the domains of anthropology,
sociology, psychology, biology, and evolutionary
theory. The evolutionary angle from which academics
tend to approach the topics of conception, gestation,
survivorship, and parenting is understandably from an
evaluation of reproductive fitness and the adaptations
that are predicted to come into play in order to maximize
it. The heart of the theory reviewed in this paper is
embedded in the notion that nonhuman and human
animals actually do behave in a manner so as to optimize reproductive success (i.e., number of viable offspring produced), measured not only in parity but also
in post-parturition investment in the survivorship of
offspring.
Richard Alexander once wrote: “My own view of
the optimal outcome would be for the significance of
evolution to become so widely known and so thoroughly embedded in the understanding of all those
working in human-oriented disciplines, that its tenets
can be employed, without fanfare, when they are useful,
and ignored or discarded when they are not” (1988, p.
339). A model concerning sex-biased parental investment is evaluated in this light, drawing upon evolutionary and cultural approaches in order to appreciate not
only how evolutionary adaptations operate within cultures but also how humans cross-culturally may be
GUGGENHEIM, C. B., M. F. DAVIS, AND A. J. FIGUEREDO. 2007. SONS OR DAUGHTERS: A CROSS-CULTURAL STUDY OF SEX RATIO
BIASING AND DIFFERENTIAL PARENTAL INVESTMENT. JOURNAL OF THE ARIZONA-NEVADA ACADEMY OF SCIENCE 39(3):73-90.
SONS OR DAUGHTERS ! GUGGENHEIM ET AL.
outrightly invoking an evolutionary strategy in the
treatment of their children. As Cronin points out, the
acceptance of natural selection and sexual selection as
forces of evolution are thankfully, at this point, “...for
modern Darwinism...a storm in a teacup” (1991, p.
236).
This study focuses on a theory referred to as the
“Trivers-Willard hypothesis” concerning conditiondependent sex allocation and differential parental investment in humans (Trivers and Willard 1973). The
central idea is that within a polygynous social mating
structure, where reproductive variance is higher for
males than for females as an intrinsic function of polygyny, mothers in optimal condition (defined by high
status, good health, and abundant resources) are more
likely to produce and invest in male offspring whereas
mothers in poor condition (defined by low status, poor
health, and resource deprivation) are more likely to
produce and invest in female offspring. Although the
Trivers-Willard hypothesis has been demonstrated in
many animal species, its application and explanation
for sex ratio determination across taxa have been notably inconsistent, sometimes even within species (e.g.,
Cockburn 1994, Cockburn et al. 2002). For example,
animals exhibiting condition-dependent sex allocation
and differential parental investment include: fur seals,
elephant seals, opossums, red deer, mule deer, zebras,
spider monkeys, horses, humpbacked whales, chimpanzees, zebra finches, reindeer, hamsters, rats, coypus, mouse lemurs, and wood rats. Species where the
theory has demonstrated less applicability include sea
lions and chickens with inconsistent support evident
across studies on bison, lion tamarins, certain species of
ungulates, pigs, mice, rhesus monkeys, and baboons1.
In one study, food-restricted hamster mothers produced smaller, female-biased litters and by the 15th day
of the study, they had reduced their litters by half,
whereas no such mortality pattern was present in the
other food conditions (Huck et al. 1986). Moreover, of
the food-restricted pups, males and females weighed
less than all others by the 25th day of the study. Similar
tests of maternal condition and its impact on the sex
ratio of litters have yielded moderate support for the
TW hypothesis (e.g., coypu; Gosling 1986). In the wild
boar, for example, mothers apparently adjust their litter
size and sex ratio in that maternal condition, as measured by weight and size, was associated with litter size
1 For good reviews of studies addressing sex ratio variation and
differential parental investment across various species (including humans) refer to any of the following: Anderson and
Crawford 1993, Brown 2001, Clutton-Brock and Iason 1986,
Caley and Nudds 1987, Cameron 2004, Carranza 2002,
Cockburn et al. 2002, Hamilton 1967, Hardy 1997, Hewison
and Gaillard 1999, Hrdy 1988, James 1987, Lazarus 2002,
Pedersen 1991, Rosenfeld and Roberts 2004, Sheldon and West
2004, and van Hooff 1997.
74
(Fernández-Llario et al. 1999). A study on house mice
by Krackow (1993) examined fitness differentials based
on body weight variation and found a positive effect for
males but not for females. Yet in others, support is
absent on account of social variables purported to
influence the sex ratio (e.g., yellow-bellied marmots;
Armitage 1987; spotted hyaenas; Hofer and East 1997).
Nonetheless, several studies seem to suggest that
the TW hypothesis finds its strongest support in ungulates such as reindeer and red deer (Kojola 1997). For
example, Kucera (1991) demonstrated that heavier and
fatter (as measured by kidney-fat index) female mule
deer produced male-biased litters. In fact, Hewison and
Gaillard (1999) argue that the predictions are supported
in species such as red deer, reindeer, bighorn sheep, and
fallow deer because these species most satisfy the
assumptions of the theory.
Its applicability to humans is just as open to debate.
While several studies on humans have found support
for the Trivers-Willard hypothesis, others have provided inconclusive results on account of differences in
measuring evidence of sex ratio biasing and parental
resource allocation biasing (Keller et al. 2001). For
example, several studies have indicated a weak to
moderate Trivers-Willard effect for lower status groups
but no real effect in the higher status groups, regardless
of how status (or maternal condition) was actually
measured (e.g., Borgerhoff Mulder 1998, ChaconPuignau and Jaffe 1996, Webster 2004). On the other
hand, Freese and Powell (1999) were not able to support
the TW hypothesis in humans in that sex-biased differences in socioeconomic investment were absent in their
study while Mealey and Mackey (1990) showed that,
among 19th century Mormons, higher order wives
produced higher sex ratios. Interestingly, they also
make note that a threshold effect may exist in humans
in that the TW hypothesis “…cannot by itself predict
which species will exhibit sex ratio biases under which
conditions, or whether sex ratios will exhibit continuous variations or respond to some environmental threshold” (1990, p. 92).
The data required for such an analysis in humans
has often been unavailable or problematic in its measurement, and differences across cultures – for example, the practice of selective neglect and/or infanticide
– may confound otherwise potentially ethologically
comparative analyses. As a result, there is a theoretical
and practical need for additional examination of condition-dependent sex allocation and differential parental
investment in humans. The main objective of this study
is to extend the application of the animal model to
include human reproductive and parental behavior
thereby revealing how humans produce and enhance
the survival of their male and female offspring based on
an evolutionary principle concerning reproductive success. Essentially, what might a cross-cultural test of the
hypothesis yield for our species? The idea is to under-
75
stand the “survival of those who survive” (Trivers
2002, p. 64) and how or why their mothers let them live.
This study tests the Trivers-Willard hypothesis1 by
pursuing two specific aims. First, it will evaluate the
significance of condition-dependent sex allocation of
offspring among humans. The working hypothesis is
that the sex of the child is predicted by maternal condition such that a mother who is “healthy and wealthy”
will be more likely to have a son. A woman in suboptimal condition regarding health and resources will,
instead, more likely be the mother of a daughter. Secondly, the significance of differential parental investment among humans will be examined with the following
working hypothesis in mind: Females of high status,
abundant resources, and good health will be more likely
to invest more resources in their male offspring than in
their female offspring. Females at the other end of the
condition continuum will tend to invest more in their
female offspring. These questions will be examined in
a multi-sample analysis spanning 35 developing countries from four regions around the world. This study will
present an analysis of the TW hypothesis in humans by
testing variables that may predict the sex allocation of
the lastborn child and the parental behaviors differentially directed toward that child, thereby comparing the
human model to patterns found in other animals.
As such, central assumptions of the theory concerning natural deviations from a 50/50 sex ratio in
nature are the following: (a) the condition of the mother
during the period of parental investment serves as a
predictor of the condition of the offspring at the end of
the parental investment period; (b) differences in offspring condition at the end of the period of parental
investment persist through adulthood; and (c) in species
where males exhibit less parental investment than females, males are expected to exhibit higher variance in
reproductive success. In summary, according to Trivers
and Willard, “...under certain well-defined conditions,
natural selection favors systematic deviations from a
50/50 sex ratio at conception.…other things being
equal, species showing especially high variance in male
[reproductive success] (compared to variance in female
RS) should show, as a function of differences in maternal condition, especially high variance in sex ratios
produced” (1973, p. 90).
Interestingly, most tests of the TW hypothesis tend
to focus on an abbreviated version without necessarily
2
Note that, in the research literature, the central hypothesis is
sometimes interchangeably referred to as the “Trivers-Willard
model” or as the “Trivers-Willard hypothesis” and thus abbreviated as TWM or TWH, respectively. In some cases it is cited
as the “Trivers-Willard effect” when referring to particular
predictions of the hypothesis. For the sake of clarity in this
paper, the guiding theoretical framework is referred to as the
“TW hypothesis” while the term “model” will indicate the
analytical approach used to test the hypothesis, as measured in
this study.
SONS OR DAUGHTERS ! GUGGENHEIM ET AL.
including – or at least, demonstrably excluding – some
of the underlying predictions. While many studies readily
focus on “good-condition mothers invest in sons,” it
should not be overlooked that the following parameters
were also included: (a) the theory is expected to apply
to species with small brood sizes, given cited empirical
evidence – yet less so to species with larger litters; (b)
compensatory growth, in response to malnutrition or
maternal effects, for example, is deemed trivial in
effect; (c) sex ratio at birth is assumed to be an indicator
of parental investment; (d) and reproductive success of
males is free to vary on account of “negligible” paternal
care (Trivers and Willard 1973, p. 91). Which is to say
that several central assumptions must be met in order
for analysis to address the actual Trivers-Willard hypothesis, as outlined in the original paper. The role of
polygyny and minimal parental investment by males is
central to the assumptions of the theory.
In fact, variance in reproductive success is explained as “all or nothing” for males within a polygynous mating system whereas the contrast in reproductive
success for females is of smaller disparity in that a
female is more likely than not to successfully secure
mating opportunities at some point. Although she will
never be able to produce as many potential grandchildren as a male could, it is at least more probable that she
will at least have the opportunity to try. In truly socially
polygynous mating systems – such as those seen among
herding or harem species – not all males are so fortunate. Hence, the greater disparity between the males
who are outcompeted by other males in their access to
females. While Clutton-Brock and Iason pronounce
that “...the firmest conclusion that can be drawn from
the distribution of observed trends in the sex ratio is that
the distribution does not conform to the predictions of
any single adaptive hypothesis” (1986, p. 367), it is all
the more reason to apply an empirical test of one such
as that provided by the Trivers-Willard hypothesis.
Precisely when parental investment begins and
ends can be problematic in terms of measurement; most
definitely, depending on the species in question. Nonetheless, it has been defined as “...any investment by the
parent in an individual offspring that increases the
offspring’s chance of surviving (and hence reproductive success) at the cost of the parent’s ability to invest
in other offspring” (Trivers 1972, p. 139). Despite all
their efforts and intentions, or because of all the variance in parental behaviors, not all parents watch their
children grow up and produce grandchildren; there is
also “the darker side of parenting” (Scheper-Hughes
and Sargent 1998, p. 21). Indeed, the impact of parental
under-investment – or disinvestment – ranges from
passive neglect to active infanticide (Scrimshaw 1984).
While 9% of all world cultures practice sex-selective
infanticide (Hrdy 1999), most cases where human children do not survive would be more appropriately classified as selective neglect (Larme 1997).
SONS OR DAUGHTERS ! GUGGENHEIM ET AL.
Extensive research has accumulated in the last
years addressing what adaptive mechanisms may be
driving the practice of infanticide as well as how cultural sanctions may be merely providing the framework
in which “evolved decision rules” can operate (Hrdy
1999). In fact, much of the research on infanticide
seems to support the Trivers-Willard hypothesis in that
infanticidal trends are most evident where females have
the smaller chance of reproductive success comparable
to males (Beise and Voland 2002, Daly and Wilson
1984, Dickemann 1979, Gosden et al. 1999, Jeffery et
al. 1984, Miller 1987). For example, Hrdy describes the
role of the “daughter destroyers” within elite clans in
India (1999, p. 326) and cites research revealing that
among lower castes, genetic markers crossing caste
boundaries serve as an indicator of females engaging in
“marrying up” (i.e., hypergamy; Hrdy 1999, p. 340) in
that the lower caste females have higher reproductive
variance than their male counterparts. Among the elites,
however, daughters are at a distinct disadvantage. As
such, countries known to practice sex-selective infanticide are included in the analyses.
METHODS
Data source
The study population was drawn from 134,257
women in 35 countries who were interviewed between
1992 and 1998 as part of the Demographic and Health
Surveys (DHS+) with funding from the US Agency for
International Development1. DHS+ surveys are nationally representative samples by country and usually use
multistage cluster sampling techniques. We selected all
countries in the DHS+ datasets that collected complete
data for maternal nutritional status, socioeconomic resources, and parental investment. These 35 countries
are in the least developed and developing categories.
Sub-Saharan Africa is represented by 21 countries,
South Asia and the Near East/North Africa by three
countries each, and the Caribbean/Latin America by
eight countries. A list of all countries selected for this
study and survey collection dates is presented, by region, in Appendix Table 1.
Sample
The youngest child for each mother was selected
for the analyses and cases with missing data (4.6%)
were excluded, leaving a final sample size of 128,039
mother-child pairs (weighted N = 124,371). The sample
size per country ranged from 732 for Comoros to 5,361
for Guatemala, with an average of approximately 3,000
mother-child pairs. The exception was India, with a
sample size of 21,839 (Appendix Table 1).
3 Dataset access can be initiated at the MEASURE DHS+
website: http://www.measuredhs.com/
76
Note that the sampling techniques employed in the
DHS+ surveys are designed to maximize the retrieval of
information representing 100% of the population surveyed. DHS+ emphasizes the use of nonzero probability sampling. When this is not the case and a sample is
not self-weighting, calculation of sample weights should
be included in any inferential analyses of DHS+ data
(Macro International Inc. 1996). That is, individual
weights in almost all of the surveys are normalized so
that if all of the women within a sample are included, the
weighted and unweighted numbers of women should be
equal. If selecting a sub-sample, then the weighted and
unweighted sample sizes will differ. In order to ensure
that analysis is representative of the entire population –
and not just of the sample population – sample weights
are provided with all data sets for proper parameter
estimation; their significance being that they "...are
often interpreted as the number of population units
represented by each unit of the sample" (Häder and
Gabler 2003, p. 124). Therefore, sample weights were
included in the analyses.
Procedure
Women between the ages of 15-49 were interviewed by trained personnel using structured questionnaires. Response rates were generally above 95%. In
each country, households were selected using a multistage clustering procedure. The surveys included questions related to maternal resources, biological condition,
medical care, and health-seeking behavior. Data collection procedures are described in detail in ORC Macro
(2005).
Scales
Four scales were developed relating to the biological, resource-oriented, and behavioral aspects affecting
maternal condition, sex allocation, and parental investment in humans. Selection of the final set of variables
are presented in Appendix Table 2 , along with brief
descriptions and coding schemes. Stringent decision
rules were applied to enhance the cross-species comparative focus of this study, and these are briefly reviewed here.
Maternal socioeconomic resources (MSR)
The four dimensions of household economic status
were drawn from a previous study by Smith et al.
(2003). Economic status was defined by level of basic
needs met and number of assets owned. Basic needs are
measured by whether the household has a finished
floor, toilet facility, or access to piped or bottled water.
Cheap assets include a radio, a television, or a bicycle;
expensive assets include a refrigerator, motorcycle, or
car. Households were classified as “destitute” if they
have one or none of the basic needs and none of the
assets. Households classified as “poor” have no more
77
than two of the basic needs and cheap assets. “Rich”
households have at least two basic needs met and at least
one expensive asset. Urban households received a
higher score because urban living tends to provide
women with more efficient access to resources
(Madzingira 1993). The woman’s level of education
was included as a resource, and male contribution was
measured by presence in the household and educational
status.
Maternal biological condition (MBC)
Body mass index (BMI) was used as an indicator of
the continuum of maternal condition. Higher BMI is
used as an indicator of good condition, along with a
categorical variable for under-nourished, healthy, or
overweight. Current pregnancy and breast-feeding
status were also included in order to differentiate between women on several important elements that constitute the mammalian female reproductive life cycle.
Breast-feeding was predicted to have a negative effect
on a mother’s condition by placing a drain on her energy
and nutrient intake (Brown 2001). Length of pregnancy
(as opposed to pregnancy status by itself) was used as
a positive indicator of maternal condition in that the
very fact that a woman is able to be pregnant and
especially to be able to maintain a pregnancy for any
length of time are considered indicators of good condition (Haig 1999). Although any pregnancy naturally
deprives a woman of necessary energy reserves (i.e.,
maternal depletion; Ellison 2001), the very fact that she
is able to be pregnant and especially to be able to
maintain a pregnancy for any length of time are considered indicators of good condition, especially given that
malnourished and/or stressed women are more likely to
miscarry (Kerr 1971). For this reason, current pregnancy duration was selected for the maternal health
scale whereby women who were not pregnant were
coded as zero for pregnancy duration.
Prenatal care for lastborn child (PCL)
This scale measured protective, risk-avoidance behaviors. Central to this scale is access to care and initial
parental effort directed toward the maintenance of
maternal and child health. Type of prenatal care was
measured by amount and provider (none, non-medically trained, or medically trained provider). Measures
for birth assistance included delivery at a medical
facility and type of provider.
Health-seeking for lastborn child (HSL)
The overall HSL scale is not a measure of the
child’s health, but the parental response and prevention
strategy experienced by the child. It serves as a direct
measure of parental-initiated investment per lastborn
child. Receipt of immunizations and age-relative compliance with immunization completion were included
SONS OR DAUGHTERS ! GUGGENHEIM ET AL.
to provide each child with a score comparable to all
other children, regardless of age. Note, though, that all
eight vaccinations should ideally be given before the
first birthday has been reached such that by the time
they are 4 months old, children should have received
five of the eight vaccinations (Smith et al. 2003).
Furthermore, diarrhea in the developing world is particularly dangerous for children and the risk of dehydration cannot be underplayed; as such, the scale was
coded to emphasize treatment-seeking. The highest
score was assigned to children who received treatment
and the lowest score assigned to those who did not
receive treatment for a reported case of diarrhea.
Scale construction
Composite scores were created by standardizing
and averaging the variables, and scale scores were
standardized again for the hierarchical regressions; the
specific means (M) and standard deviations (SD) should,
by definition, tend toward a mean of zero and a standard
deviation of 1. Listwise deletion was used in scale
construction. Final determination of scales was based
on a Cronbach’s alpha > .60. The inter-item reliabilities
for the four scales ranged from .63 for MBC to .85 for
HSL. Internal consistency – a measure of scale reliability – serves as evidence of construct validity. For scale
confirmation, the indicators were then correlated with
the scales. The results of these correlations are presented in Table 1. While the diarrhea treatment variable
did not correlate strongly with the scale to which it was
assigned (r = .24, p < .01), the item was retained for
theoretical considerations. It was necessary to include
a measurement for diarrhea treatment-seeking behavior
given that diarrhea is one of the main causes of mortality
in children in developing countries. In this case, theoretical considerations overrode the weaker statistical
justification for retaining that variable in the healthseeking behavior scale. Zero-order correlations between the scales ranged from .2 to .6. Detailed
information regarding scale construction is in
Guggenheim (2005).
Statistical analyses
A hierarchical cluster analysis involving Ward’s
linkage method was used to determine the pattern of
association between cases based on discrepancies between the national at-birth sex ratio and the sex ratios
within the current data. The cluster analysis utilized a
composite score for each country using national estimates from the CIA and survey research data to measure similarities and differences in individual countries’
deviation from their estimated at-birth sex ratio among
children under five years of age.
A series of hierarchical regressions was performed
in which multiple dependent criterion variables were
analyzed sequentially according to a hypothesized causal
SONS OR DAUGHTERS ! GUGGENHEIM ET AL.
Table 1. Pearson correlations between scales and indicators.
MSR MBC PCL HSL
Whether woman lives in urban area .715
Household economic status
.778
Woman’s education
.802
Male’s educational value
.738
78
them. Within this analytical scheme, the estimated
effect of each predictor was limited to its direct effect on
each of the successive dependent variables. The general
format for these hierarchical multiple regressions was
therefore as follows:
Y1 =
X1 + X2 + X3
Y2 =
Y1 + X1 + X2 + X3
Y3 = Y2 + Y1 + X1 + X2 + X3
Woman’s BMI
.785
Woman is malnourished/
healthy/overweight
.775
Duration of current pregnancy,
if pregnant
.544
Woman currently breast-feeding
(reversed)
.655
Whether prenatal care received
& care type
.770
Number of prenatal care visits
.788
Delivery assistance
.808
Delivery took place at a medical
facility
.824
This procedure was conceptually equivalent to a
sequential canonical analysis (Gorsuch and Figueredo
1991, Figueredo and Gorsuch 2007), which controls
statistically for any indirect effects of the predictors
through the causally prior dependent variables.
In this study, the causal order of the dependent
variables was: 1) maternal socioeconomic resources
(MSR), 2) maternal biological condition (MBC), 3)
prenatal care for the lastborn child (PCL), 4) sex of
lastborn child (SEX), and 5) health-seeking for lastborn
child (HSL). The causal order for the dependent variables is presented in Figure 1. Note, however, that this
figure is simplified as it does not include the successive
main effects and interaction terms. The order for the
general linear models was:
Child’s diarrhea: untreated, absent,
or received treatment in last 2 wks
.243
Whether child received BCG
vaccination
.799
Whether child received measles
vaccination
.742
Total number of DPT vaccinations
received
.916
Total number of polio vaccinations
received
.894
Percent recommended vaccinations
received
.922
order. Because these dependent criterion variables were
expected to causally influence each other, they were
entered sequentially as criterion variables into a system
of multiple regression equations with each hierarchically prior criterion variable entered as the first predictor for the next. Thus, each successive dependent variable
was predicted from an initial set of ordered predictor
variables, each time entering the immediately preceding dependent variable hierarchically as the first predictor, then entering all the ordered predictors from the
previous regression equation. Thus, each successive
regression entered all of the preceding dependent variables in reverse causal order to statistically control for
any indirect effects that might be transmitted through
MSR = REGION + COUNTRY
MBC = MSR + REGION + COUNTRY
PCL = MBC + MSR + REGION + COUNTRY
SEX = PCL + MBC + MSR + REGION + COUNTRY
HSL = SEX + PCL + MBC + MSR + REGION + COUNTRY
The effect of region and country on maternal socioeconomic resources (MSR) were examined first, given
the prediction that they precede the measure for maternal condition (MBC) in explanatory power. That is,
socioeconomic factors are predicted to be region- and
country-specific. Resources available to each mother
are, in turn, predicted to influence her biological condition (as measured by MBC). The next relationship
examined is the effect of MSR and MBC in influencing
levels of prenatal care for each lastborn child (PCL).
The frequency and type of prenatal care are, in turn,
predicted to determine the sex of that same child (SEX).
Lastly, degree of health-seeking for the lastborn child
(HSL) is hypothesized to be predicted by the sex of that
child. The analyses also assess the residual effects of
each previous variable in the analysis along with interactions by region and country.
These interaction terms have been included to
measure whether the effect of any of the central predictors (MSR, MBC, PCL, SEX, and HSL) on the dependent variable of any given regression are moderated by
REGION or COUNTRY. For example, if an interaction
between REGION and MSR is significant and large in
predicting PCL, this would indicate that the manner in
which maternal resources impacts prenatal care varies
SONS OR DAUGHTERS ! GUGGENHEIM ET AL.
79
Figure 1. Causal order for the dependent variables.
as a function of regional or country differences. For this
study, the conditional associations – as measured by the
interaction terms – are crucial in determining to what
extent cultural variance is at play here in addressing an
evolutionary theory that has been difficult to demonstrate reliably in human populations. In essence, if the
Trivers-Willard hypothesis applies to humans over and
above what cultural differences may exist, then results
from interaction terms in the forthcoming analyses
should be nonsignificant or minimal. Additionally, as
referenced earlier, sample weights were used in the
analyses to ensure representative samples within each
country.
RESULTS
Thirty-five countries were sampled across four
regions: South Asia, Sub-Saharan Africa, the Caribbean/Latin America, and the Near East/North Africa.
For the mothers across all four regions: 91% reported a
male partner present, 13% were living in female-headed
households at the time of the interview, 14% were
malnourished, 10% were pregnant at the time of sampling, and 63% were breast-feeding. The mean age of
the women across all regions was 27.8 years, with the
average age at first marriage being 17.7 years and
education averaging 3.8 years.
For the 22% of the sample reporting information
for more than one child, the maximum number of
children currently residing at home with the mother and
under the age of five years was 4. The mean age of the
lastborn child across all women was 19.7 months.
Virtually all of the children (99%) were reported to have
been breast-fed at some point. Additionally, 79% received the BCG vaccination, 53% received the measles
vaccination, the average number of DPT and polio
vaccinations was 2, and the mean percent of agerecommended vaccinations received was 72%. Variations by region and country are discussed in Guggenheim
(2005).
Sex ratio of lastborn children
The number of males (65,190) to females (62,849)
across all regions in the final sample selected for analysis yields a sex ratio of 1.037 (51% boys, 49% girls).
Estimated sex ratios by country at birth and in the
current sample are presented in Appendix Table 3.
Preliminary interpretation of the discrepancy in the
national at-birth sex ratios from the sex ratios of the
current data was suggestive of interesting trends. Certain countries apparently deviate from the secondary
sex ratio by becoming unusually female-biased (e.g.,
Uganda) while others deviate by becoming unusually
male-biased (e.g., Turkey). Noteworthy, also, is that all
three Near Eastern/North African countries yield malebiased trends as well as India; notwithstanding, these
are the countries with prevalent son preference and
significantly high infanticide rates.
SONS OR DAUGHTERS ! GUGGENHEIM ET AL.
80
Figure 2. Results of cluster analysis.
Results reveal a distinctive pattern of three clusters, with the first break (Euclidean distance = 0.593)
across two groups separating countries by those that
decreased (or remained unchanged) in the number of
males relative to females and those that decreased in the
number of females relative to males within the underfive age group. The former group contains countries
with male and female-biased sex ratios while the latter
contains only male-biased countries. The second break
(Euclidean distance = .343) distinguishes between a
group that decreased in the number of males proportional to females, thereby producing female-biased sex
ratios, and a group that either decreased or did not
decrease in the number of males relative to females,
thereby producing male-biased sex ratios.
The cluster tree yields three distinctive groups (see
Figure 2). The top cluster, in italics, includes the female-biased countries that reflect a loss of males relative to females within the under-five age group. The
center of the tree identifies the male-biased countries
that did or did not decrease in number of males relative
to females. And the bottom group, in bold, contains the
countries that demonstrate a loss of females relative to
males. The cluster analysis implies a proportional decrease over time from the estimated at-birth sex ratio in
almost all countries.
Overview of modeling
Analyses involved a system of hierarchical regressions testing for the interaction and main effects of
region, country and the four scales: maternal socioeconomic resources, maternal biological condition, prenatal care for lastborn child, sex of lastborn child, and
health-seeking for lastborn child. Results are presented
for each dependent variable as it is predicted by each set
of independent variables and interactions following the
sequence depicted in Figure 1.
Maternal socioeconomic resourcesIn the first general linear model, region and country together account
for 22.6% of the variance in maternal socioeconomic
resources (see Table 2). In fact, region accounted for
13.8% while country accounted for almost 12%, as
indicated by R2 for each variable. Additionally, SubSaharan Africa had the lowest mean for the maternal
socioeconomic resources scale (M = -.32, SD = .82),
and Latin America and the Caribbean had the highest
mean (M = .50, SD = 1.05)
Maternal biological conditionIn the second general linear model, maternal socioeconomic resources
accounts for 11.6% of the variance in maternal biological condition. In fact, across all regions, increasing
maternal socioeconomic resources contributes toward
SONS OR DAUGHTERS ! GUGGENHEIM ET AL.
81
Tabl e 2. Predicting Maternal Socioeconomic Resources.
Tabl e 3. Predict ing Mat ernal Biological Condit ion
F
p
R2
Source
4
4983.65
.000
.138
MSR
COUNTRY
31
5 28 . 3 1
.000
.116
REGION
Model
35 1037.49
.000
.226
COUNTRY
Source
REGION
df
Error
124336
Total
124371
increased maternal biological condition but not all that
powerfully, as indicated by the small, positive standardized regression coefficient for the maternal socioeconomic resources scale (β = .066). The main effect for
region is large, with almost 18% of the variance in
maternal biological condition explained by systematic
differences between regions regarding maternal condition (Table 3).
Although significant, the interaction between maternal socioeconomic resources and region, as well as
the interaction between maternal socioeconomic resources and country, are trivial in effect. Maternal
socioeconomic resources by region accounts for 0.1%
of the variance in maternal biological condition, while
the interaction of country with maternal socioeconomic
resources accounts for 0.3% of the variance in maternal
biological condition. Small interactions such as these
merely demonstrate that the effect of maternal resources in explaining maternal biological condition
does not vary much – if at all – as a function of regional
or country differences but that in this case, some degree
of conditional variation by country and region is present
for this scale. This trend of trivial explanatory power is
evident through almost all of the conditional relationships tested within the sequence of regressions. Moreover, that the main effect for region is so large, in
comparison, indicates that almost 18% of the variance
in maternal biological condition is explained by systematic differences between regions regarding maternal condition.
What is interesting, on the other hand, is that
South Asia (β = -.566) differs from the other three
regions in the direction of its coefficient in this
regression: Sub-Saharan Africa (β = .097), Latin
America/Caribbean (β = .461), and the Near East/
North Africa (β= .768). In fact, Bangladesh ranks the
lowest (M = -.72 , SD = .85) in maternal biological
condition while Turkey has the highest mean (M =
.82, SD = .94) on that scale. Furthermore, of all the
regions, South Asia has the lowest mean maternal
biological condition (M = -.67, SD = .83) while the
Near East/North African region has the highest (M =
.73, SD = 1.0).
p
R2
1 15391.32
.000
.110
4
6697.18
.000
.177
31
44.00
.000
.011
3
51.53
.000
.001
COUNTRY
*MSR
31
10.79
.000
.003
Model
70
629.05
.000
.262
df
REGION
*MSR
F
Error
124301
Total
124371
Prenatal care for lastborn child
Table 4 reveals that maternal biological condition
accounts for 9% of the variance in prenatal care with a
small, yet positive, effect (β= .061). Maternal socioeconomic resources has the largest effect (37.1%) in predicting prenatal care for the lastborn child. Indeed,
increased maternal socioeconomic resources predicts
increased prenatal care (β= .706) across all countries.
Region, on the other hand, only predicts 5.5% of prenatal care, while country is the second strongest predictor
by explaining 13% of the variance (Table 4). Here it
Tabl e 4. Predicting Prenatal Care for Lastborn Child.
Source
df
F
p
R2
MBC
1 12222.63
.000
.090
MSR
1 73307.83
.000
.371
REGIO N
4
1802.21
.000
.055
31
600.21
.000
.130
3
2. 8 0
.038
.000
31
6.28
.000
.002
3
22.18
.000
.001
31
92.02
.000
.022
105
1090.17
.000
.479
CO UN TRY
REGIO N
*MBC
CO UN TRY
*MBC
REGIO N
*MSR
CO UN TRY
*MSR
Model
Error
124266
Total
124371
SONS OR DAUGHTERS ! GUGGENHEIM ET AL.
82
variance with almost all of the predictive power being
explained by region. In fact, maternal socioeconomic
resources only accounts for 0.1% of the variance in
predicting the sex of the lastborn child while region
explains 49.1% of the variance. Yet, MSR has a small,
positive effect such that higher levels of resources
predict lastborn daughters across all regions (β = .040).
Prenatal care is not significant in predicting the sex of
the lastborn child. Moreover, maternal biological condition does not contribute any detectable predictive
power to the sex of the lastborn child, given that R2
equals zero for this predictor as well.
should be noted that the interaction of country with
maternal socioeconomic resources represents the largest proportion – with only 2.2% predictive power – of
all the interactions in the entire set of regressions for the
interaction model, thereby demonstrating their minimal
explanatory contribution overall. Hence, 2.2% of the
variance in prenatal care is explained by maternal
socioeconomic resources varying as a function of country differences.
Nonetheless, in its entirety, the model testing the
effects of maternal condition, resources, region, and
country captures 47.9% of the variance in prenatal care
for the lastborn child. Across the four regions, South
Asia has the lowest mean prenatal care for the lastborn
child (M = -.28, SD = 1.0), while the Latin American/
Caribbean region has the highest mean (M = .44, SD =
1.0) across all countries once again. Note, however, that
the Near East/North African region is similar (M = -.27,
SD = 1.0) to South Asia concerning mean levels of
prenatal care.
The final regression in this series examines the effect
of region, country, the three scales, and the sex of the
lastborn child in determining health-seeking for that
child (Table 6) . Results show that, overall, male lastborn
children are more likely to receive treatment and immunizations; however, this effect is very small both
Sex of the lastborn child
in explanatory power (R2 = 0.1%) and magnitude (β
= -.050). The largest contribution (16.7%) in predicting
The next regression in this series tests for the effect
of country, region, maternal socioeconomic resources,
maternal biological condition, and prenatal care in
predicting the sex of the lastborn child. As Table 5
shows, the overall model accounts for 49.2% of the
Tabl e 5. Predict ing Sex of Last born Child.
Source
df
F
p
R2
PC L
1
2. 1 7
. 1 41
. 000
MBC
1
11.63
. 001
. 000
MSR
1
8 4. 3 1
. 000
.001
REGION
4 30004.76
. 000
.491
COUNTRY
31
1.33
. 1 03
.000
REGION*
PC L
3
. 75
.521
.000
COUNTRY
*P C L
31
1 . 23
. 1 79
.000
REGION*
MBC
3
5 . 62
.001
.000
COUNTRY
*MBC
31
1.76
. 006
.000
REGION*
MSR
3
. 93
.424
.000
COUNTRY
*MSR
31
. 99
. 479
. 000
1 40
8 5 9. 3 1
.000
.492
Model
Error
124231
Total
124371
Health-seeking for lastborn child
health-seeking across all four regions lies in prenatal care
for the lastborn child (β = .041), suggesting that higher
rates of prenatal care for the mother predict higher rates
of health-seeking for the lastborn child. Furthermore,
maternal biological condition has a small, yet significant,
positive predictive effect (β = .233) in explaining 4.2% of
the variance in health-seeking behaviors. Maternal socioeconomic resources account for only 1.1% of the
variance in health-seeking (β = .254); such that, as
resources increase, health-seeking for the lastborn child
increases. Region accounts for 0.8% of the variance in
health-seeking whereby the second strongest predictor is
country, accounting for 7.2% of the variance.
Note also that the conditional relationships between
prenatal care with region (1.6%) and prenatal care with
country (1.7%) are significant, albeit also small, indicating that a small proportion of explanatory power in
predicting health-seeking is contingent upon variations
in how country- and region-specific differences determine prenatal care. Conditional relationships between
maternal biological condition and maternal socioeconomic resources with country and region are even smaller
(ranging from 0.1 to 0.5%). Furthermore, conditional
relationships between sex of the lastborn child with
prenatal care, maternal biological condition, maternal
socioeconomic resources, region, and country were either nonsignificant or trivially small in predicting healthseeking for the lastborn child. As such, the entire model
accounts for 27.9% of the variance in health-seeking for
the lastborn child. What is more, South Asia has the
lowest average level of investment in health-seeking
behaviors (M = -.19, SD = 1.05) whereas mothers within
the Near East/North African region (M = .30, SD = .85)
demonstrate, on average, higher levels of preventative
behaviors toward their lastborn children.
SONS OR DAUGHTERS ! GUGGENHEIM ET AL.
83
Tabl e 6. Predicting Health-seeking for Lastborn Child
Source
df
F
p
R2
SEX
1
68.45
. 000
.001
PC L
1 24943.67
.000
. 1 67
MBC
1
5 3 78 . 3 0
.000
. 042
MSR
1
1428.27
.000
.011
MBC*SEX
1
.03
. 8 72
. 000
MSR*SEX
1
.51
.475
. 000
REGIO N
4
257.06
. 000
. 008
31
3 08 . 99
.000
. 072
3
14.82
. 000
.000
31
1 . 76
.006
.000
3
68 9 . 04
. 000
. 01 6
31
68.47
. 000
.017
3
66.96
.000
.002
31
5.30
.000
.001
3
48 . 3 6
.000
.001
31
20 . 06
.000
. 005
3
2 . 73
.042
.000
31
1.15
. 25 8
.000
3
1.54
.202
.000
CO UN TRY*
MBC*SEX
31
1.08
. 3 47
.000
REGIO N *
MSR* SEX
3
.32
.813
.000
CO UN TRY*
MSR*SEX
31
.77
.811
. 000
28 0
171.27
.000
.279
CO UN TRY
REGIO N *
SEX
CO UN TRY*
SEX
REGIO N *
PC L
CO UN TRY*
PC L
REGIO N *
MBC
CO UN TRY*
MBC
REGIO N *
MSR
CO UN TRY*
MSR
REGIO N *
PCL* SEX
CO UN TRY*
PCL* SEX
REGIO N *
MBC*SEX
Model
Error
124231
Total
1 243 7 1
DISCUSSION
Region and country predicted nearly 25% of the
variability in maternal socioeconomic resources (MSR)
and were significant predictors in each successive general linear model. That there are differences in resources across regions and countries is unsurprising,
and the amount of variance that region and country
accounted for in each GLM ranged from less than 10%
for HSL to nearly 50% for sex of the lastborn child.
MSR predicted 11% of maternal biological condition (MBC), which in turn accounted for 9% of prenatal
care for the lastborn child (PCL). Neither MSR, MBC
or PCL had any meaningful explanatory power for sex
of the lastborn child. Health-seeking for the lastborn
child (HSL), however, was predicted by PCL, MBC,
and to a lesser extent, by MSR and SEX of the lastborn
child. Interactions with region and country were generally small, indicating that the main effects of the predictor variables are relatively stable across geographic
areas. The exceptions were regarding health-seeking
behavior, whether prenatally or after birth. Namely,
MSR*country accounted for 2.2% of the variance in
PCL, and PCL*region and PCL*country accounted for
1.6% and 1.7% of the variance in HSL respectively.
Essentially, women who can obtain healthcare are doing so at differential rates by country and region.
Across the 35 countries there are two small effects
that provide minimal support for the TW hypothesis.
That is, MSR has a small, positive effect such that
higher levels of resources predict lastborn daughters
across all regions. Furthermore, male lastborn children
are more likely to receive treatment and immunizations.
Both are small effects, each accounting for 0.1% of the
variance in the analyses. However, the role of polygyny
and minimal parental investment by males is central to
the assumptions of the theory, and this condition may be
differentially met within the four regions. And, in fact,
interaction terms for HSL by region and country account for .8% and 7.2% of the variance, respectively.
Note, however, that the lack of conditional relationships between sex of the lastborn child and prenatal
care, maternal biological condition, maternal socioeconomic resources, region, and country in predicting
health-seeking limit any additional support for the TW
hypothesis.
As well-established by previous research, males
are born at a higher frequency, with the average sex
ratio in humans estimated to be 1.06 males born for
every female (Cartwright 2000). However, variations
across countries exist and a comparison of the current
sample sex ratio with national estimates thereof, yielded
three evident patterns: one where there is a drop in the
number of males but they continue to outnumber females; one where males decrease so drastically within
the first five years of life, that the sex ratio becomes
female-biased; and one where males initially predominate in number and increase, due to what can only be
attributed to a loss of females. The observation that
good condition mothers in male-biased countries show
a tendency toward producing daughters and that there
are not enough daughters in these regions to account for
this pattern, is an important point of contrast.
All things considered, though, while the descriptive characteristics initially presented confirm foreseeable demographic differences, the system of
SONS OR DAUGHTERS ! GUGGENHEIM ET AL.
regressions used in this research places those differences and similarities into perspective regarding an
evolutionary framework as provided by the TW hypothesis. Namely, the fact that Sub-Saharan Africa
deviates somewhat from the pattern found for the
other three regions is potentially explained by the
higher degree of polygyny, on average, across those
countries. That is, polygyny may afford more autonomy to wives than monogamy does and the
husband’s influence is even more denounced the higher
the women’s education is (Dodoo 1998). Indeed, there
is substantial variation in the degree of polygyny
across these countries. According to DHS+ information, almost 51% of women in Burkina Faso report
having two or more co-wives present and 46% of
women from Senegal report having one other wife
present. In contrast, 96% of women in Madagascar are
not in a polygynous union and the same for 75% of
women in Comoros. Furthermore, of those 25% who
have a co-wife, 71% only had one co-wife, according
to another study (Althaus 1997).
It would seem that polygyny can leave almost half
of its male population without a mate. As such, the
trend for the African countries in this data would
suggest a mild Trivers-Willard effect in that if mothers
have the resources, they produce sons. Otherwise,
even when in good physical condition, they invest in
the production of daughters. Note, however, that the
effect for the resources was somewhat greater than
that of the effect for physical condition. As such, this
is the closest this analysis seems to come in supporting
the TW hypothesis across thirty-five countries.
Anderson and Crawford (1993) as well as Carranza
(2002) provide critical reviews of how predictions
stemming from the TW hypothesis require modification in order to be appropriately applied to humans.
Freese and Powell (1999) did not find support for the
TW hypothesis in humans either, i.e., no sex-biased
differences in socioeconomic investment, but failed to
mention paternal care as a potential confound. They
conclude with: “...the evidentiary burden would now
seem to fall upon those who might attribute our findings to the exceptional character of American society
rather than to the more fundamental limitations of the
theory” (p. 1737), citing sociologists’ “professional
obligation... to test [sociobiological] hypotheses fairly
and rigorously” (p. 1738). It is hoped that was accomplished here.
However, this research used self-report data and,
as such, is prone to typical biases and errors common
to such methodologies. For example, standardization,
whether in technique or measurement, limits the possibilities of responses and can create demand characteristics. While selection of the lastborn child may
capitalize on enhanced memory recall, a focus on the
oldest child might serve as a better source for measuring parental investment. In addition, previous studies
84
on birth order indicate that selection of the lastborn
child may be confounded by the possibility that the
favored child is being studied (Rohde et al. 2003).
While no noticeable sex-biased differences were apparent across diarrhea treatment-seeking behaviors,
this could in fact be due to reporting error alone. In
addition, particularly relevant information concerning polygynous pairings was missing from the data at
hand. Finally, definitions of parenting can vary across
both monogamous social structures and in settings
where alloparenting is normative.
These analyses would have been more informative if child nutritional status as a measure of parental
investment were included, along with indicators of
maternal condition at preconception rather than postnatally. Analysis of reproductive success of offspring
would also enable a more representative test of the
original theory. It would also be interesting to test the
sex ratio against expected averages at the primary and
secondary levels; namely, 3 months post-conception
(1.2:1) vs. at birth (1.06:1) sex ratios (from Cartwright
2000) to examine the juncture at which sex ratio
manipulation might be occurring.
The foremost issue is the notion of condition –
what exactly did Trivers and Willard mean, and why
have so many different interpretations been applied to
it? If they meant weight and body size for polygynous
animals where parental care is trivial, then the theory
can be tested on those species and probably should not
be expected to fit others. Which is to say, with so many
constraints involved in terms of “well-defined conditions” (Trivers and Willard 1973, p. 90) is it really that
surprising that there is so much variation not only in
the results that have been revealed across species but
in the extent to which this theory has been applied?
To the extent that other studies recommend that a
true analysis of the TW hypothesis would have to test
for whether maternal condition predicts offspring reproductive success and then examine the variance of
reproductive success between siblings, it is slightly
odd that more studies have not actually done this; that
is to say, on species where reproductive success can be
tracked. To test for sex of offspring alone, in the
absence of the predicted covariates, may actually omit
parts of the theory.
Ultimately, there is something inherently poetic
in the original theory of Trivers and Willard. That is,
the idea that maternal effects are retained from development into adulthood is reminiscent of a comment by
the poet Rainer Maria Rilke: “Perhaps we are our
childhood still, for as St. Augustine said, ‘Whither
should it have gone?” Yet, at the end of the day, the
specific relevance of the Trivers and Willard hypothesis to humans may be, after all, just another storm in
a teacup and better relegated to species that satisfy
most, if not all, of the assumptions laid out in the
original paper.
85
ACKNOWLEDGEMENTS
Our appreciation to MEASURE DHS+ for their
assistance during this project and to Dr. Lisa Smith for
access to data drawn from the demographic and health
surveys of thirty-five countries. The original survey
data referenced in this article can be obtained through
www.measuredhs.com.
LITERATURE CITED
ALEXANDER, R. D. 1988. Evolutionary approaches to
human behavior: What does the future hold? Pp.
317-341 in L. Betzig, M. Borgerhoff Mulder, and
P. Turkey, eds., Human reproductive behavior: A
Darwinian perspective. Cambridge University
Press, Cambridge.
ALTHAUS, F. 1997. Comorian women delay marriage
and first birth, but fertility remains high. International Family Planning Perspectives 23:187-189.
ANDERSON, J. L., AND C. B. CRAWFORD. 1993. TriversWillard rules for sex allocation: When do they
maximize expected grandchildren in humans?
Human Nature 4:137-174.
ANDERSSON, M. B. 1994. Sexual selection. Princeton
University Press, New Jersey.
ARMITAGE, K. B. 1987. Do female yellow-bellied marmots adjust the sex ratios of their offspring?
American Naturalist 129:501-519.
BEISE, J., AND E. VOLAND. 2002. Differential infant
mortality viewed from an evolutionary biological
perspective. The History of the Family 7:515-526.
BORGERHOFF MULDER, M. 1998. Brothers and sisters:
How sibling interactions affect optimal parental
allocations. Human Nature 9:119-161.
BREWIS, A. A. 1993. Sex ratios at birth in a Micronesian
Atoll population. Social Biology 40:207-213.
BROWN, G. R. 2001. Sex-biased investment in nonhuman primates: Can Trivers and Willard’s theory
be tested? Animal Behaviour 61:683-694.
BROWN, G. R., AND J. B. SILK. 2002. Reconsidering the
null hypothesis: Is maternal rank associated with
birth sex ratios in primate groups? Proceedings of
the National Academy of Sciences 99:1125211255.
CALEY, M. J., AND T. D. NUDDS. 1987. Sex ratio
adjustment in Odocoileus: Does local resource
competition play a role. American Naturalist
129:452-457.
CAMERON, E. Z. 2004. Facultative adjustment of mammalian sex ratios in support of the Trivers-Willard
hypothesis: Evidence for a mechanism. Proceedings of The Royal Society of London. Series B.
Biological Sciences 271:1723-1728.
CARRANZA, J. 2002. What did Trivers and Willard
really predict? Animal Behaviour 63:F1-F3.
SONS OR DAUGHTERS ! GUGGENHEIM ET AL.
CARTWRIGHT, J. 2000. Evolution and human behavior: Darwinian perspectives on human nature.
MIT Press, MA.
CHACON-PUIGNAU, G. C., AND K. JAFFE. 1996. Sex
ratio at birth deviations in modern Venezuela:
The Trivers-Willard effect. Social Biology
43:257-270.
CIOFI, C., AND I. R. SWINGLAND. 1997. Environmental
sex determination in reptiles. Applied Animal
Behaviour Science 51:251-265.
CLUTTON-BROCK, T. H., AND G. R. IASON. 1986. Sex
ratio variation in mammals. The Quarterly Review of Biology 61:339-374.
CRONIN, H. 1991. The ant and the peacock: altruism
and sexual selection from Darwin to today. Press
Syndicate of the University of Cambridge, New
York
COCKBURN, A. 1994. Adaptive sex allocation by brood
reduction in antechinuses. Behavioral Ecology
and Sociobiology 35:53-62.
COCKBURN, A., S. LEGGE., AND M. C. DOUBLE. 2002.
Sex ratios in birds and mammals: Can the hypotheses be disentangled? Pp. 266-286 in I. C.
W. Hardy, ed., Sex ratios: Concepts and research methods. Cambridge University Press,
Cambridge.
DALY, M., AND M. WILSON. 1984. A sociobiological
analysis of human infanticide. Pp. 487-502 in G.
Hausfater and S. B. Hrdy. eds., Infanticide: Comparative and evolutionary perspectives. Aldine
Pub. Co., New York.
DICKEMANN, M. 1979. Female infanticide, reproductive strategies, and social stratification: A preliminary model. Pp. 321-367 in N. A. Chagnon
and W. Irons eds., Evolutionary biology and
human social behavior: An anthropological perspective. Duxbury Press, MA.
DODOO, F. N. 1998. Marriage type and reproductive
decisions. A comparative study in Sub-Saharan
Africa. Journal of Marriage and the Family
60:232-242.
ELLISON, P. T. 2001. On fertile ground. Harvard
University Press, Cambridge, MA.
FERNÁNDEZ-LLARIO, P., J. CARRANZA, AND P. MATEOSQUESADA. 1999. Sex allocation in a polygynous
mammal with large litters: The wild boar. Animal Behaviour 58:1079-1084.
FIGUEREDO, A. J. AND R. GORSUCH. 2007. Assortative
mating in the jewel wasp: 2. Sequential canonical analysis as an exploratory form of path analysis. Journal of the Arizona Nevada Academy of
Sciences 39(2):59-64.
FREEMAN, D., B. WACHOCKI, D. GOLDSCHLAGE, AND H.
MICHAELS. 1994. Seed size and sex ratio in spinach: Application of the Trivers-Willard hypothesis to plants. Ecoscience 1:54-63.
SONS OR DAUGHTERS ! GUGGENHEIM ET AL.
FREESE, J., AND B. POWELL. 1999. Sociobiology, status,
and parental investment in sons and daughters:
Testing the Trivers-Willard hypothesis. The
American Journal of Sociology 104:1704-1743.
GORSUCH, R. L., AND A. J. FIGUEREDO. 1991. Sequential
canonical analysis as an exploratory form of path
analysis. Paper presented to the American Evaluation Association Conference, Chicago, IL.
GOSDEN, R. G., R. I. M. DUNBAR, D. HAIG, E. HEYER, R.
MACE, M. MILINSKI, B. PICHON, H. RICHNER, B. I.
STRASSMANN, D. THALER, C. WEDEKIND, AND S. C.
STEARNS. 1999. Evolutionary interpretations of
the diversity of reproductive health and disease.
Pp. 108-120 in S. C. Stearns ed., Evolution in
health and disease. Oxford University Press,
Oxford.
GOSLING, L. M. 1986. Selective abortion of entire
litters in the coypu: Adaptive control of offspring
production in relation to quality and sex. American Naturalist 127:722-795.
GRANT, V. J. 1998. Maternal personality, evolution,
and the sex ratio: Do mothers control the sex of
the infant? Routledge, London.
GUGGENHEIM, C. B. 2005. Lost daughters and fragile
sons: Patterns of differential parental investment
across thirty-five countries. Dissertation. University of Arizona, Tucson.
HÄDER, S., AND S. GABLER. 2003. Sampling and estimation. Pp. 117-134 in J. Harkness, F. J. R. van de
Vijver, and P. P. Mohler eds., Cross-cultural
survey methods. Wiley & Sons, New Jersey.
HAIG, D. 1999. Genetic conflicts of pregnancy and
childhood. Pp. 77-90 in S. C. Stearns ed., Evolution in health and disease. Oxford University
Press, Oxford.
HAMILTON, W. D. 1967. Extraordinary sex ratios. Science 156:477-488.
HARDY, I. C. W. 1997. Possible factors influencing
vertebrate sex ratios: An introductory overview.
Applied Animal Behaviour Science 51:217-241.
HARDY, I. C. W. 2002. Sex ratios: Concepts and
research methods. Cambridge University Press,
Cambridge.
HEWISON, A. J. M., AND J. GAILLARD. 1999. Successful
sons or advantaged daughters? The TriversWillard model and sex-biased maternal investment in ungulates. Trends in Ecology and
Evolution 14:229-234.
HOFER, H., AND M. L. EAST. 1997. Skewed offspring
sex ratios and sex composition of twin litters in
Serengeti spotted hyaenas (Crocuta crocuta) are a
consequence of siblicide. Applied Animal
Behaviour Science 51:307-316.
HRDY, S. B. 1987. Sex-biased parental investment
among primates and other mammals: A critical
evaluation of the Trivers-Willard hypothesis. Pp.
97-147 in R. J. Gelles and J. B. Lancaster eds.,
86
Child abuse and neglect: Biosocial dimensions.
Aldine De Gruyter, New York.
HRDY, S. B. 1988. Levels of complexity in the study
of "adaptive" sex ratios. Pp. 147-163 in G.
Greenberg and E. Tobach eds., Evolution of
social behavior and integrative levels. The T. C.
Schneirla conference series, vol. 3. Lawrence
Erlbaum Associates, Inc., New Jersey.
HRDY, S. B. 1999. Mother nature: A history of mothers, infants, and natural selection. Pantheon
Books, New York.
HUCK, U. W., J. B. LABOV, AND R. D. LISK . 1986. Food
restricting young hamsters (Mesocricetus
auratus) affects sex ratio and growth of subsequent offspring. Biology of Reproduction 35:592598.
JAMES, W. H. 1987. The human sex ratio. A review of
the literature. Human Biology 59:721-752.
JEFFERY, R., P. JEFFERY, AND A. LYON. 1984. Female
infanticide and amniocentesis. Social Science
and Medicine 19:1207-1212.
KELLER, M. C., R. M. NESSE, AND S. HOFFERTH. 2001.
The Trivers-Willard hypothesis of parental investment: No effect in the contemporary United
States. Evolution and Human Behavior 22:343360.
KERR, M. G. 1971. Prenatal mortality and genetic
wastage in man. Journal of Biosocial Sciences
3:223-237.
KOJOLA, I. 1997. Social status and physical condition
of mother and sex ratio of offspring in cervids.
Applied Animal Behaviour Science 51:267-274.
KRACKOW, S. 1993. The effect of weaning weight on
offspring fitness in wild house mice (Mus musculus domesticus): A preliminary study. Ethology 95:76-82.
KUCERA, T. E. 1991. Adaptive variation in sex ratios
of offspring of nutritionally stressed mule deer.
Journal of Mammalogy 72:745-749.
LARME, A. 1997. Health care allocation and selective
neglect in rural Peru. Social Science and Medicine 44:1711-1723.
LAZARUS, J. 2002. Human sex ratios: Adaptations and
mechanisms, problems and prospects. Pp. 287311 in I. C. W. Hardy ed., Sex ratios: Concepts
and research methods. Cambridge University
Press, Cambridge.
MADZINGIRA, N. 1993. Malnutrition in children under
five in Zimbabwe: Effect of socioeconomic factors and disease. Social Biology 40:239-246.
MACRO INTERNATIONAL INC. 1996. Sampling manual.
DHS-III Basic Documentation No. 6. Calverton,
Maryland.
MEALEY, L., AND W. MACKEY. 1990. Variation in
offspring sex ratio in women of differing social
status. Ethology and Sociobiology 11:83-95.
87
MILLER, B. D. 1987. Female infanticide and child neglect in rural North India. Pp. 95-112 in N. ScheperHughes ed., Child survival: Anthropological
perspectives on the treatment and maltreatment of
children. Kluwer Academic Publishers, MA.
ORC MACRO. 2005. Demographic and Health Survey
Interviewer’s Manual. MEASURE DHS Basic
Documentation No. 3. ORC Macro, Calverton,
MD.
PEDERSEN, F. A. 1991. Secular trends in human sex
ratios: Their influence on individual and family
behavior. Human Nature 2:271-291.
ROHDE, P. A., ATZWANGER, K., BUTOVSKAYA, M., LAMPERT,
A., M YSTERUD, I., S ANCHEZ -A NDRES , A., AND
SULLOWAY, F. J. 2003. Perceived parental favoritism, closeness to kin, and the rebel of the family:
The effects of birth order and sex. Evolution and
Human Behavior 24:261-276.
ROSENFELD, C. S., AND R. M. ROBERTS. 2004. Maternal
diet and other factors affecting offspring sex ratio:
A review. Biology of Reproduction 71:1063-1070.
SCHEPER-HUGHES, N., AND C. SARGENT. 1998. Introduction: The cultural politics of childhood. Pp. 1-33 in
N. Scheper-Hughes and C. Sargent eds., Small
wars: The cultural politics of childhood. University of California Press, Berkeley, CA.
SCRIMSHAW, S. C. M. 1984. Infanticide in human populations: Societal and individual concerns. perspectives Pp. 439-462 in G. Hausfater and S. B. Hrdy
eds., Infanticide: Comparative and evolutionary
Aldine Pub. Co., New York.
SHELDON, B. C., AND S. A. WEST. 2004. Maternal dominance, maternal condition, and offspring sex ratio
in ungulate mammals. The American Naturalist
163: 40-54.
SMITH, L. C., U. RAMAKRISHNAN, A. NDIAYE, L. HADDAD,
AND R. MARTORELL. 2003. The importance of
women’s status for child nutrition in developing
countries. IFPRI Research Report #131. International Food Policy Research Institute, Washington, D.C.
TRIVERS, R. L. 1972. Parental investment and sexual
selection. Pp. 1871-1971 in B. G. Campbell ed.,
Sexual selection and the descent of man. Aldine
Pub. Co., Chicago.
TRIVERS, R. L. 2002. Natural Selection and Social
Theory: Selected Papers of Robert L. Trivers.
(Evolution and Cognition Series) Oxford University Press, Oxford.
TRIVERS, R. L., AND D. E. WILLARD. 1973. Natural
selection of parental ability to vary the sex ratio of
offspring. Science 179:90-91.
VAN HOOFF, J. A. R. A. M. 1997. The socio-ecology of
sex ratio variation in primates: Evolutionary deduction and empirical evidence. Applied Animal
Behaviour Science 51:293-306.
SONS OR DAUGHTERS ! GUGGENHEIM ET AL.
WEBSTER, G. D. 2004. Risk, reproductive variance, and
the Trivers-Willard effect: A non-linear approach.
Paper presented at the annual conference of the
Human Behavior and Evolution Society, Berlin,
Germany.
WILEY, D. N., AND P. J. CLAPHAM. 1993. Does maternal
condition affect the sex ratio of offspring in humpback whales? Animal Behaviour 46:321-324.
SONS OR DAUGHTERS ! GUGGENHEIM ET AL.
88
Appendi x Tabl e 1. Final sam ple size by region and count ry.
Year of
collection
South Asia
Sub- Saharan Africa
Latin America & Caribbean
N ear East & N orth Africa
Unweighted
Bangladesh
1997
3,730
India
1998
21,839
N epal
1996
3,246
Weighted (by
sample weight)
3,505
21,487
3,292
Total = 28,815
Total = 28,284
Benin
1996
2,037
2,037
Burkina Faso
1993
2,887
1 ,8 3 1
Cameroon
1998
1,411
1,411
Central African Republic
1995
1,967
1,967
Chad
1996
3,789
3,789
Comoros
1996
732
732
Cote d'Ivoire
1994
3,025
3,025
Ghana
1998
1,467
1 ,3 5 6
K enya
1998
2,474
2,386
Madagascar
1997
2,510
2,431
Malawi
1992
2,230
1,953
Mali
1996
3,933
3,619
Mozambique
1997
2,539
2,218
N amibia
1992
1,750
1,543
N iger
1998
3,443
3,762
Senegal
1997
2,615
2,615
Tanzania
1996
3,518
3,820
Togo
1998
3,076
3,249
Uganda
1995
3,290
3,079
Zambia
1996
3,834
3,827
Zimbabwe
1994
1,860
1,997
Total = 54,387
Total = 52,647
Bolivia
1998
3,073
2,874
Brazil
1996
3,034
2,873
Columbia
1995
3,368
3,517
Dominican Republic
1996
2,211
1,890
Guatemala
1995
5,361
5,471
Haiti
1995
1,805
1,805
N icaragua
1998
4,728
5,137
Peru
1996
10,741
9,960
Total = 34,321
Total = 33,527
Egypt
1995
5,049
4,371
Morocco
1992
3,116
3,116
Turkey
1993
2,351
2,426
Total = 10,516
Total = 9,913
SONS OR DAUGHTERS ! GUGGENHEIM ET AL.
89
Appendi x Tabl e 2. List of v ariables used in scales.
Scale name & code
Maternal Socioeconomic Resources (MSR)
Description of variables & coding scheme
*
Whether woman lives in urban or rural residence
rural=0, urban=1
*
Household economic status
destitute=0, poor=1, middle=2, rich=3
* Woman’s years of education
Male partner’s educational value
* measured in years if partner present, otherwise equals 0 if no
male partner present or if male partner has 0 years of education
Maternal Biological Condition (MBC)
*
Woman’s body mass index (BMI)
weight in kg/square of height in meters
*
Whether woman is malnourished, healthy, or overweigth
malnourished=0, healthy=1, overweight=2
*
Duration of current pregnancy (if pregnant)
not pregnant=0, otherwise recorded as months (max=10)
Whether woman is currently breast- feeding
* not breast- feeding=0, currently breast- feeding=1
note: reverse- coded for predicted negative effect
Prenatal Care for Lastborn Child (PCL)
Whether woman received prenatal care & type of care
did not receive prenatal care=0, received prenatal care but not
*
from medically trained person=1, received prenatal care from
medically trained person=2
*
N umber of prenatal care visits
no visits=0, otherwise equals number of visits recorded
Whether delivery was assisted & type of assistance
* delivery was not assisted=0, delivery was assisted=1, delivery
was assisted by medically trained person2
*
Health- seeking for Lastborn Child (HSL)
Whether delivery took place at a medical facility
delivery not at medical facility=0, delivery at medical facility=1
Child’s diarrhea in last 2 weeks
* child had diarrhea and was not treated=0, no diarrhea=1, had
diarrhea and was treated=2
*
Whether child received BCG vaccination
no=0, yes=1
*
Whether child received measles vaccination
no=0, yes=1
*
Total number of DPT vaccinations
ranges from none=0 to max=3
*
Total number of polio vaccinations
ranges from none=0 to max=3
*
Percent of age- recommended vaccinations child received
ranges from 0 to 100
SONS OR DAUGHTERS ! GUGGENHEIM ET AL.
90
A ppendi x Tabl e 3. Sex rat ios by region and count ry.
South Asia
Sub- Saharan Africa
Latin America &
C aribbean
N ear East & N orth
Africa
N ational est. sex
ratio (at birth)
C urrent sample (lastborn child <5 yrs)
Bangladesh
1 . 06
1.04
India
1 . 05
1.12
N epal*
1.05
1.07
Benin
1 . 03
1.02
Burkina Faso
1 . 03
1.03
C ameroon
1 . 03
1.04
C entral African Republic
1 . 03
1.02
C had
1 . 04
0.99
C omoros
1 . 03
1.11
C ote d'Ivoire
1 . 03
1.02
Ghana
1.03
0.99
K enya
1.03
0.99
Madagascar
1.03
1.03
Malawi
1.03
0.99
Mali
1.03
0.95
Mozambique
1.03
0.97
N amibia
1.03
0.99
N iger
1.03
1.09
Senegal
1.03
0.95
Tanzania
1 . 03
1 . 03
Togo
1 . 03
0.99
Uganda
1.03
0.93
Zambia
1 . 03
0.97
Zimbabwe
1 . 03
0.97
Bolivia
1 . 05
1.02
Brazil
1 . 05
1.00
C olumbia
1 . 03
1.05
Dominican Republic
1.05
1.05
Guatemala
1.05
1.02
Haiti
1.05
1.07
N icaragua
1.05
0.98
Peru
1.05
1 . 03
Egypt
1 . 05
1.11
Morocco
1.05
1.11
Turkey
1 . 05
1.14
* Indicates the only country where female infant mortality exceeds male infant mortality (72.27
deaths/ 1,000 live births to 68.95 deaths/1,000 live births). N ote: Information for sex ratios based
on 2003 estimates posted at C IA website for "Worldbook of Facts."
SEXUAL RISK BEHAVIOR t ADAM AND MUTUNGI
91
SEXUAL RISK BEHAVIOR AMONG KENYAN
UNIVERSITY STUDENTS
M ARY B. A DAM , University of Arizona Departments of Pediatrics & Psychology, Tucson, Arizona 85721
M IKE M UTUNGI , I Choose Life – Africa, P.O. Box 5166, Nairobi 00100, Kenya
ABSTRACT
Previous HIV/AIDS assessments in Kenya have focused on the sexual risk behaviors of the general population
and the HIV serostatus of women of childbearing age. No data are available for Kenyan university students. Baseline
surveillance data were obtained from a representative sample of 1917 university students at Moi University in Eldoret,
Kenya. Both qualitative (focus group discussions) and quantitative (self-administered questionnaire) data were
collected. Students were asked about their HIV knowledge, perceptions, and sexual risk behaviors. Seventy-one %
of males and 47.6% of females reported having had sex. Only 49% of university students reported any HIV/AIDS
education. Of those who reported having ever had sex, 76% reported ever using a condom: only 18% of males and
14% of females reported using a condom every time they had sex in the last month. Eighty-nine% of students reported
thinking they were at risk for HIV infection, but only 28% of subjects had been tested for HIV. Qualitative data suggest
the issue of HIV testing remains very controversial among students, in large part because of societal stigma. The results
of this study emphasize the vulnerability of university students to HIV infection. Most university students have not
had access to accurate HIV/AIDS information. Sexual activity on campus is high and many students consider
themselves at risk. Conversely, consistent condom use and rates of voluntary counseling and testing are low.
INTRODUCTION
Sub-Saharan Africa contains only 10% of the
world's population but accounts for more than two
thirds of the worlds HIV infected people (DeCock et al.
2002). Of the more than 25 million people who have
died from AIDS worldwide, more than 14 million are
from Africa. Transmission of HIV in Kenya and all of
sub Saharan Africa is primarily heterosexual (Hayes
and Weiss 2006). Tracking the epidemic has been
important for program and policy development (Mertens
and LowBeer 1996, Hayes and Weiss 2006).
Monitoring HIV prevalence and sexual behaviors
has grown from sentinel surveillance toward representative household surveys (Rugg, et al. 2004, Gouws et
al. 2006, Garcia-Calleja et al. 2006). The World Health
Organization used sentinel surveillance to estimate that
HIV seroprevalence rates for Kenya ranged from 4.7%9.6%, with HIV prevalence in pregnant women in some
cities as high as 30% (World Health Organization
2004). The use of household surveys has made it
possible to more closely link HIV prevalence to specific
risk behaviors. A 2002 Kenyan population based survey of persons age 15-49 from randomly selected
households in Mombasa reports an overall HIV
seroprevalence of 10.8%. Eighty-eight% of men and
83% of women reported they had ever had sex (Hawken
et al. 2002). HIV-1 prevalence in the 20-24 age groups
was 1.2% for men and 15.6% for women. This rises
dramatically in the 25-29 age groups where rates are
8.4% for men and 20.7% for women.
Forty five percent of men and 22% of women who
were sexually experienced reported ever using a con-
dom. Men aged 20-24 reported the highest rate of ever
using a condom at 59%. However, consistent condom
use was substantially lower. Only 0.2% of men ages 1549 reported consistent condom use with their regular
partner. Condom use rates increased to 10% with steady
partners and to 36% with casual partners (Hawken et al.,
2002). Low rates of consistent condom are also found
in many other countries (Hearst and Chen 2004).
These data, while informative for the general population, have limited usefulness in assessing the impact
of interventions for specifically targeted sub-populations (Rehle et al. 2004, Peersman and Rugg 2004).
Data from university students in other countries are
available but may not be sufficiently sensitive to local
contextual and cultural factors to serve as a baseline to
evaluate Kenyan interventions (Terry et al. 2006,
Baggaley et al. 1997, Uwalaka and Matsuo 2002).
Since university students represent the future business,
educational, and government leaders they are an important target group for HIV interventions. The potential to
multiply the impact of an effective intervention in
university students is high because will they will graduate and move into all regions of the country.
In response to the needs for HIV prevention in this
strategically important target group, ICL began providing HIV prevention services for University students in
2003. ICL uses a theoretical framework to develop
interventions with the goal of training students to (1)
have accurate knowledge about HIV/AIDS, (2) implement strategies to personally reduce or eliminate their
risk of HIV/AIDS acquisition, (3) implement interventions among their peers to reduce or eliminate an individuals' risk of HIV/AIDS acquisition, (4) strategically
influence organizations and institutions to implement
ADAM, M. B., AND M. MUTUNGI. 2007. SEXUAL RISK BEHAVIOR AMONG KENYAN UNIVERSITY
STUDENTS. JOURNAL OF THE ARIZONA-NEVADA ACADEMY OF SCIENCE 39(2)91-98
SEXUAL RISK BEHAVIOR t ADAM AND MUTUNGI
policies which reinforce sexual health and reduce sexual
violence, and (5) organize events to promote voluntary
testing and counseling and testing (VCT). The objectives of the quantitative and qualitative survey were to
(1) examine current levels of knowledge of HIV/AIDS,
(2) assess the perceptions of university students toward
HIV testing and people living with HIV/AIDS, (3)
assess risky sexual behaviors among university students, (4) examine experience with HIV/AIDS education especially as it relates to VCT, and (5) identify
strategic areas for future intervention.
92
Tabl e 1. Dem ographics.
Sample Characteristics
Gender
Age
Religious Affiliation
METHODS
Procedure
Moi University students from the Main and
Chepkoilel campuses, in Eldoret, completed a confidential, anonymous questionnaire regarding their HIV
knowledge, perceptions and sexual risk behaviors. The
questionnaire is included in the Appendix. Almost all
students live in dormitories. Since dormitories are
grouped by gender, year, and field of study, recruitment
procedures utilized this grouping. Dormitories were
selected to ensure a representative sample. Trained
study personnel collected data over a 5-day period.
Data were recorded on standard forms and then entered
into SPSS 11.5. Data were entered using the SPSS 11.5.
Trained facilitators conducted eight focus groups
that were either all male or all female and were composed of seven to ten persons. Participants were identified by number rather than by name to ensure
confidentiality. Sessions were conducted in English,
lasted 1.5 hours, and were recorded. Transcriptions of
the sessions were reviewed and primary themes were
extracted and paired with illustrative quotations from
students.
The study was approved by the Moi University
research council. Participation was voluntary. Informed
consent was obtained by all participants. The survey
was publicly supported by the University administration. Since there was strong administration support
almost none of those students invited to participate
refused (estimated fewer than 5%).
Major
N
Males
945
49.3
Females
972
50.7
Less than 20
22 2
11.6
21 - 22
994
51.9
23- 24
552
28 . 8
O ver 25
149
7. 8
Christian
1 8 28
94.2
Muslim
75
5.2
O ther
14
0. 4
Science & Engineering
693
36.2
Education
511
26.7
Arts
331
17.3
Medicine
210
11.0
78
4.1
Law
O ther
Year of Study
93
4.7
First year
586
30.6
Second year
456
23 . 8
Third year
46 9
24.5
Fourth year
400
20.9
6
0.2
1752
91.4
Fifth year
Marital Status
Single
Married
92
4. 8
O ther
73
3.8
years. Males reported earlier sexual debut than females;
with 26% of males and 10% of females reporting sexual
debut in secondary school (Figure 1).
Thirty-seven percent of first year females reported
previous sexual experience compared with 63% of
males. For both genders rates of sexual activity varied
with their year at the university and the highest rate of
RESULTS
Quantitative Results
Of the 1,917 students surveyed, 51% were female,
52% were age 21-22, and 31% were first year students.
Almost all (94%) reported Christian religious affiliation, and 91% reported being single (Table 1).
Seventy one percent of males and 47.6% of females reported having had sexual intercourse. Thirty
percent of students reported their first sexual experience during secondary school, 26% during the years
between secondary school and joining the university
(typically two years), and 19% during their university
%
Figure 1. Gender and age at first sex.
SEXUAL RISK BEHAVIOR t ADAM AND MUTUNGI
93
sexual activity in the last 12 months was for second year
students (Table 2).
Of those who reported having ever had sex, 76%
reported ever using a condom. Eighteen percent of
males and 14% of females reported using a condom
every time they had sex in the last month, indicating that
consistent condom use is uncommon (Figure 2).
Tabl e 2. Sex ual act iv it y in t he last 12 mont hs
by gender and year of st udy.
Year
Male (%)
Female (%)
First
62.9
37.3
Second
67.2
51 . 0
Third
78 . 6
55.2
Forth
75.3
5 2. 4
HIV/AIDS Knowledge
Fewer than half of University students reported
previous HIV/AIDS training. Of those who had received previous training, 77.1% reported training on
HIV transmission, 69.1% reported training on effects of
HIV, but only 28.8% reported receiving information on
treatment of HIV. Many students reported significant
knowledge of someone who had died of AIDS. Over
half knew of a neighbor who had died of AIDS and
nearly one third had a family member who had died of
AIDS.
HIV Risk Perception and Testing
Eighty-nine percent of students reported personally believing they were at risk for HIV infection; with
38% thinking their risk for HIV infection was high or
moderate. Students were asked, "Which of the follow-
ing are ways that you personally feel you could become
infected with HIV?" More than three-fourths reported
they could become infected through intercourse without a condom, unsterilized instruments in dental work,
and sharing of needles in piercing or tattooing. Nearly
two thirds believed they were at risk from deep mouth
kissing and about one third from sexual intercourse
with a condom (Table 3). Behaviors that put one at risk
for HV include sexual intercourse with an infected
person, contact with HIV contaminated medical or
dental instruments due to improper sterilization technique, and IV drug use with HIV contaminated needles.
Tabl e 3 . Personal belief s on suscept ibilit y t o
inf ect ion.
Ways you feel you might become infected with HIV
%
Sexual intercourse without a condom
88
Unsterilized instruments in dental work
86
Sharing of needles in piercing or tattooing
76
Deep mouth kissing
59
Sexual intercourse with a condom
37
Sharing a meal with an infected person
6
Only 28.2% of subjects reported having been tested
for HIV. Of those who had been tested, 67.7% had been
tested at VCT centers, 21.1% at hospitals, 8.4% in
private clinics, and 2.9% used self-test kits. Of those
tested, 2.6% did not return for their results. Having been
tested for HIV correlated somewhat with knowing
someone who died of AIDS (r=0.24, p<.001). Having
been tested for HIV correlated more highly (r= 0.45,
p<.001) with recommending abstinence to a friend than
with recommending condom use to a friend (r=0.18,
p<.001).
Qualitative Results
Figure 2. Frequency of condom use in last month
The qualitative data provide information about the
context in which sexual decision-making occurs. Specific quotes from students are used to highlight prominent themes, such as condom use, relational context of
sexual decisions, and concerns about HIV testing. Campus relationships are viewed as transient as opposed to
marriage relationships. That observation has important
implications for sexual decision-making. Also, new
students were viewed as being naive.
"And then the nature of relationships in campus
most of them are not very long lasting. So one person
may enter into one thinking it may last long then after
some time or after having sex he/she is ditched." (3rd
SEXUAL RISK BEHAVIOR t ADAM AND MUTUNGI
year male)
Coercion in the form of gift giving was a fairly
prominent theme in the focus groups discussions.
If you accept what they are giving you, like she
said, something expensive, then straight away, there's
something hidden, so you don't want to feel indebted to
someone (cough), so people shy from debts, you know
according to how they do something for you, you have
to pay back for what they have done for you, so that's the
mistake they (women) make. (2nd year female)
Students also reported concern about sexual relationships between female students and male faculty,
with the exchange of sex for grades, or struggles to
avoid certain faculty who have a reputation for putting
female students in difficult situations.
It is an occurrence we have even in some courses.
Some ladies have had to dodge some lecturers. You
know if there is an option you can do without some other
lecturers. We have heard some ladies moving away,
drifting drastically from some particular lecturers who
are known that after spotting a female student, she must
first of all see him in the office then she can proceed with
the course. (4th year male)
In addition, several groups reported hearing rumors of female students who were supporting themselves at the University through prostitution.
According to focus group participants, abstinence
was viewed as a difficult choice, but was not perceived
as "Western." Choosing abstinence was often viewed as
a religious choice. In contrast, being sexually active
was viewed as a type of fashion statement, the "in" thing
to do.
"Not only students but everybody should be told
that premarital sex is not in order. I have seen it. When
addressing the issue of abstinence, it's presented as if it
is because of AIDS. But I think people should be told
that it is not in order. It is not because of AIDS that we
are told to abstain. It is in order for you to avoid
premarital sex." (2nd year male)
Condom use is not a straightforward issue among
university students. The question students had was, "if
all condoms are effective, how come some are cheaper
than others?" This led to mistrust of some of the condoms
on the market. A 1st year male student touched on this
issue saying:
But usually condoms are perceived as not usually
reliable because when you find the same condom brand
like Trust, there are different names and texture and you
wonder if I use the one for 10 cents is it really reliable
like that of 30 cents or if I use the one that I was given
free of charge at the clinic (laughter) will I really be sure
that I am protected or will it work? (1st year male)
It also appeared that condoms from dispensers
were less respected than those purchased from shops or
pharmacists. One female student commented:
A campus chick wouldn't hold a guy in high esteem
if he uses those from the dispensers, (laughter), "he just
94
passes by one dispensary and picks — (laughter), it's
true." (3rd year female)
The issue of HIV testing remains very controversial among students in large part because of tremendous
societal stigma. Students comment on feeling significant fear and helplessness related to HIV diagnosis.
Females voice fear related to a woman's unique vulnerability. Students also express fear about lack of confidentiality for testing provided on campus.
"Some people don't want to know their status. If
people get to know that they are HIV positive, they have
a negative attitude towards life, they think they are
going to die the next minute. So they give up, that's why
they don't like going to know their status that is if they
doubt themselves." (1st year female)
DISCUSSION
These data provide interesting insight into Kenyan
university students' knowledge, perceptions, and behavior. Fewer than half of university students reported
any previous HIV/AIDS education. Most students (89%)
reported thinking they were at risk for HIV infection,
but less than a third of subjects (28%) reported having
been tested for HIV. Qualitative data suggest the issue
of HIV testing remains very controversial among students in large part because of tremendous societal
stigma.
Clearly, even for the most elite students-those
admitted to university, prior HIV training had some
significant gaps. Only half had previous HIV training.
This demonstrates the need for HIV prevention programs at the university level as well as much earlier.
Interventions with demonstrated effectiveness aimed at
students in the university as well as effective interventions for primary and secondary school students would
be of benefit (Wegbreit et al. 2006). More interventions
for the secondary school age group have been carefully
evaluated (Eke et al. 2002, Erulkar et al 2004, Jewkes et
al. 2006, Klepp and Lugoe 1999, Magmami et al. 2005,
Obasi et al. 2006, Stanton et al. 1998).
This study highlights the importance of population specific information on risk behavior. Kenyan
university students report less risk behavior than the
general population. Women at the university are more
likely to delay sex. Sixty-three percent of Kenyan first
year female students report no sexual experience as
compared to 52% of women ages 15-19 in the population based survey (Hawken et al. 2002). The qualitative results indicate that first year females represent a
vulnerable population in regard to coercion and harassment. This identifies a strategic opportunity to
structure interventions aimed at reducing coercion
and encouraging risk avoidance and reduction in this
group.
Kenyan and American university students also
report important similarities and differences that em-
95
phasize the value of population specific information.
Kenyan university students differ from US students in
a variety of ways that may influence how an intervention works. Kenyan university students are older on
average than students entering college in the US because there is a two-year wait after graduation from
secondary school before the admitted class is allowed to
matriculate. Kenyans uniformly graduate in four years,
and are much more likely to delay sexual debut. Kenyan
and American university students report having been
tested for HIV at about the same rate (28%), but Kenyans
are at far greater risk of acquiring HIV due to the
incidence and prevalence of the infection (American
College Health Association 2006, World Health Organization 2004).
Over three fourths of sexually active students reported some experience with using a condom. Experience with condoms is substantially higher among
university students (76%) than the highest subset of the
national sample, men age 20-24 (56%). However, consistent condom use was still rare. The trustworthiness of
condoms was a concern identified in the qualitative data
and 37.4% of students reported that having sexual
intercourse with a condom could infect them. It is
possible this perception is related to an accurate understanding of risk reduction methods like condoms and
the role of cumulative risk for HIV infection. Correct
and consistent condom use has been shown to reduce
risk of HIV transmission in the range of 85% in
serodiscordant couples (Davis and Weller 1999, NIAID
2001). It is also possible that this reflects a perception
that only expensive name brand condoms reduce risk.
The perception that the quality of the condom was
related to price and place of purchase showed lack of
knowledge about condoms and their ability to reduce
risk for transmission. The ability of latex condoms to
form a mechanical barrier does not depend on price or
place of purchase. Improved knowledge of the reliability of condoms is easy to address with accurate information and is a necessary first step toward behavioral
change.
Over 89% of students thought they were at risk for
HIV but relatively few had received VCT. Hopelessness was identified as a barrier to VCT. Interventions
that promote hope by providing information about
antiretroviral treatment and how to access treatment
resources could be effective in increasing VCT. Studies
have demonstrated that effective antiretroviral treatment can be delivered in resource poor settings
(Kumarasary et al. 2005).
CONCLUSION
University students represent a strategic and vulnerable population, and there is a large need for effective HIV prevention interventions among university
students. In our sample, over half of the students report
SEXUAL RISK BEHAVIOR t ADAM AND MUTUNGI
no training on HIV/AIDS prior to entering the university. Most had never been tested for HIV despite knowing they might have been exposed. The data indicate
students have high levels of personal knowledge of
someone who has HIV/AIDS and this knowledge could
be harnessed and incorporated to promote changing
risk perceptions and behavior (Macintyre et al. 2001).
Programs that emphasize modes of HIV transmission
should help to alleviate fear of casual contact, dispel
irrational myths, and reduce societal stigma. We recommend that interventions for university students need to
include the full spectrum of HIV prevention including
risk avoidance (i.e. delay of sexual debut and faithfulness to one lifelong sexual partner), risk reduction (i.e.
partner reduction and correct and consistent use of
condoms), voluntary counseling and testing. Information about the care and support of HIV infected individuals may also help to encourage VCT by alleviating
the hopelessness associated with HIV infection.
ACKNOWLEDGEMENTS
We wish to thank the entire I Choose Life-Africa
team with special appreciation to Jackie Njagah and
Juma Warria, and Dr. Amuyunzu-Nyamonogo, the Moi
University administration and research team including
Professors Mathentge, Kibos, Drs. Kiplagat, Mrs. Sewe
and the postgraduate research assistants.
LITERATURE CITED
AMERICAN COLLEGE HEALTH ASSOCIATION. 2006. American College Health Association-National College
Health Assessment (ACHA-NCHA) Spring 2004
Reference Group data report (abridged). Journal
of American College Health 54:201-211.
BAGGALEY, R., F. DROBNIEWSKI, A. POZNIAK, D. CHIPANTA,
M. TEMBO, AND P. GODFREY-FAUSSETT. 1997. Knowledge and attitudes to HIV and AIDS and sexual
practices among university students in Lusaka, Zambia and London, England: are they so different?
Journal of the Royal Society of Health 117:88-94.
DAVIS, K. R., AND S. C. WELLER. 1999. The effectiveness
of condoms in reducing heterosexual transmission
of HIV. Family Planning Perspectives 31:272-297.
DE COCK, K. M., D. MBORI-NGACHA, AND E. MARUM.
2002. Shadow on the continent: public health and
HIV/AIDS in Africa in the 21st century. Lancet
360:67-72.
EKE, A., G. PEERSMAN, S. SEMAAN, K. HYLTON, N. KIITI,
AND M. D. SWEAT. 2002. Acquisition and review of
non-U.S.-based HIV risk reduction intervention
studies. Journal of Acquired Immune Deficiency
Syndromes 30 Suppl 1, S51-55.
ERULKAR, A. S., L. IL ETTYANG, C. ONOKA, F. K. NYAGAH,
AND A. MUYONGA. 2004. Behavior change evaluation of a culturally consistent reproductive health
SEXUAL RISK BEHAVIOR t ADAM AND MUTUNGI
program for young Kenyans. International Family
Planning Perspectives 30:58-67.
GARCIA-CALLEJA J. M., E. GOUWS, AND P. D. GHYS. 2006.
National population based HIV prevalence surveys in
sub-Saharan Africa: results and implications for HIV
and AIDS estimates. Sexually Transmitted Infections
82:iii64-70.
GOUWS, E., P. J. WHITE, J. STOVER, AND T. BROWN. 2006.
Short term estimates of adult HIV incidence by mode
of transmission: Kenya and Thailand as examples.
Sexually Transmitted Infections 82:iii51-55iii.
HAWKEN, M. P., R. D. J. MELIS, D. T. NGOMOBO, K. N.
MANDALIYA, L. W. NG'ANG'A, J. PRICE, G. DALLABETTA,
AND M. TEMMERMAN. 2002. Opportunity for prevention of HIV and sexually transmitted infections in
Kenyan youth: Results of a population based survey.
Journal of Acquired Immune Deficiency Syndromes
31:529-535.
HAYES, R., AND H. WEISS. 2006. Understanding HIV
epidemic trends in Africa. Science 311:620-621.
HEARST, N. AND S. CHEN. 2004. Condom Promotion for
AIDS Prevention in the developing world: Is it working? Studies in Family Planning 35:39-47.
JEWEKS, R., M. NDUNA, J. LEVIN, N. JAMA, K. DUNKLE, N.
KHUZWAYO M. KOSS, A. PUREN, K. WOOD, AND N.
DUVVURY. 2006. A Cluster Randomized-Controlled
Trial To Determine The Effectiveness Of Stepping
Stones In Preventing Hiv Infections And Promoting
Safer Sexual Behaviour Amongst Youth In The Rural
Eastern Cape, South Africa: Trial Design, Methods
And Baseline Findings. Tropical Medicine & International Health 11:3-116.
KLEPP, K. I. AND W. L. LUGOE. 1999. AIDS education in
Tanzania: focus on school children. AIDS Reader 9:
270-272.
KUMARASARY, N., S. SOLOMON, S. K. CHAGUTURU, A. J.
CECELIA, S. VALLABHANENI, T. P. FLANIGAN, AND K. H.
MAYER. 2005. The changing natural history of HIV
disease: Before and after the introduction of generic
antiretroviral therapy in southern India. Clinical
Infectious Disease 41:1525-1528.
MACINTYRE, K., L. BROWN AND S. SOSLER. 2001. It’s not
what you know, but "who you knew": examining the
relationship between behavior change and AIDS
mortality in Africa. AIDS Education & Prevention
13:160-174.
MAGMAMI, R., K. MACINTYRE, A. M. KARIM, L. BROWN, P.
HUTCHINSON, C. KAUFMAN, N. RUTENBURG, K. HALLMAN,
J. MAY, AND A. DALLIMORE. 2005. The impact of life
skills education on adolescent sexual risk behaviors
in KwaZulu-Natal, South Africa. Journal of Adolescent Health 36:289-304.
NATIONAL INSTITUTE OF ALLERGY AND INFECTIOUS DISEASES (NIAID). 2001. Workshop Summary: Scientific Evidence on Condom Effectiveness for Sexually
transmitted Disease (STD) Prevention.
96
http://www.niaid.nih.gov/dmid/stds/
condomreport.pdf Accessed 10 Nov 2006
OBASI, A. I., B. CLEOPHAS, D. A. ROSS, K. L. CHIMA, G.
MMASSY, A. GAVYOLE M. L. PLUMMER, M. MAKOKHA,
B. MUJAYA, J. TODD, D. WIGHT, H. GROSSKURTH, D.
C. MABEY, AND R. J. HAYES. 2006. Rationale and
design of the MEMA kwa Vijana adolescent sexual
and reproductive health intervention in Mwanza
Region, Tanzania. AIDS Care 18:311-322.
PEERSMAN, G., AND D. RUGG. 2004. Intervention research
and program evaluation: the need to move beyond
monitoring. New Directions for Evaluation 103:141158.
REHLE, R., T. SAIDEL, S. MILLS, AND R. MAGNANI. 2004.
Evaluating programs for HIV/AIDS prevention and
care in developing countries. Family Health International. http://search.fhi.org Accessed 10 Nov
2006.
RUGG D., M. CARAEL, J. T. BOERMA, AND J. NOVAK. 2004.
Global advances in monitoring and evaluation of
HIV/AIDS: From AIDS case reporting to program
improvement. New Directions for Evaluation
103:33-48.
RUTENBURG, N., K. HALLMAN, J. MAY, A. AND DALLIMORE.
MERTENS, T. E. AND D. LOW-BEER. 1996. HIV and
AIDS: Where is the epidemic going? Bulletin of the
World Health Organization 74:121-129.
STANTON B. F., X. LI, J. KAHIHUATA, A. M. FITZGERALD,
S. NEUMBO, G.. KANDUUOMBE, I. B. RICARDO, J. S.
GALBRAITH, N. TERRERI, I. GUEVARA, H. SHIPENA, J.
STRIJDOM, R. CLEMENS, AND R. F. ZIMBA. 1998.
Increased protected sex and abstinence among
Namibian youth following a HIV risk-reduction
intervention: a randomized, longitudinal study.
AIDS. 12:2473-2480.
TERRY, P. E., M. MHIOYI, T. MASVAURE, AND S. ADLISS.
2006. An examination of knowledge, attitudes and
practices related to HIV/AIDS prevention in Zimbabwean university students: Comparing intervention
program participants and non-participants. International Journal of Infectious Diseases 10:38-46.
WEGBREIT, J., S. BERTOZZI, L. M. DEMARIA, AND N. S.
PADIAN. 2006. Effectiveness of HIV prevention in
resource-poor settings: tailoring the intervention to
the context. AIDS 20:1217-1235.
WORLD HEALTH ORGANIZATION. 2004. WHO report 2004:
Changing history. http://whqlibdoc.who.int/icd/hq/
2004/9241592281_Kenya.pdf Accessed 10 Nov
2006.
UWALAKA, E. AND MATSUO H. 2002. Impact of knowledge, attitude and beliefs about AIDS among college students in Nigeria: The case of the University
of Nigeria, Nsukka. West Africa Review 3(2): http:/
/www.westafricareview.com Accessed 10 Nov
2006.
97
SEXUAL RISK BEHAVIOR t ADAM AND MUTUNGI
APPENDIX.
Moi University Survey.
1. Age:
a. below 20 b. 21-22 c. 23-24 d. 25 and over
2. Gender:
a. male b. female
3. Religion:
a. Christian b. Muslim c. Buddhist d. Hindu e. Other
4. Year of Study: a. First b. Second c. Third d. Fourth e. Fifth
5. Field of Study: a. Art b. Education c. Medicine d. Law e. Engineering f. BcS g. Information Science h. Other
6. Marital Status: a. Single b. Married c. Divorced d. Separated e. Cohabitating f. Widowed g. Other
7a. Have you ever attended any training on HIV/AIDS? a. Yes b. No
7b. If yes above, what was the subject or subjects of the training. Circle all that apply.
a. Effects of HIV/AIDS b. HIV/AIDS transmission c. Treatment of AIDS d. Counseling e. Home Based Care for HIV/AIDS
f. other (specify) __________
8. Do you think you are at risk of contracting HIV/AIDS? a. Yes b. No
9. How would you rank your chances of getting HIV infected? a. high b. moderate c. low d. no risk at all
WHICH OF THE FOLLOWING ARE WAYS THAT YOU PERSONALLY FEEL YOU COULD BECOME INFECTED WITH HIV?
10. Sharing of needles in ear piercing and tattooing a. I could become infected b. I could not become infected
11. Un-sterilized instruments used in dental work a. I could become infected b. I could not become infected
12. Deep mouth kissing a. I could become infected b. I could not become infected
13. Sexual intercourse without a condom a. I could become infected b. I could not become infected
14. Sharing a meal with an infected person a. I could become infected b. I could not become infected
15. Sexual intercourse with a condom a. I could become infected b. I could not become infected
TO WHAT EXTENT DO YOU AGREE WITH THE FOLLOWING STATEMENTS?
16. Without sex a relationship has no meaning.
a. totally disagree b. partially agree/disagree c. totally agree
17. I can tell my partner, “No sex without a condom”.
a. totally disagree b. partially agree/disagree c. totally agree
18. If all my friends and age mates are having sex, I will too.
a. totally disagree b. partially agree/disagree c. totally agree
19. Using a condom shows that I care about my partner and myself.
a. totally disagree b. partially agree/disagree c. totally agree
20. I cannot make right decisions about sex under the influence of alcohol/drugs.
a. totally disagree b. partially agree/disagree c. totally agree
21. Condoms can’t be trusted to protect you against HIV.
a. totally disagree b. partially agree/disagree c. totally agree
22. Sex should wait until marriage.
a. totally disagree b. partially agree/disagree c. totally agree
23. If I suggest we use a condom my partner will suspect I have HIV.
a. totally disagree b. partially agree/disagree c. totally agree
24. If a man gives a woman a gift he expects her to have sex with him in return.
a. totally disagree b. partially agree/disagree c. totally agree
25. If a woman takes a gift from a man she should have sex with him in return.
a. totally disagree b. partially agree/disagree c. totally agree
26. Have you ever had intercourse? a. Yes b. No IF NO PROCEED TO QUESTION 36.
27. Have you had sex in the last 12 months a. Yes b. No
28. At what level did you have sex for the first time? a. Primary b. Secondary c. Post secondary d. College/university
29. Why did you have sex for the first time? a. Forced b. Wanted to c. Cheated d.Other (specify)__________
30. How many sexual partners have you had in the last 12 months? a. None b. One c. Two d. Three e. Four or more
31. Have you had sex in the last one month? a. Yes b. No
32. Do you currently have a sexual partner? a. Yes b. No
33. What type of sexual relationship do you have? a. Dating/Regular b. Casual c. Cohabitating d. Married e. Engaged f. Other
34. Have you ever used a condom when you had sex? a. Yes b. No
35. In the past one month how often did you use a condom when having sex?
a. Every time b. Most of the time c. Some of the time d. Not at all
SEXUAL RISK BEHAVIOR t ADAM AND MUTUNGI
APPENDIX (CONTINUED)
36. Would you recommend the use of condoms to your friends? a. Yes b. No
37. Would you recommend to a friend that they abstain from sexual intercourse until marriage? a. Yes b. No
38. Have you ever had a Sexually Transmitted Infection? a. Yes b. No
39. Have you ever been tested for HIV/AIDS? a. Yes b. No IF NO PROCEED TO QUESTION 43.
40. If yes above, when did you undergo the last test? a. 1 month b. 3 months c. 6 months d. 12 months or more
41. Where did you undergo the test? a. Hospital b. VCT Center c. Private Clinic d. Self test kit
42. Did you find out the results of your test?
a. Yes b. No
43. Have you been to a VCT Center
a. Yes b. No
44. Would you be willing to go to a VCT Test at this University
a. Yes b. No
45a. Do you know someone personally who has died of AIDS?
a. Yes b. No
45b. How were you related to this person? a. Family member b. Friend c. University d. Other (specify) _________
TO WHAT EXTENT DO YOU AGREE WITH THE FOLLOWING STATEMENTS?
46. If a student is suffering from AIDS, they should be removed from the halls of residence.
a. totally disagree b. partially agree/disagree c. totally agree
47. If I knew that a shopkeeper or food seller had the HIV virus, I would not buy food from them.
a. totally disagree b. partially agree/disagree c. totally agree
48. A student, who has the HIV virus but is not sick, should be allowed to continue at the University.
a. totally disagree b. partially agree/disagree c. totally agree
49. If a member of my family became infected with the HIV virus, I would want it to remain a secret.
a. totally disagree b. partially agree/disagree c. totally agree
50. A student who is found HIV+ should not be supported by the university ARVs.
a. totally disagree b. partially agree/disagree c. totally agree
98
BAYESIAN ANALYSIS OF SCOLIOSIS ! MENKE ET AL.
99
LIKELIHOOD-EVIDENTIAL SUPPORT AND BAYESIAN ANALYSIS ON
A PROSPECTIVE COHORT OF CHILDREN AND ADOLESCENTS
WITH MILD SCOLIOSIS UNDER CHIROPRACTIC MANAGEMENT
J. MICHAEL MENKE, EVALUATION GROUP FOR ANALYSIS OF DATA, University of Arizona, Tucson, AZ
85721;
GREGORY PLAUGHER, Private practice of chiropractic, Alameda, CA;
CHRISTINA A. CARRARI, Private practice of chiropractic, Italy;
ROGER R. COLEMAN, Adjunct Research Faculty, Life Chiropractic College West, Hayward, CA; and
Private practice of chiropractic, Othello, WA;
LUCA VANNETIELLO, Private Practice Of Medicine And Chiropractic, Nola, Italy; and
TRENT R. BACHMAN, Private practice of chiropractic, Fremont, CA
ABSTRACT
A previous study using frequentist analytic methods on a single cohort showed no difference in forty-one patients
under chiropractic management for mild or early stage scoliosis. The grantor requested a re-analysis. Plain film
radiographs of 41 children and adolescents were re-measured by Risser-Ferguson and Cobb methods. Three
magnitudes and three types of change were constructed to cover various notions of scoliosis change: magnitudes of
1°, 3°, or 5°, and types that alternatively included or omitted no change as a possible successful outcome (arrested
progression). Improvement was assessed from using three filters across three definitions of progression: 1) curve
improved or stable, 2) improved only, and 3) those that either improved or progressed. Data were then analyzed by
evidential support methods and Bayesian analyses at each filter and type of progression to establish whether
improvement was likely attributable to treatment or spine characteristics.
Intra-class correlation for intra-examiner stability was 0.73 by Cobb method. Reliability between the new and the
previous examiner was 0.59 for pre- and 0.69 for post-treatment Cobb angles. Reliability increased dramatically when
end vertebrae were specified. Ratio of number improved to those progressed to was at least 2:1 for all three levels of
filter: 1°, 3°, and 5°. Number of treatments or duration of care were not associated with improvement. However, the
number of vertebral segments below the scoliosis curve apex – a measure of curve compression û and bone age
accounted for 49% of adjusted R2 in Cobb angle changes. Initial Cobb angle as a clinical predictor was not supported.
One treating chiropractor experienced a greater rate of improvement at the highest level of change (5°) in his patients.
Results here could not be attributed to management, but could be from a type of scoliosis resolving spontaneously,
or a subgroup of scoliosis cases that responded to chiropractic management or manipulation.
INTRODUCTION
This paper summarizes a re-analysis of data previously submitted to standard frequentist testing methods
for significant differences between means first published in 2001 (Lantz and Chen 2001). Occasionally,
the sine qua non cardinal of random selection and
random assignment cannot be implemented due to
financial and time constraints. Nonetheless, data is still
a precious, expensive, and time-intensive asset; and in
many cases quasi-experimental methodology can still
reveal response to care within the limitations of measurement and collection. Field trials such as the one
represented here are an enormous undertaking; often
confronting limits in available subjects or the ethical
dilemmas in delaying care for proper statistical control.
Without a true experimental design, analysts can
match statistical tools to study limitations. For example,
null hypothesis significance testing (NHST) is familiar
or seems to be the de facto research standard for clinical
research. But NHST has several severe limitations: 1)
simple significance does not measure the strength of an
effect, 2) NHST test data as a match for a given
hypothesis, rather than testing the hypothesis, and 3) by
distilling data into point estimates considerable information is lost, leading to 4) a compromise of statistical
power and increased likelihood of Type II error. ôOne
size fits allö is as unlikely for data as for shoes. Furthermore, for some questions, clinical trials are only incrementally more informative than cohort or case-control
studies. Or, such quasi-experimental designs may answer some questions and pose others that require a
randomized clinical trial to refine care outcomes. Even
the privileges and responsibilities of exploring new
knowledge domains, and the expense of data acquisition, only the most sensitive instruments for the task at
hand should be used.
MENKE J. M., G. PLAUGHER, C. A. CARRARI. R. R. COLEMAN, L. VANNETIELLO, AND T. R. BACHMAN. LIKELIHOOD-EVIDENTIAL
SUPPORT AND BAYESIAN RE-ANALYSIS ON A PROSPECTIVE COHORT OF CHILDREN AND ADOLESCENTS WITH MILD SCOLIOSIS
UNDER CHIROPRACTIC MANAGEMENT. JOURNAL OF THE ARIZONA-NEVADA ACADEMY OF SCIENCE 39(2):99-111
BAYESIAN ANALYSIS OF SCOLIOSIS! MENKE ET AL.
Cause and efficacious care of patients with scoliosis have been elusive (Burwell et al. 2000). This is just
as true for mild curves (i.e., < 20o). The assessment and
prognostic tools used to evaluate scoliosis angles and
any putative consequences are not supported in welldesigned clinical trials (Dickson 1999). Despite this
lack of evidence, Dickson and Weinstein (Weinstein et
al. 2003) discuss the need to treat patients (i.e., bracing,
surgery) with late onset (around puberty) of AIS, if it is
to only address appearance concerns, since the organic
complications of cardiopulmonary failure are related
only to the early onset cases and in those with more
severe angles > 50o.
To the authors' knowledge the four-part Ste-Justine
hospital “Comparative Retrospective Cohort Study of
Scoliosis Patients” is the largest cohort study and includes a population-based control group (Goldberg et
al. 1994a, Goldberg et al. 1994b, Mayo et al 1994,
Poitras et al 1994). Some studies have defined scoliosis
as a curve greater than 10o (e.g., Scoliosis Research
Society). Others have used 5o as the minimum threshold
(Goldberg et al. 1994a). In allopathic management,
most such cases merit "cautious observation" as the
primary management strategy, rather than comprehensive treatment.
Curve progression is the alarming consequence of
scoliosis and indicates the treatment rationale, including surgery and external bracing. Weinstein (1999)
reviewed several studies where scoliosis progression
was observed throughout the entire life, especially if the
curve was 20o or more after skeletal maturity. They,
contrary to the Ste-Justine series’ authors, concluded
that the “incidence of LBP in patients with scoliosis is
comparable to the incidence in the general population”
and state that an increased diagnostic awareness about
the topic has led to over-referring and over-screening of
children.
A Medline search from the period 1966 through
2003, using the key phrase “randomized controlled trial
AND scoliosis,” yielded no studies in relation to younger
subjects with late onset idiopathic scoliosis that evaluated Cobb angle outcomes.
Howard et al. (1998), in a non-randomized study,
concludes that TLSO brace is superior to others (Milwaukee and Charleston). There is a relative dearth of
evidence for surgical and chiropractic treatments. Since
periodic observation is the typical medical management protocol for curves under 20 degrees, it would
seem reasonable to test chiropractic care (i.e., high
velocity low amplitude (HVLA) adjustments, postural
advice, heel lifts, etc.) approach to patients with this
disorder (Lantz and Chen 2001). Lantz and Chen (2001)
reported on the effect of chiropractic intervention for
subjects with small scoliotic curves in younger subjects. The original authors concluded from the singlegroup analysis of 42 subjects over the course of a year
that full-spine chiropractic adjustments with heel lifts,
100
and postural and lifestyle counseling, were not effective
in reducing the severity of scoliotic curves. An interim
preliminary analysis of some subjects from the above
trial suggested that curve magnitudes were decreasing,
albeit slightly (Lantz and Chen 1996). This conference
report, although preliminary and based on fewer subjects, had results that contrast with the results reported
in the final paper. One of us (TRB) was a member of the
original study investigation team, participating as a
treating chiropractor. He provided the impetus to further explore the original study’s data (i.e., plain film
radiographs), and to deepen the inquiry first reported by
Lantz and Chen in 2001.
METHODS
The principal investigator of the original study (Dr.
Lantz) provided 41 complete sets of spine radiographs
used to measure Cobb angle in the original published
study of 42 patients. The time between the initial and
comparative radiographs was approximately one year.
We also had access to and reviewed subject records,
progress reports, and trial protocols. Subjects were
recruited from a variety of sources including local
physicians' patients and health fair screenings. Primarily two different doctors provided care to the subjects;
the clinicians were not randomly assigned to subjects.
One approach was the diversified chiropractic technique (n=26). In the diversified technique implemented,
all adjustments were high velocity and low amplitude in
nature and were based on a combination of palpatory
findings of movement restriction as well as intersegmental and postural alignment (full-spine A-P and
sectional lateral films). Adjustments were also directed
medialward at the apex of the scoliosis. Postural and
exercise advice and heel lifts if a short leg was present
were adjunctive to the adjustive care. The other technical approach was Gonstead technique (Plaugher 1993)
(n=15). Gonstead spinal examination analyses including temperature differentials (Plaugher et al. 1991,
Ebrall et al. 1994) and segmental adjusting procedures
were utilized throughout the entire spine and pelvis of
each subject at sites of subluxation (Simpson and Weiner
1999).
Posterior to anterior full spine x-rays were analyzed using both the Cobb and Risser-Ferguson methods by a practicing doctor of chiropractic with over 27
years of clinical experience, who was blinded to subject
information (e.g. subject identification, clinical status,
etc.), and whether the films were pre or post intervention/time.
In the Cobb method, two end-vertebrae were chosen, one at the superior aspect of the curve and the other
at the inferior aspect of the curve. These two vertebrae
were the ones which were most angulated (rotated on
the z-axis) toward the concavity of the curve as judged
by visual inspection on a standard 14 X 36-inch radio-
101
BAYESIAN ANALYSIS OF SCOLIOSIS ! MENKE ET AL.
graphic light box (Plaugher and Lopes 1993, Panjabi et
al. 1974). Standard lines were drawn in accordance with
the Cobb method.
In the Risser-Ferguson method the end-vertebrae
used were the same ones that had been used in the Cobb
method. In addition, a middle segment, the most laterally displaced (i.e. the vertebra at the apex of the curve)
was identified. On each of these three vertebrae, diagonal lines were drawn from the opposing vertebral body
corners, such that they crossed near the middle of the
vertebral body. The angle formed by lines drawn from
the superior and inferior bodies intersecting in the
middle vertebral body, is the Risser-Ferguson angle
(Rowe and Yochum 1987). Risser’s sign of each of the
41 pretreatment x-rays measured skeletal maturity of
the patients, as this is a commonly used prognosticator
in the trajectory of scoliosis (Lonstein and Carlson
1984).
Analytic Strategy
The original report (Lantz and Chen 2001) did not
support effectiveness of chiropractic management of
scoliosis, as tested by comparisons of pre-post treatment measures. However, the study was underpowered
(1- β) = 0.46 at α = 0.05; 80 subjects were needed for
80% power) and lacked a control group and methodology to confidently conclude that no change had occurred. Potential behaviors may have been missed from
using tools not sensitive enough to detect them.
The main objective was to re-assess the data with
scientific tools more appropriate to these research hypotheses and data. The analysis proceeded from simple
description, measuring reliability, formulating new
hypotheses based on observations, isolating factors and
estimating predictors' variance contribution to outcomes,
then testing each factor for its likelihood in contributing
to outcome. Inconsistent and isolated behaviors were
not accepted as plausible.
Lack of a comparison control group impedes attempts to separate chiropractic management from the
natural history of scoliosis. Other potential sources of
error include placebo effects, radiographic positioning,
and normal variance in postural differences. We assumed that these sources of variance were not systematic (biases), but were true residual (error). The objective
was to assess potential contributions to knowledge and
identify hypotheses to test or refine in future research.
At least three relevant vectors were identified:
(Lantz and Chen 2001) magnitude and direction of
change, (Burwell et al. 2000) level of clinically important change, and (Dickson 1999) alternatively including no curve as successes (stabilization) and then as
failures (did not respond) in analysis. Vector one is the
magnitude and direction of change. Vector two was
minimum acceptable clinical change as 1o, 3o or 5o
called filters, designated f. Vector three had 3 levels,
including no change as a good clinical outcome (filters
of Improvement + No Change, fINC), improved only as
a good clinical outcome (filters of Improvement Only,
fIO), and bilateral (fBL for Bilateral change) movement
only under all 3 three filter levels.
Simple visual inspection of radiographs followed
by general linear models identified associated factors.
These were then tested by evidential support methods
after logistic regression, and Bayesian analysis on the
three magnitudes and types of success.
Logistic regressions were computed with R statistical software (R Development Core Team 2004) and
were applied across fIO versus fINC coded datasets.
Variance contributions, likelihood ratios, and evidential support were also computed. Finally, factors were
combined in hierarchical linear models to generate a
prediction equation. All programs were run on Apple
867 MHz PowerBook G4 (OS 10.3.8), with Emacs text
editor with ESS for "R" analysis.
Natural logarithms of the likelihood ratios were
computed from "R" logistic regressions (R Development Core Team 2004). Since we are dealing with
logarithms of ratios, the more negative the resulting
number, the higher the likelihood the factor contributes
to outcomes, and the less likely the finding was an
erroneous. Support numbers 0 or greater suggest no
evidence; -1 is weak evidence; -2 is moderate evidence;
-3 is strong evidence; and -4 is extremely strong evidence (Goodman and Royall 1988). These analyses
were executed across the three definitions of improvement and three filters described above.
Specific hypotheses
Initial Cobb angle is an often-cited predictor of
scoliosis progression, as used in the Lonstein nomogram (Lonstein and Carlson 1984). Was the initial
Cobb angle predictive of outcome in this sample? What
measured aspects of treatment or adjustive technique
could have been effective in producing an outcome?
In the course of re-measuring x-rays, an additional
hypothesis emerged based on observations that more
compressed scoliotic curves seemed to improve more
over the course of the study. Thus, could a characteristic or characteristics of the scoliosis curve be related, or
even predictive of outcome?
RESULTS
Table 1 summarizes descriptive statistics of the
patients in this study. For α = 0.05 (2-tailed), number of
subjects 41, and mean and standard deviation of the prepost difference scores of 0.56 and 5.31, respectively,
statistical power (1 - β) for this study was calculated to
be 0.46.
The sample size needed to detect a 5 degree
change, with 80% power and n= .05 should be 80 in a
BAYESIAN ANALYSIS OF SCOLIOSIS! MENKE ET AL.
102
Table 1. Patient characteristics. SE=Standard Error.
n=41 for all variables except bone age (n=36).
Variable
Age (years)
Mean
SE
95% CI
11.9
0.44
10.99- 12.81
Boys/Girls
26/15
Bone age
12.24
0.54
11.15- 13.33
Cobb angle at intake
1 0. 3 7
0.77
8.81- 11.94
7.71
0.70
6. 3 1 - 9. 1 3
Treatment duration
1 4. 2 9
0. 5 7
13.14- 15.44
# of visits
41.41
3.43
34.48- 48.35
3.12
0. 3 1
2.48- 4.05
Risser- Ferguson
Average visits per
month
Reliability
pre-post study. Intercorrelations of the primary
measures revealed several interesting relationships
(Table 2): reliability between the Cobb and Risser
Ferguson (RF) methods, the relationship of the change
in curve (the primary outcome measure, ∆ Cobb) with
Cobb 1, age, months of treatment number of visits,
rate of visits, and bone age. Note in particular the
relationship of ∆ Cobb to the number of segments in
the curve (named span), and the number of segments
below the apex of the curve (named Apex)
Patients with a greater degree of initial Cobb
angle received more visits, though more visits was
not associated with a greater reduction in scoliosis.
Interestingly, greater chronological age was unrelated to outcome (∆ Cobb), but greater bone age
correlated with curve improvement. Improvement
and progression were assessed as at least 1o, 3o or 5o
of change in the analysis algorithms.
The correlation in change scores (pre-post) between first and second analyses was r = 0.44 (p<0.01),
though both analyses concluded only 0.5o overall
difference in pre to post measures. Intraclass correlation coefficients were 0.59 for pre-measures, and
0.69 post treatment measures. These low correlations
may indicate differences in marking techniques such
as undesignated end vertebrae in marking. Intra-rater
reliability of markings in the second analysis by one
of us (RRC) for initial Cobb was r = 0.81 for a
randomly chosen set of 23 without end vertebrae
specified. Reliability increased to over 0.91 when end
vertebrae were designated. Risser Ferguson measures were 0.70 without and 0.81 with end vertebrae
designation. We used the Cobb angle measures for all
the analyses of outcome.
Reliability of scoliosis vertebra choice (see Table
3, next page) was accomplished by converting thoracic and lumbar vertebrae to a numerical sequence
from 1 to 17 (for T1 to L5), then correlating among
pre- and post-treatment radiographs. Cephalic vertebrae selection correlated moderately r = 0.44 (p <
0.05, 95% CI: 0.161 to 0.654); and bottom vertebrae
correlated a mere r = 0.22 (NS, 95% CI: -0.094 to
0.494). Kappa coefficients computed by intra-rater
concordance analyses were virtually identical 0.44
for the cephalic and 0.21 for caudal segment selection. This suggests a major source of scoliosis measurement error could be inconsistencies in end
vertebrae choice, if a rater is blinded to pretreatment
and comparative radiographs.
Table 2. Intercorrelations among subject variables. Cobb1 = initial Cobb angle; Cobb2 = final Cobb angle; D
Cobb = change in Cobb angle; RF1 = Initial Risser Ferguson angle; Span = number of vertebral segments in
the scoliosis; Apex = number of segments below the apex of scoliosis curve; Age = age of patients; Months =
months under care; Visits = number of visits during care; Rate = average number of visits per month (Visits/
Months). Significant correlations are designated: * = 0.05, ** = 0.01; *** = 0.001.
Cobb1
Cobb2
∆ Cobb
RF1
Span
Apex
Age
Months
Visits
Rate
Bone age
Cobb1
Cobb2
∆ Cobb
RF1
Span
Apex
Age
Months
Visits
Rate
0.56***
0.28
0.74***
-0.07
-0.17
0.30*
0.04
0.47**
0.48***
-0.09
-0.63***
0.50***
0.36**
0.37**
0.13
-0.14
0.29
0.39**
-0.42**
0.11
-0.49***
-0.59***
0.13
0.21
0.10
0.00
0.42**
0.02
0.03
0.36*
0.12
0.33*
0.31*
0.01
0.87***
0.02
0.06
-0.23
-0.22
-0.33*
-0.07
-0.06
-0.26
-0.19
-0.35*
0.13
0.10
0.02
-0.07
0.02
-0.45**
0.09
0.86***
0.17
0.11
BAYESIAN ANALYSIS OF SCOLIOSIS ! MENKE ET AL.
103
Table
3. Inter-rater and intra-rater reliability of vertebrae choice and m easured Cobb angles.
Inter- rater
reliability
Cobb1
Sample size, n
Intra- rater reliability w/o end
vertebrae specified
Cobb 2 Cobb 1
Intra- rater segment
selection reliability
Intra- rater with end vertebrae
specified
Cobb 2
RisserFerguson
Top
segment
Bottom
segment
Cobb 1
Cobb 2
RisserFerguson
41
41
23
19
24
41
41
41
N/A
41
Pearson’s r
0.39
0.58
0.81
0.68
0.70
0.44
0.22
0.91
N/A
0.81
p value for r
0.01
0.001
0.001
0.001
0.001
0.01
N/S
0.001
N/A
0.001
ICC
0.59
0.69
0.79
0.66
0.70
0.44
0.22
0.91
N/A
0.81
0.47°
0.40°
1.31°
1.55°
0.73°
0.28
segments
0.45
segments
0.04°
N/A
0.23°
Mean
difference
Estimating clinical change
The sum of initial angles was 425.5o versus a sum
of 373o at the end of the one-year observation period, an
overall reduction of 12.34%. Summaries of scoliosis
curve characteristics are included in Table 4. The median value of the initial Cobb measurement was 10o.
The change score mean was 0.55o (n=41), with a
standard error of 0.83 in the current analysis. Gonstead
(n=15) showed an average improvement of 1.43o, while
the diversified technique (n=26) averaged 0.06. As the
filter was increased, naturally the number of patients
observed to change necessarily increased. A minimum
5o change on those with an initial angle of 4 degrees had
10 of 39 (26%) improve, while 9 (23%) progressed.
Mean change for the second analysis was 0.56o (RRC).
When the proportion of those improving versus
those progressing is graphed as a function of initial
angle and filter size, it is apparent that overall improving outpaced progression and monotonically increased
with initial Cobb angles and filter size, by as much as 3
times (Figure 1) at higher levels of initial Cobb angle
and filter size. (n=26) averaged 0.06. As the filter was
increased, naturally the number of patients observed to
change necessarily increased. A minimum 5o change on
those with an initial angle of 4 degrees had 10 of 39
(26%) improve, while 9 (23%) progressed. Mean change
for the second analysis was 0.56¦ (RRC).
When the proportion of those improving versus
those progressing is graphed as a function of initial
angle and filter size, it is apparent that overall improving outpaced progression and monotonically increased
with initial Cobb angles and filter size, by as much as 3
times (Figure 1) at higher levels of initial Cobb angle
and filter size.
Visual inspection of radiographs and data suggested three candidate variables that contributed to
outcome variance in curve improvers versus progressors:
initial Cobb angle (Cobb 1), number of infra-apex
segments (Apex), span, the number of segments in the
major scoliosis curve, and bone age. Both Apex and
span were highly correlated and collinear û they describe the same phenomenon and were thus interchangeable in analysis. As seen below, the variable
Apex consistently accounted for more variance than the
span variable.
The Apex variable was noticed during radiographic
coding as cases more likely to improve had more
compression in the scoliotic curve. Specifically, compression under the apex of the curve seemed most
highly associated with or predictive of improvement.
The concept of compression under the scoliotic apex is
illustrated in Figure 2. In all three of the Figure 2
scenarios, Cobb angles are equal. The only differences
are the apexes of the scoliosis which start at T12, then
progress through L1, and L2, left to right.
The ratio of patients improving to those whose
curve progressed ranged from 1.5:1 to 3.5:1 for patients
with at least 8? of scoliosis, with a 1o to 5o filter. Patients
with 4 or fewer infra-apex segments had about 2:1 odds
for improvement under a 1o filter, 2:1 ratio for a 3o filter,
and 1.8:1 ratio with a 5o filter. Figures 3, 4 and 5
Table 4. Summary of Subjects’ Scoliosis Characteristics.
Variable
Mean, n
SE
95% CI
Initial Cobb
angle
Final Cobb
angle
Segments in
curve
Segments
below apex
∆ Cobb
∆ Risser
10.38, 41
0.77
8.88-11.94
9.82, 41
0.96
7.88-11.76
7.32, 41
0.46
6.39-8.62
4.07, 41
0.24
3.58-4.56
0.55, 41
0.64, 41
0.83
0.88
-1.12-2.24
-1.14-2.43
BAYESIAN ANALYSIS OF SCOLIOSIS! MENKE ET AL.
104
Figure 2. Same 30° scoliosis with three distinct
apices: T12, L1, and L2, distributing compressive
forces more laterally and inferiorward.
determination) was accounted for in the equation:
Cobb.change = - 2*Apex + 0.32*bone age +5.1
Figure 1. The proportion (odds ratios, OR) of subjects
who improved vs. those who did not (top), and those
who improved or were stable as compared to those
who did not (bottom). More patients improved than
progressed in all cases.
graphically illustrate the dominance of improvement
over progression of scoliosis in this sample.
In the next analysis, identified factors were used as
predictors in logistic regression for estimating likelihood ratios for computation and evidential support (R
Development Core Team 2004). Evidential support is
the natural logarithm of the likelihood ratio (LR) of a
result expected in a null hypothesis versus an alternative hypothesis of a factor having an effect in the results
seen. Positive LR numbers suggest the factor did not
affect clinical outcome, since result is explained by the
null hypothesis for that variable (the variable had no
measurable effect). Negative numbers support a factor's
role in influencing outcome, since the denominator –
the alternative hypothesis that the factor tested is making a contribution to the outcome – is supported as
contributing to the outcome to a much greater degree
than the null hypothesis.
Table 5 lists each of the top four factors and their
levels of variance contribution, significance, and likelihood ratio. All factors except initial Cobb angle, were
strong contributors to predictability of scoliosis improvement. The greatest support was evident in the
Apex and bone age factors, with the highest degree of
support in all three categories of filter. Figure 6 graphically represents this finding. When the two factors,
Apex and bone age were loaded into multiple regression against the change in scoliosis during the observation period, 49% of the variance (adjusted coefficient of
Negative scores represent a progression, or worsening of scoliosis in this model. The above equation
explains 49% of the outcome variance, or likelihood, of
patients' scoliosis progressing in this study. The Apex
factor, a measure of inferior Cobb angle compression is
weighted most heavily (times a factor of -2) and added
to bone age as assessed by wrist x-ray (times 0.32).
These numbers are added to 5.1o to give an estimate of
direction and degree of mild scoliosis change over the
course of a year. This change cannot be confidently
attributed to treatment or natural history, because of the
lack of a control group. In brief, lower Apex scores
(number of segments under the curve apex) predict the
scoliosis will improve. When Apex exceeds 3, the
overall equation becomes positive, indicating a progression of scoliosis.
Bayesian analysis
Bayesian analysis estimates probabilities of a clinical event prior to and after treatment û after we look at
the information provided by the new study. Significance testing is unnecessary, since the clinicians may
interpret these probabilities and choose treatments according to perceived clinical utility.
Further, the Bayesian analysis has intuitive appeal
in that these types of probabilities estimate chances the
alternative hypothesis is true for given data, or P(H|D);
whereas null hypothesis significance testing compares
the likelihood of data given the hypothesis, or P(D|H).
A final crucial point is that Bayesian statistical analysis
builds upon earlier research in that results in earlier
research (posterior probabilities) are the new prior
probabilities in later research, leading to a refining of
hypotheses, increasing predictability and clinical confidence.
As there are no clinical trials on the treatment and
management of mild scoliosis, there were no data to
inform prior probabilities. There are thus two prior
BAYESIAN ANALYSIS OF SCOLIOSIS ! MENKE ET AL.
105
Figure 3. Graphic representation of scoliosis
curve changes, including improvement or
progression and no change as observed
through 1o, 3o, and 5o levels. of change.
probabilities: a calculated naive prior probability and a
prior probability of, equipoise, or neutral expectation of
50% chance of improvement or progression.
Since the ratio of those who improved to those that
progressed was at least 2:1 with each filter and with or
without the inclusion of the no-changers, the variance
for overall improvement had to be partitioned from the
contributing factors. The contribution of the Apex
variable contribution was within an overall trend to-
Figure 4. Raw counts of patients who
progressed and those who improved by
categories of initial Cobb angles and observed minimum degree of improvement.
wards improvement, and its unique contribution had to
be separated from this overall trend. This was accomplished by the Bayesian analysis.
Appendix A details the results of the Bayesian
analysis and number needed to treat (NNT) analyses for
filters 1o, 3o, and 5o on the fINC-coded data that considered ôstabilizedö scoliosis as a clinical success. The fIO
and fBL sets could not be included because they had
empty cells at filter 5o, which would erroneously inflate
Table 5. Summary of evidential support analysis by variables derived from log likelihood ratios. As LR approaches 0,
the corresponding log approaches infinity. Any number greater than zero indicates no support for an alternative
hypothesis that the factor affected outcome.
Tested Deviance Factor
Initial Cobb angle
Likelihood ratio, LR
Probability
Level of support, loge LR
Improve Only
Improve + N/C
Filter1 Filter3 Filter5
Improve or Progress
Filter1 Filter3 Filter5
Filter1
Filter3
Filter5
2.13
0.14
-0.76
2.82
0.1
-1.04
0.63
0.43
0.46
3.39
0.07
-1.22
4.08
0.04
-1.41
0.01
0.93
>> 01
0.32
0.57
1.14
1.45
0.23
-0.37
0.97
0.32
0.03
6.16
0.01
-1.82
5.1
0.02
-1.63
4.58
0.03
-1.52
7.19
0.01
-1.97
9.5
0.002
-2.25
4.016
0.05
-1.39
7.13
16.1
15.67
0.01 0.00001 0.00001
-1.96
-2.78
-2.75
8.49
0.004
-2.14
11.77
0.001
-2.47
6.9
0.01
-1.93
10.8 13.55
0.001 0.0002
-2.38 -2.61
8.08
0.004
-2.09
9.17
0.002
-2.22
17.32
0.0001
-2.85
18.11
0.0001
-2.90
2.8
0.09
-1.03
1.28
0.26
-0.25
0.02
0.89
>>01
6.12
0.01
-1.81
10.85
0.001
-2.38
1.56
0.21
-0.44
5.56
0.02
-1.72
7.33
0.007
-1.99
Span of primary scoliosis
Likelihood ratio, LR
Probability
Level of support, loge LR
Segments below curve apex
Likelihood ratio, LR
Probability
Level of support, loge LR
Bone age
Likelihood ratio, LR
Probability
Level of support, loge LR
8.58
0.003
-2.15
BAYESIAN ANALYSIS OF SCOLIOSIS! MENKE ET AL.
Figure 5. Number of patients exhibiting no
change as a classification of initial Cobb angle
and size of measurement filter.
probability estimates. The fIO and fBL data supported
Apex as a factor in filters 1o, 3o, or 5o in logistic regression
analysis and log likelihood ratios (evidential support
estimates). For fINC, the number of Apex segments three
or fewer improved posterior probabilities from 0.10 (at
filter 1o) to 0.19 (at filter 3o) over the naive prior probabilities; and 0.26 to 0.39 over the equipoise prior probabilities. For fIO filters 1o and 3o, posterior probabilities
were increased by 0.04 and 0.18 respectively over computed priors and 0.19 and 0.16 over equipoise (50-50)
prior probabilities. In summary, physician effects were
null when combined, but closer inspection showed differential contributions at higher filter levels.
106
Primary treatment attributes were the two chiropractic techniques. Ancillary treatments such as heel lifts
were used, but were not systematically implemented nor
was patient adherence recorded. In aggregate, an average
of 0.5o change was observed in the group. A subgroup
improved and another progressed. Overall, twice as
many improved as progressed at all filters. The improving subgroup had three or fewer segments in the subapical area in the scoliosis curve variable (Apex = 3).
The contribution of each chiropractor cannot be
separated from his technique by partitioning variance
components, since care is nested within the chiropractor.
Logistic regression and likelihood ratios did not support
an overall association by treatment or clinician (Appendix A), but Bayesian analysis on the fINC data (the fIO
and fBL had empty cells) revealed an underlying complexity between treaters (or technique). The chiropractic
physician posterior probability (after observing data
changes during study) for clinical outcome increased by
0.03, 0.10, and 0.13 for filters 1o, 3o, and 5o over the
calculated nanve prior probabilities; and 0.18, 0.24, and
0.40 in an equipoise state. For the fINC set, one chiropractor (Gonstead technique) experienceda higher level
of clinical success at filter 5o with 14:1 ratio of improvement (for n=15). The other clinician experienced a ratio
of patient improvement at 18:7 at filter 5o. If technique,
and not personal style of the chiropractor, or a unique
subset of patients, were responsible for the difference,
then a 14 times more improvers for Gonstead versus 2.57
times for the diversified technique would be remarkable.
However, this question remains for future research to
investigate.
If a chiropractic or medical doctor is faced with
recommending care or not in a young adolescent with a
mild scoliosis, then this study may provide some
Figure 6. Evidential support by change filter and each source of deviance from the null hypothesis
107
BAYESIAN ANALYSIS OF SCOLIOSIS ! MENKE ET AL.
preliminary guidance. Doing nothing at all (cautious
observation) could result in a 50/50 chance of the curve
progressing. On the other hand, if diversified or Gonstead
care is provided, then the odds are much greater (18:7
to 14:1) that the patient will experience an improvement
in the scoliosis. Prospective and controlled studies are
needed to further test this question.
Number needed to treat
The calculation of number needed to treat, NNT, is
the reciprocal of posterior probabilities of clinical failure subtracted from success: 1/(improvement – not
improved). NNT indicates how many patients must be
treated to obtain one clinical success of improvement or
"stabilization." NNT's ranged from 1.6 to 1.2 for naive
priors and 1.9 to 1.3 for prior probabilities under the
equipoise assumption for filters 1o, 3o, and 5o. For every
1 to 2 people seen with mild scoliosis, 1 improved. This
finding was associated with Apex rather than treatment
characteristics, since treatment variance components
were not associated with outcome.
DISCUSSION
Recruitment of sufficient numbers of qualified
patients is one of the most challenging aspects in
clinical and field research. Under frequentist analyses, underpowered studies become a limiting factor,
and missing responses to interventions increasingly
likely. If statistical significance is achieved, results
are suspect when sample sizes are inadequate. Fortunately, null hypothesis significance testing is not the
only tool in the clinical researcher's toolbox.
For a largely ill-defined domain such as scoliosis
with lots of unproven treatments, other measures
such as pain, quality of life, disability, satisfaction,
and associated costs may point to other reasons to
seek chiropractic management than just curve reduction. Also, other measures allow critical multiplism
methods to assess change from various independent
perspectives (Houts, Cook, and Shadish 1986). Here
though, is verisimilitude to the modern chiropractic
or medical practice, caring for a patient with scoliosis, where thecurve is the primary concern, and the
ultimate objective is to at least not observe progression. So this studyrepresents clinical fidelity in concerns and outcomes of scoliosis.
Including patients who did not change over the
course of observation or treatment in with the clinical success group, at least twice as many improved
as progressed. At higher filters of 5o , this ratio
approached 3 to 1. Omitting the clinically indeterminate values in the middle does naturally increase
the distance between outlying group means. But
that expected difference was not tested here; the
observation that twice as many improved at each
level of filter.
Analyses were done to identify which patient
factors could discriminate improvers from progressors,
measure their impact on outcome, and finally
calculatetheir support as opposed to a null hypothesis of
no effect. We identified factors consistent in contribution across the three vectors, and with moderate to
strong evidential support in clinical contribution: bone
age, a natural progression; and infra-apex segment
count, which was associated strongly with outcome, but
less so with time or treatment.
Initial Cobb angle, a commonly used predictor in
the prognosis of scoliosis was not supported. Initial
span of scoliosis segments, correlated with outcome,
but not with initial Cobb angle, and negatively with
bone age. Span correlated with Apex, which in turn
correlated more highly than span to the curve change.
As span and Apex were collinear, both could not serve
as regression predictors. Likelihood ratios and evidential support calculations favored Apex over Span, so
Apex was chosen for regression and Bayesian analysis.
The role of bone age as an outcome predictor was
more complex and interesting. Initial bone age correlated with the change in Cobb angle at r = 0.42 (p <
0.01), about 15% of the outcome variance. That physiologically older kids had better outcomes is likely a
snapshot of the later course of scoliosis which is consistent with younger children being more at risk for progression. Bone age was the second hierarchical predictor
for the scoliosis in this study, though its contribution
was not as consistent across analyses as Apex.
Again, the equation:
Cobb.change = 5.1 - 2*Apex + 0.32*bone age
models the progression of scoliosis under the few
parameters collected here. We expect further testing
and refining in future studies. Apex and bone age
predictors may only have been unique to this small
sample with mild scoliosis, but only future research will
tell. Attempts at critical multiplism were thwarted due
to the limited number and kind of dependent measures
originally collected. But even with the conservative
analyses and inferences performed here, this sample of
41 patients with mild scoliosis could provide directions
for future research. The objectives of equation modeling here are to build useful empirical tools for the
practicing clinician. Science can and should inform
better clinical decision-making, rather than simply reject hypotheses of no difference, which adds little to
patient knowledge and management.
The greatest changes were among those with over
10o in initial Cobb angle. This pattern may echo regression to the mean, though in the scoliosis literature
greater angles are at higher risk for progression. More
patients improved than progressed, but this too may be
the natural history of mild AIS, which is unknown.
Predicting regression to the mean is just as valuable as
BAYESIAN ANALYSIS OF SCOLIOSIS! MENKE ET AL.
predicting response to a treatment. If most kids with
AIS regress to the mean, then clinicians should be
apprised of this as yet another description of natural
history. Many clinicians however think that AIS should
be followed with at least one comparative radiograph
due to a perceived risk for progression. These findings
are suggestive enough to warrant future hypothesis
testing.
Is no change in scoliosis curvature a positive outcome? Despite many different studies over the past two
decades (Montgomery and Willner 1988, Pecina et al.
1992, Bunnell 1988, Brooks et al. 1988, Rogala et al.
1978) that have looked at the natural history of lateonset scoliosis no summary conclusions can be made.
The empirically chosen predictive factors (Apex and
bone age) held up in both datasets –with both no change
as a successful outcome and no change as an unsuccessful outcome. In the past a variety of different criteria
have been used to assess the course of scoliosis, including the definition of scoliosis change (e.g. > 5 degrees
or > ten degrees), and inclusion criteria of patients in
treatment and discrepancies in follow-up (Soucacos et
al. 1998).
A characteristic of curve compression appears to
be a better clinical prognosticator than bone age in the
clinical trajectory of mild scoliosis. Those with three or
fewer infra-apex segments improved twice as often as
those with more than 4 segments. Apex and bone age
accounted for 49% of the adjusted variance, an impressive value on the face of it, but interpreted cautiously for
the given study constraints. However, the value of Apex
as an outcome predictor, meditated by treatment or time
was supported consistently via logistic regression, likelihood ratios, evidential support analysis, and Bayesian
analysis. And though recorded treatment number, frequency, or duration did not contribute as a "main effect"
to clinical outcome, a slight decrease in uncertainty
contributed by the difference between the clinicians
does not allow us to rule out care as contributing to
outcome.
CONCLUSION
End vertebrae must be specified to reduce measurement error and to allow for a shared method to
measure change of scoliosis. All patients in this
studyimproved at a ratio of 2:1 in all three filter or
observations levels and across improvement only (fIO)
and improvement plus no change (fINC) analyses.
Degree and direction of improvement was best predicted by number of infra-apex segments (a measure of
curve compression), and bone age. Patients with three
or fewer infra-apex segments had approximately 2:1
odds for improvement over one year, regardless of
initial curve angle. Logistic regression likelihood ratio
and Bayesian analysis supported a predictive role for
Apex, and to lesser extents bone age and span of the
108
scoliotic curve in number of segments; but initial Cobb
angle had no apparent contribution to results.
The regression equation predicting change from Apex
should be tested and refined in future scoliosis research.
Other indicators of health should be recorded in future
studies to fully assess the impact of conservative management on more signs and symptoms of scoliosis,
including quality of life and pain.
Probabilistic models can serve to reduce clinical
uncertainty. To the extent that attributes of a patient's
presentation can help improve clinical decisions, improve
prognosis, and reduce costs, is a promising direction for
health care. Calculating odds for improvementunder a
certain management or treatment can be enormously
informative and helpful for patient management and
public policy. Attribution and attachment to a single
cause or cure can lead a patient lead into excessive and
expensive treatment or excessively delayed proper treatment. Though single-cause determinism is attractive, it
is oversimplified and unpredictable. The most important aspect of treatment is properly identifying the
problem. Only then may treatment commence.
Since potentially valuable information was obscured in this study by the design and conventional
frequentist methods, we suggest that all future studies
carefully consider nomothetic estimation rather than
automatically adopt frequentist null hypothesis significance testing. Models of health and disease are stochastic, not deterministic. Odds for recovery are increased
under certain circumstances, and reduced under others.
No one cause leads to one specific effect. Given the
state of today's health care system and population
health, one obvious place to decrease risks, costs, and
side effects is in improved predictability. Informed
consent procedures should give patients odds for success in addition to risks and side effects for any given
procedure and management protocol.
The aim of clinical science is to enhance individual predictability in patients. We believe that the
non-frequentist methods used herein uncovered information overlooked by the typical null hypothesis significance testing. Whether these findings are ultimately
of use in improving doctor prognostications remains
for future research to support or refute. To this end, we
support and recommend Bayesian and evidential support methodology.
LITERATURE CITED
BROOKS H. L., S. P. AZEN, E. GERBERG, R. BROOKS, AND
L. CHAN. 1975. Scoliosis: A prospective epidemiological study. Journal of Bone and Joint Surgery 57A:968-972.
BUNNELL W. P. 1988. The natural history of idiopathic
scoliosis. Clinical Orthopedics 229:20-25.
BURWELL R. G., P. H. DANGERFIELD, T. G. LOWE, AND J.
Y. MARGULIES. 2000. Spine: State of the Art:
109
BAYESIAN ANALYSIS OF SCOLIOSIS ! MENKE ET AL.
Etiology of idiopathic adolescent scoliosis.
Lippincott Williams & Wilkins, Inc., Baltimore.
DICKSON R. A. 1999. Spinal deformity – adolescent
idiopathic scoliosis: Nonoperative treatment. Spine
24:2601-2606.
EBRALL P. S., A. IGGO, P. HOBSON, AND G. FARRANT.
1994. Preliminary report: the thermal characteristics of spinal levels identified as having differential
temperature by contact thermocouple measurement (Nervoscope). Chiropractic Journal of Australia 24:139-46.
GOLDBERG, M. S., N. E. MAYO, B. POITRAS, S. SCOTT, AND
J. HANLEY. 1994. The Ste-Justine adolescent idiopathic scoliosis cohort study: Part I: description of
the study. Spine 19:1551-1561.
GOLDBERG, M. S., N. E. MAYO, B. POITRAS, S. SCOTT, AND
J. HANLEY. 1994. The Ste-Justine adolescent idiopathic scoliosis cohort study: Part II: perception of
health, self and body image, and participation in
physical activities. Spine 19:1562-1572.
GOODMAN S. N., AND R. ROYALL 1988. Evidence and
scientific research. American Journal of Public
Health 78:1568-1574.
HOUTS, A. C., T. D. COOK, AND W. R. SHADISH. 1986. The
person-situation debate: a critical multiplist perspective. Journal of Personality 54:52-105.
HOWARD A., J. G. WRIGHT, AND D. HEDDEN. 1998. A
comparative study of TSLO, Charleston, Milwaukee braces for idiopathic scoliosis. Spine 23:24042411.
LANTZ C. A., AND J. CHEN. 2001. Effect of chiropractic
intervention on small scoliotic curves in younger
subjects: a time-series cohort design. Journal of
Manipulative and Physiological Therapeutics
24:385-93.
LANTZ C. A., AND J. CHEN. 1996. The effect of chiropractic full-spine adjustments on adolescent idiopathic
scoliosis with curves less than 20 degrees. Proceedings of the international conference on spinal
manipulation. Arlington, VA: Foundation for Chiropractic Education and Research :30-32.
LONSTEIN J. E., AND J. M. CARLSON. 1984. The prediction
of curve progression in untreated idiopathic scoliosis during growth. Journal of Bone Joint SurgeryAmerican 66:1061-71.
MAYO N. E., M. S. GOLDBERG, B. POITRAS, S. SCOTT, AND
J. HANLEY. 1994. The Ste-Justine adolescent idiopathic scoliosis cohort study: Part III: Back pain.
Spine 19:1573-1581.
MONTGOMERY F., AND S. WILLNER. 1988. The natural
history of idiopathic scoliosis. Spine 13:401-404.
PANJABI M. M., A. A. WHITE, AND R. A. BRAND. 1974. A
note on defining body parts configurations. Journal of Biomechanics 7:385-90.
PECINA M., M. DAKOVIC, AND I. BOJANIC. 1992. The
natural history of mild idiopathic scoliosis. Acta
Medicine Croatia 46:75-78.
PLAUGHER G. 1993. Textbook of clinical chiropractic: A
specific biomechanical approach. Williams &
Wilkins, Baltimore.
PLAUGHER G. AND M. A. LOPES. Scoliosis. 1993 Pp. 186187 in G. Plaugher, ed. Textbook of clinical chiropractic: A specific biomechanical approach.
Williams & Wilkins, Baltimore.
PLAUGHER G., M. A. LOPES, P. E. MELCH, AND E. E.
CREMATA. 1991. The inter- and intraexaminer reliability of a paraspinal skin temperature differential
instrument. Journal of Manipulative and Physiological Therapeutics 14:361-367.
POITRAS B., N. E. MAYO, M. S. GOLDBERG, S. SCOTT, AND
J. HANLEY. 1994. The Ste-Justine adolescent idiopathic scoliosis cohort study: Part IV: Surgical
correction and back pain. Spine 19:1582-1588.
R DEVELOPMENT CORE TEAM. 2004. R: A language and
environment for statistical computing. R Foundation for Statistical Computing http://www.Rproject.org Accessed 2006.
ROGALA E. J., D. S. DRUMMOND, AND J. GURR. 1978.
Scoliosis: incidence and natural history: A prospective epidemiological study. Journal of Bone
and Joint Surgery 60A:173-176.
ROWE L. J., AND T. R. YOCHUM. 1987. Measurements in
skeletal radiology. Pp. 169-224 in T. R. Yochum
and L. J. Rowe, ed. Essentials of Skeletal Radiology. Williams & Wilkins, Baltimore.
SIMPSON J. A., AND E. S. C. WEINER. 1999. The Oxford
English Dictionary. Oxford University Press, Oxford.
SOUCACOS P. N., K. ZACHARIS K., J. GELALIS, K. SOULTANIS,
N. KALOS, A. BERIS, T. XENAKIS, AND E. O. JOHNSON,
1998. Assessment of curve progression in idiopathic scoliosis. European Spine Journal 7:270277.
WEINSTEIN S. L., L. A. DOLAN, K. F. SPRATT, K. K.
PETERSON, M. J. SPOONAMORE, AND I. V. PONSETI.
2003. Health and function of patients with untreated idiopathic scoliosis: A 50-year natural history study. Journal of the American Medical
Association 289:559-567.
WEINSTEIN, S. L. 1999. Natural history. Spine 24:25922600.
BAYESIAN ANALYSIS OF SCOLIOSIS! MENKE ET AL.
110
111
BAYESIAN ANALYSIS OF SCOLIOSIS ! MENKE ET AL.
Nonprofit Org
US Postage
PAID
Arizona State
University
YV7 1001
Arizona-Nevada Academy of Science
ARIZONA-NEVADA ACADEMY OF SCIENCE
WILLIAM P. BAKER
BIOMEDICAL SCIENCES PROGRAM
MIDWESTERN UNIVERSITY
19555 N. 59TH AVE
GLENDALE, AZ 85308
RETURN SERVICE REQUESTED
DATED MATERIAL