Voice Source Register Differences in Female Musical Theatre Singers
Eva Björkner1*†, Johan Sundberg2*, Tom Cleveland3** and Ed Stone4**
*Dept. of Speech Music Hearing, KTH, Stockholm, Sweden
† Lab.of Acoustics and Audio Signal Processing, Helsinki Univ. of Technology, Finland
**Vanderbilt Voice Center, Dept. of Otolaryngology, Vanderbilt Univ., Nashville, TN, USA
evab@speech.kth.se, pjohan@speech.kth.se
ABSTRACT
Musical theatre singing requires the use of two
vocal registers in the female voice. The voice
source and subglottal pressure Ps
characteristics of these registers are analysed
by inverse filtering. The relationship between
Ps and closed quotient Qclosed, peak-to-peak
pulse amplitude Up-t-p, maximum flow
declination rate MFDR and the normalised
amplitude quotient NAQ were examined. Ps
was typically slightly higher in chest than in
head register . For typical tokens MFDR and
Qclosed were significantly greater while NAQ
and Up-t-p were significantly lower in chest than
in head.
In classical singing mainly the head register is
used while in non-classical styles, like pop,
jazz and blues, chest is used more commonly.
Musical theatre singing, on the other hand,
demands a perfect control of both registers. In
this repertoire high subglottal pressures are
typically used. Such pressures sometimes
jeopardise vocal health. To understand the
reason for this, a better knowledge of the
register function in female singing is needed.
Therefore, the present investigation studies the
register function in professional female
musical theatre singers by analysing their voice
source characteristics, paying special attention
to the influence of subglottal pressure on these
characteristics.
1. INTRODUCTION
2. MATERIAL AND METHODS
Vocal register is a phenomenon of great
relevance in vocal art, particularly in female
singing. An important task in singing training,
regardless of style, is to teach the student how
to master the transition from one register to the
other with minimal timbral changes. The
register used in the lower part and in the
adjacent higher part of the female pitch range
is here referred to as chest and head,
respectively. Vocal registers reflect voice
source characteristics, e.g., Qclosed, peak-topeak pulse amplitude Up-t-p, and maximum
flow declination rate MFDR. All these
parameters are heavily influenced by subglottal
pressure Ps and glottal adduction. Hence it
seemed reasonable to analyse these parameters.
The Normalised Amplitude Quotient NAQ
defined as the ratio between Up-t-p/(T0*MFDR)
[1] seems related to glottal adduction and is
correlated with degree of perceived phonatory
pressedness [2].
2.1 Subjects and recording
Seven female singers, all classically trained
and working professionally as musical theatre
actors, volunteered as subjects. Their ages
were between 17-43 years and all, except one,
had professional experience for duration of 11
to 25 years. Their task was to sing a sequence
of the syllable /pae/ at a pitch where they could
use both chest and head register. This pitch
varied between C4 and G4 for different
subjects. They initiated the sequence at high
lung volume and at maximum degree of vocal
loudness and continued while gradually
decreasing vocal loudness. They were asked to
perform this sequence three times first in chest
and then three times in head register. The
vowel /ae/ was chosen since its high first
formant adds to the reliability of inverse
filtering and the oral pressure during the pocclusion allows estimation of Ps.
The flow signal was recorded using the
Rothenberg mask, a specially designed
pneumotachograph for capturing oral flow.
The subject held a plastic tube, inner diameter
4 mm, in the corner of her mouth for recording
oral pressure. The audio signal outside the
mask was recorded from a microphone at a
distance of 30 cm from the lips. All these
signals plus an EGG signal (Glottal
Enterprises) were recorded on a multi-channel
digital recorder [TEAC RD 180 PCM].
2.2 Analysis
The effect of Ps variation on the voice source
can be ideally analysed by examining glottal
parameters as function of several Ps values.
Therefore, for each subject and register ten
approximately equidistantly spaced Ps-values
were selected. The entire material thus
consisted of a total of 140 samples, ten for
each register and singer, respectively.
Subject MAR produced emphatic /p/
explosions in her head register recordings, as
demonstrated by sharply peaked oral pressure
peaks [3]. Following the recommendation of
Hertegård the estimates of Ps in these cases
were taken from the discontinuity appearing in
the initial part of the pressure peak.
2.3 Listening test
A computerised listening test (Judge,
S.Granqvist) was run with a panel of three voice
experts. Their task was to rate along a visual
analogue scale how representative the various
280 sung samples (10 degrees of vocal
loudness x 2 registers, x two presentations of
each sample) were of chest and head register.
The subjects were presented with a visual
analogue rating scale on the computer display,
where 0 marked ‘Chest’ and 1000 was marked
‘Head’. The program recorded all settings on
text files. Fig 1 shows the standard deviations
as function of the ratings averaged across the
three raters. The standard deviations were
highest in the center of the scale, as expected.
For some singers the chest and head register
500
AL
COX
MAR
SUB
450
CIE
JUL
PAT
400
SD [mm]
350
300
250
200
data are gathered toward the left and right sides
of the graph, respectively, indicating that their
chest and head register samples were perceived
as clear examples of these registers. Singers
JUL and AL, in particular, produced samples
that differed less clearly. Therefore their data
were disregarded.
A total of 16% of the samples received mean
ratings in the interval 0 – 250 while 49% of the
samples received ratings in the interval 7501000. The 17 samples that received ratings in
the range of 0-250 were thus accepted as ‘clear
cases of chest register’ while the 17 samples
that received the highest mean ratings were
considered as clear cases of head register.
2.4 Voice source analysis
Flow glottograms were obtained using the
DeCap custom-made program (S. Granqvist).
The formant frequency values were adjusted
such that a ripple- free closed phase was
obtained. These values were checked, by using
them for synthesising the vowel sound in the
custom made MADDE synthesiser (Granqvist).
From the resulting flow glottograms, period
time, Qclosed, Up-t-p and MFDR were measured.
In addition the normalised amplitude quotient
NAQ was determined. As Ps significantly
influences most of these parameters [2] it
seemed relevant to examine their variation
with Ps for the two registers.
3. RESULTS
3.1 Statistical analysis
A 2-way ANOVA was carried out for the clear
cases, with register and singer as factors. Due
to a technical problem singer SUB`s recordings
had to be excluded from the statistical analysis.
Results showed register as highly significant
(p=0.01) for all parameters. Factor singer was
found to be significant for Ps, Up-t-p and MFDR.
For MFDR a significant interaction was also
found between the two factors. A 2-way
ANOVA for all cases showed that register was
highly significant for the parameters Ps, Qclosed
and NAQ, and singer was highly significant for
parameter Ps, Up-t-p, MFDR and NAQ. No
significant interaction between register and
singer was found for any parameter.
150
100
50
0
0
100
200
300
400
500
600
700
800
900
100
Mean rating [mm]
Fig 1. Standard deviations of the three experts’ register
ratings as function of the averages of these ratings.
Symbols refer to singer subjects, and filled and open
symbols pertain to chest and head register, respectively.
3.2 Acoustic/aerodynamic analysis
Fig 2 illustrates the differences between the
registers in terms of the means across the 17
clear cases. The mean and SD of Ps were
Qclose d
5 ,0 0
Mean across clear cases
10 ,0 0
0, 12
Mean across clear cases
15 ,0 0
35
30
25
20
15
10
0, 10
0, 08
0, 06
0, 04
0, 02
5
0
0 ,0 0
NA Q
0
40
Mean across clear cases
Mean across clear cases
Ch e st
He ad
2 0 ,0 0
MFDR
U p -t-p
0, 14
45
0, 30
Mean across clear cases
Ps
2 5 ,0 0
-50
0, 25
-100
0, 20
-150
0, 15
-200
0, 10
-250
0, 05
-300
0, 00
-350
0, 00
Fig 2. Means of the indicated parameters across singer subjects for the clear examples of the two registers, see text.
the individual data underlying the figure,
several surprisingly clear trends can be
observed. For all subjects sound level was
higher in chest (Fig 4a) apparently supporting
the typical observation that head register at low
F0 is difficult to combine with loud phonation.
Fig 4b shows that the higher sound level in
chest corresponded to a more negative MFDR.
The closed phase was clearly longer in chest,
Fig 4c. While Up-t-p did not differ consistently
between the registers and varied among
singers, Fig 4d, NAQ was consistently lower in
chest than in head (Fig 4e). Summarising the Ps
difference between the registers cannot
account for all of the differences illustrated in
Fig 2.
4. DISCUSSION
Several precautions were taken to optimise
measurement reliability of the inverse filtering
and our results showed a systematic variation
with Ps that was similar to that found under
more ideal experimental conditions. This
suggests that our data were reasonably reliable.
The availability of several Ps values seems a
strong advantage, providing heavy support for
the observation that Ps influences a number of
flow glottogram characteristics. For example,
the variation of Qclosed with Ps was similar to
-10
250
b
25
QClosed [%]
CIE
Qcl osed [%]
20
20
150
-30
20
30
15
25
200
head
MFDR
SUB
COX
chest
-20
Sound level [dB]
40
30
15
100
e
-40
10
e
0,20
0,15
0,10
50
5
10
10
-50
5
0
0,25
NAQ
Mean Qclosed
MAR
PAT
d
c
Qclosed [%]
Ps & QClosed
50
0,30
30
a
Up-t-p [arb unit]
higher for the chest register samples, which
also had higher Qclosed, somewhat higher Up-t-p
and lower NAQ means. The MFDR values
were more negative in the chest. register.
Mean Ps values, calculated over each subject’s
10 Ps values in each register, showed higher
values for chest than for head for all subjects.
Fig 3a illustrates the relationship between Ps
and Qclosed. Chest register phonations tended to
show higher Qclosed values than head register
phonations. The same relationship is illustrated
also in fig 3b, showing the relationship
between pressure and Qclosed, averaged across
subjects. Here pressure is given in terms of the
normalized excess pressure PSEN [4]. The
dashed curves represent an approximation of
the data points by means of a power function
(Sundberg & al, 1999). The registers differ
clearly with respect to the asymptote, reaching
25% and 20% in chest and head registers,
respectively. In addition, the growth of Qclosed
with increasing PSEN is quicker in chest
register. These findings strongly suggest that a
longer closed phase is typical of chest as
compared to head register.
As Ps heavily influences glottal parameters,
comparisons at identical Ps values are
informative. All singers had used a Ps value of
11 cm H2O, approximately (Fig.s 4a – e).
Considering the random variation inherent in
MAR
COX
CIE
SUB
0,05
0
0
MAR
COX
CIE
SUB
MAR
COX
CIE
SUB
MAR
COX
CIE
SUB
0,00
MAR
COX
CIE
SUB
0
0
5
10
15
20
Pressure [cmH2O]
25
30
0
2
4
6
8
10
12
Normalised Excess Pressure
Fig 3. Closed quotient Qclosed as function of subglottal
pressure Ps (left panel) and normalised excess pressure
(right panel). Filled and open symbols refer to chest
and head register, respectively. The right panel shows
the mean values for all subjects and the dotted and
dashed curves represent the power function that best
approximates these data points.
Fig. 4. Sound level, MFDR Qclosed, Up-t-p, and NAQ for the
singers’ phonations at a Ps of 11 cm H2O, approximately,
in chest and head register.
that found for professional male singer [5]. The
listening test showed that no more than 16% of
all sung vowels were classified as clear chest
register while 49% were classified as clear
head register. One possible reason for this bias
could be that it is difficult to differentiate
registers in this pitch range, particularly in soft
phonation. A contributing factor may be that
the subjects sang into a pneumotachograph
mask that attenuated the higher spectrum
components slightly. As chest register typically
had a longer closed phase than head, strong
higher spectrum overtones should belong to the
characteristics of chest register. Attenuation of
such overtones can therefore be expected to
reduce the timbral difference between the
registers.
Our results suggest that modification of Ps and
possibly also of glottal adduction are needed
for changing from chest to head register or vice
versa. As female musical theatre singers use
both registers they would need a refined
control of both breathing and phonation
muscles. According to our observations chest
register is characterised by a high Ps and a
greater Qclosed and by a lower Up-t-p and NAQ.
In female untrained voices’ chest register
Qclosed was found to be higher than in head
register [6]. Comparing professional
baritones’, tenors’ and counter tenors’ modal
and falsetto registers Sundberg and Högset [7]
found that Ps and Qclosed were higher, glottal
leakage smaller, and the fundamental was
weaker in chest register. These observations
are compatible with or similar to the findings
of the present study. They suggest that much of
the voice source differences, between these
registers, can be explained if the vocal folds
are assumed to be thicker in the modal/chest
register. Thicker folds would be associated
with a longer phase lag between the upper and
lower margins of the vocal folds, which should
cause a more extended closed phase.
5. CONCLUSIONS
The chest and head register voice source in
female musical theatre singers differ in several
respects. In typical tokens of chest register Ps
and MFDR are higher, Qclosed is greater while
Up-t-p and NAQ are lower than in head register.
Register differences are perceptually clearer in
loud than in soft phonation. The results also
show that Ps has a strong influence on flow
glottogram parameters. As NAQ seems
associated with degree of perceived phonatory
pressedness, the low NAQ values for chest
register suggests a more adducted phonation as
compared to head register.
6. ACKNOWLEDGEMENTS
The authors are indebted to the singers for their
kind participation, to Paavo Alku for support
and valuable discussions, and to Mattias
Heldner for the statistical analysis. The
experiment was carried out at the Vanderbilt
University Voice Center, head Dr Robert
Ossoff.
Financial support for Eva Björkner’s work
with this investigation was provided by the
European Community's Human Potential
Programme under contract HPRN-CT-200200276 [HOARSE-network].
A previous version of present study has been
published at ICA2004 in Kyoto, Japan.
7. REFERENCES
[1] Alku P, Bäckström T, Vilkman E (2002)
Normalized amplitude quotient for
parameterization of the glottal flow,' J Acoust
Soc Amer, 112, 701-710.
[2] Sundberg J, Thalén M, Alku P, Vilkman E
(2002) Estimating perceived phonatory
pressedness in singing from flow glottograms,
TMH-QPSR 43, 89-96.
[3] Hertegård S, Gauffin J, Lindestad P-Å
(1995) A comparison of subglottal and
intraoral pressure measurements during
phonation, J Voice 9, 149-155.
[4]Titze I (1992) Phonation threshold
pressure: a missing link in glottal
aerodynamics, J Acoust Soc Amer 91, 29262935.
[5] Sundberg J, Andersson M, Hultqvist C
(1999) Effects of subglottal pressure variation
on professional baritone singers’ voice
sources, J Acoust Soc Amer 105, 1965-1971.
[6] Sundberg J, Kullberg Å, (1999) Voice
source studies of register differences in
untrained female singing. Logopedics
Phoniatrics Vocology 24, 76-83.
[7] Sundberg J & Högset C (2001) Voice
source differences between falsetto and modal
registers in counter tenor, tenor, and baritone
singers. Logopedics Phoniatrics Vocology 26,
26-36.