Voice Source Register Differences in Female Musical Theatre Singers

Eva  Björkner; J. Sundberg

Voice Source Register Differences in Female Musical Theatre Singers Eva Björkner1*†, Johan Sundberg2*, Tom Cleveland3** and Ed Stone4** *Dept. of Speech Music Hearing, KTH, Stockholm, Sweden † Lab.of Acoustics and Audio Signal Processing, Helsinki Univ. of Technology, Finland **Vanderbilt Voice Center, Dept. of Otolaryngology, Vanderbilt Univ., Nashville, TN, USA evab@speech.kth.se, pjohan@speech.kth.se ABSTRACT Musical theatre singing requires the use of two vocal registers in the female voice. The voice source and subglottal pressure Ps characteristics of these registers are analysed by inverse filtering. The relationship between Ps and closed quotient Qclosed, peak-to-peak pulse amplitude Up-t-p, maximum flow declination rate MFDR and the normalised amplitude quotient NAQ were examined. Ps was typically slightly higher in chest than in head register . For typical tokens MFDR and Qclosed were significantly greater while NAQ and Up-t-p were significantly lower in chest than in head. In classical singing mainly the head register is used while in non-classical styles, like pop, jazz and blues, chest is used more commonly. Musical theatre singing, on the other hand, demands a perfect control of both registers. In this repertoire high subglottal pressures are typically used. Such pressures sometimes jeopardise vocal health. To understand the reason for this, a better knowledge of the register function in female singing is needed. Therefore, the present investigation studies the register function in professional female musical theatre singers by analysing their voice source characteristics, paying special attention to the influence of subglottal pressure on these characteristics. 1. INTRODUCTION 2. MATERIAL AND METHODS Vocal register is a phenomenon of great relevance in vocal art, particularly in female singing. An important task in singing training, regardless of style, is to teach the student how to master the transition from one register to the other with minimal timbral changes. The register used in the lower part and in the adjacent higher part of the female pitch range is here referred to as chest and head, respectively. Vocal registers reflect voice source characteristics, e.g., Qclosed, peak-topeak pulse amplitude Up-t-p, and maximum flow declination rate MFDR. All these parameters are heavily influenced by subglottal pressure Ps and glottal adduction. Hence it seemed reasonable to analyse these parameters. The Normalised Amplitude Quotient NAQ defined as the ratio between Up-t-p/(T0*MFDR) [1] seems related to glottal adduction and is correlated with degree of perceived phonatory pressedness [2]. 2.1 Subjects and recording Seven female singers, all classically trained and working professionally as musical theatre actors, volunteered as subjects. Their ages were between 17-43 years and all, except one, had professional experience for duration of 11 to 25 years. Their task was to sing a sequence of the syllable /pae/ at a pitch where they could use both chest and head register. This pitch varied between C4 and G4 for different subjects. They initiated the sequence at high lung volume and at maximum degree of vocal loudness and continued while gradually decreasing vocal loudness. They were asked to perform this sequence three times first in chest and then three times in head register. The vowel /ae/ was chosen since its high first formant adds to the reliability of inverse filtering and the oral pressure during the pocclusion allows estimation of Ps. The flow signal was recorded using the Rothenberg mask, a specially designed pneumotachograph for capturing oral flow. The subject held a plastic tube, inner diameter 4 mm, in the corner of her mouth for recording oral pressure. The audio signal outside the mask was recorded from a microphone at a distance of 30 cm from the lips. All these signals plus an EGG signal (Glottal Enterprises) were recorded on a multi-channel digital recorder [TEAC RD 180 PCM]. 2.2 Analysis The effect of Ps variation on the voice source can be ideally analysed by examining glottal parameters as function of several Ps values. Therefore, for each subject and register ten approximately equidistantly spaced Ps-values were selected. The entire material thus consisted of a total of 140 samples, ten for each register and singer, respectively. Subject MAR produced emphatic /p/ explosions in her head register recordings, as demonstrated by sharply peaked oral pressure peaks [3]. Following the recommendation of Hertegård the estimates of Ps in these cases were taken from the discontinuity appearing in the initial part of the pressure peak. 2.3 Listening test A computerised listening test (Judge, S.Granqvist) was run with a panel of three voice experts. Their task was to rate along a visual analogue scale how representative the various 280 sung samples (10 degrees of vocal loudness x 2 registers, x two presentations of each sample) were of chest and head register. The subjects were presented with a visual analogue rating scale on the computer display, where 0 marked ‘Chest’ and 1000 was marked ‘Head’. The program recorded all settings on text files. Fig 1 shows the standard deviations as function of the ratings averaged across the three raters. The standard deviations were highest in the center of the scale, as expected. For some singers the chest and head register 500 AL COX MAR SUB 450 CIE JUL PAT 400 SD [mm] 350 300 250 200 data are gathered toward the left and right sides of the graph, respectively, indicating that their chest and head register samples were perceived as clear examples of these registers. Singers JUL and AL, in particular, produced samples that differed less clearly. Therefore their data were disregarded. A total of 16% of the samples received mean ratings in the interval 0 – 250 while 49% of the samples received ratings in the interval 7501000. The 17 samples that received ratings in the range of 0-250 were thus accepted as ‘clear cases of chest register’ while the 17 samples that received the highest mean ratings were considered as clear cases of head register. 2.4 Voice source analysis Flow glottograms were obtained using the DeCap custom-made program (S. Granqvist). The formant frequency values were adjusted such that a ripple- free closed phase was obtained. These values were checked, by using them for synthesising the vowel sound in the custom made MADDE synthesiser (Granqvist). From the resulting flow glottograms, period time, Qclosed, Up-t-p and MFDR were measured. In addition the normalised amplitude quotient NAQ was determined. As Ps significantly influences most of these parameters [2] it seemed relevant to examine their variation with Ps for the two registers. 3. RESULTS 3.1 Statistical analysis A 2-way ANOVA was carried out for the clear cases, with register and singer as factors. Due to a technical problem singer SUB`s recordings had to be excluded from the statistical analysis. Results showed register as highly significant (p=0.01) for all parameters. Factor singer was found to be significant for Ps, Up-t-p and MFDR. For MFDR a significant interaction was also found between the two factors. A 2-way ANOVA for all cases showed that register was highly significant for the parameters Ps, Qclosed and NAQ, and singer was highly significant for parameter Ps, Up-t-p, MFDR and NAQ. No significant interaction between register and singer was found for any parameter. 150 100 50 0 0 100 200 300 400 500 600 700 800 900 100 Mean rating [mm] Fig 1. Standard deviations of the three experts’ register ratings as function of the averages of these ratings. Symbols refer to singer subjects, and filled and open symbols pertain to chest and head register, respectively. 3.2 Acoustic/aerodynamic analysis Fig 2 illustrates the differences between the registers in terms of the means across the 17 clear cases. The mean and SD of Ps were Qclose d 5 ,0 0 Mean across clear cases 10 ,0 0 0, 12 Mean across clear cases 15 ,0 0 35 30 25 20 15 10 0, 10 0, 08 0, 06 0, 04 0, 02 5 0 0 ,0 0 NA Q 0 40 Mean across clear cases Mean across clear cases Ch e st He ad 2 0 ,0 0 MFDR U p -t-p 0, 14 45 0, 30 Mean across clear cases Ps 2 5 ,0 0 -50 0, 25 -100 0, 20 -150 0, 15 -200 0, 10 -250 0, 05 -300 0, 00 -350 0, 00 Fig 2. Means of the indicated parameters across singer subjects for the clear examples of the two registers, see text. the individual data underlying the figure, several surprisingly clear trends can be observed. For all subjects sound level was higher in chest (Fig 4a) apparently supporting the typical observation that head register at low F0 is difficult to combine with loud phonation. Fig 4b shows that the higher sound level in chest corresponded to a more negative MFDR. The closed phase was clearly longer in chest, Fig 4c. While Up-t-p did not differ consistently between the registers and varied among singers, Fig 4d, NAQ was consistently lower in chest than in head (Fig 4e). Summarising the Ps difference between the registers cannot account for all of the differences illustrated in Fig 2. 4. DISCUSSION Several precautions were taken to optimise measurement reliability of the inverse filtering and our results showed a systematic variation with Ps that was similar to that found under more ideal experimental conditions. This suggests that our data were reasonably reliable. The availability of several Ps values seems a strong advantage, providing heavy support for the observation that Ps influences a number of flow glottogram characteristics. For example, the variation of Qclosed with Ps was similar to -10 250 b 25 QClosed [%] CIE Qcl osed [%] 20 20 150 -30 20 30 15 25 200 head MFDR SUB COX chest -20 Sound level [dB] 40 30 15 100 e -40 10 e 0,20 0,15 0,10 50 5 10 10 -50 5 0 0,25 NAQ Mean Qclosed MAR PAT d c Qclosed [%] Ps & QClosed 50 0,30 30 a Up-t-p [arb unit] higher for the chest register samples, which also had higher Qclosed, somewhat higher Up-t-p and lower NAQ means. The MFDR values were more negative in the chest. register. Mean Ps values, calculated over each subject’s 10 Ps values in each register, showed higher values for chest than for head for all subjects. Fig 3a illustrates the relationship between Ps and Qclosed. Chest register phonations tended to show higher Qclosed values than head register phonations. The same relationship is illustrated also in fig 3b, showing the relationship between pressure and Qclosed, averaged across subjects. Here pressure is given in terms of the normalized excess pressure PSEN [4]. The dashed curves represent an approximation of the data points by means of a power function (Sundberg & al, 1999). The registers differ clearly with respect to the asymptote, reaching 25% and 20% in chest and head registers, respectively. In addition, the growth of Qclosed with increasing PSEN is quicker in chest register. These findings strongly suggest that a longer closed phase is typical of chest as compared to head register. As Ps heavily influences glottal parameters, comparisons at identical Ps values are informative. All singers had used a Ps value of 11 cm H2O, approximately (Fig.s 4a – e). Considering the random variation inherent in MAR COX CIE SUB 0,05 0 0 MAR COX CIE SUB MAR COX CIE SUB MAR COX CIE SUB 0,00 MAR COX CIE SUB 0 0 5 10 15 20 Pressure [cmH2O] 25 30 0 2 4 6 8 10 12 Normalised Excess Pressure Fig 3. Closed quotient Qclosed as function of subglottal pressure Ps (left panel) and normalised excess pressure (right panel). Filled and open symbols refer to chest and head register, respectively. The right panel shows the mean values for all subjects and the dotted and dashed curves represent the power function that best approximates these data points. Fig. 4. Sound level, MFDR Qclosed, Up-t-p, and NAQ for the singers’ phonations at a Ps of 11 cm H2O, approximately, in chest and head register. that found for professional male singer [5]. The listening test showed that no more than 16% of all sung vowels were classified as clear chest register while 49% were classified as clear head register. One possible reason for this bias could be that it is difficult to differentiate registers in this pitch range, particularly in soft phonation. A contributing factor may be that the subjects sang into a pneumotachograph mask that attenuated the higher spectrum components slightly. As chest register typically had a longer closed phase than head, strong higher spectrum overtones should belong to the characteristics of chest register. Attenuation of such overtones can therefore be expected to reduce the timbral difference between the registers. Our results suggest that modification of Ps and possibly also of glottal adduction are needed for changing from chest to head register or vice versa. As female musical theatre singers use both registers they would need a refined control of both breathing and phonation muscles. According to our observations chest register is characterised by a high Ps and a greater Qclosed and by a lower Up-t-p and NAQ. In female untrained voices’ chest register Qclosed was found to be higher than in head register [6]. Comparing professional baritones’, tenors’ and counter tenors’ modal and falsetto registers Sundberg and Högset [7] found that Ps and Qclosed were higher, glottal leakage smaller, and the fundamental was weaker in chest register. These observations are compatible with or similar to the findings of the present study. They suggest that much of the voice source differences, between these registers, can be explained if the vocal folds are assumed to be thicker in the modal/chest register. Thicker folds would be associated with a longer phase lag between the upper and lower margins of the vocal folds, which should cause a more extended closed phase. 5. CONCLUSIONS The chest and head register voice source in female musical theatre singers differ in several respects. In typical tokens of chest register Ps and MFDR are higher, Qclosed is greater while Up-t-p and NAQ are lower than in head register. Register differences are perceptually clearer in loud than in soft phonation. The results also show that Ps has a strong influence on flow glottogram parameters. As NAQ seems associated with degree of perceived phonatory pressedness, the low NAQ values for chest register suggests a more adducted phonation as compared to head register. 6. ACKNOWLEDGEMENTS The authors are indebted to the singers for their kind participation, to Paavo Alku for support and valuable discussions, and to Mattias Heldner for the statistical analysis. The experiment was carried out at the Vanderbilt University Voice Center, head Dr Robert Ossoff. Financial support for Eva Björkner’s work with this investigation was provided by the European Community's Human Potential Programme under contract HPRN-CT-200200276 [HOARSE-network]. A previous version of present study has been published at ICA2004 in Kyoto, Japan. 7. REFERENCES [1] Alku P, Bäckström T, Vilkman E (2002) Normalized amplitude quotient for parameterization of the glottal flow,' J Acoust Soc Amer, 112, 701-710. [2] Sundberg J, Thalén M, Alku P, Vilkman E (2002) Estimating perceived phonatory pressedness in singing from flow glottograms, TMH-QPSR 43, 89-96. [3] Hertegård S, Gauffin J, Lindestad P-Å (1995) A comparison of subglottal and intraoral pressure measurements during phonation, J Voice 9, 149-155. [4]Titze I (1992) Phonation threshold pressure: a missing link in glottal aerodynamics, J Acoust Soc Amer 91, 29262935. [5] Sundberg J, Andersson M, Hultqvist C (1999) Effects of subglottal pressure variation on professional baritone singers’ voice sources, J Acoust Soc Amer 105, 1965-1971. [6] Sundberg J, Kullberg Å, (1999) Voice source studies of register differences in untrained female singing. Logopedics Phoniatrics Vocology 24, 76-83. [7] Sundberg J & Högset C (2001) Voice source differences between falsetto and modal registers in counter tenor, tenor, and baritone singers. Logopedics Phoniatrics Vocology 26, 26-36.

RELATED PAPERS

RELATED TOPICS

Log In

Voice Source Register Differences in Female Musical Theatre Singers

Voice Source Register Differences in Female Musical Theatre Singers

Related Papers

RELATED PAPERS

RELATED TOPICS