The speaker’s gender is a crucial factor affecting the acoustic features of the voice. One such feature is voice intensity, also known as sound pressure level (SPL). Previous studies have indicated that the female voice may involve lower values of SPL than the male voice. Moreover, there are suggestions that the variability of voice intensity tends to be lower for women than for men as well.
The major aim of this paper is to examine the effects of a literary character’s gender on the reader’s SPL, measured in decibels (dB), and on the variability of voice intensity, measured as the standard deviation (SD) of SPL, when prose is read aloud. The secondary aims are to investigate the general shifts of SPL and the SD of SPL in dialogues independently of other variables and to consider the possible effects of the reader’s gender and the reader’s dialect. To accomplish these tasks, a representative sample of dialogue excerpts with male and female characters was used. Each fragment was located in the corresponding audiobook and analysed in terms of the two acoustic features under discussion. Typical values of SPL and the SD of SPL for each reader were measured across the entire chapters from which the fragments were selected, and these values were compared with those obtained from the extracts. In this way, it was possible to establish the relative shifts of SPL and the SD of SPL for each of the analysed fragments.
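The relative-shift measure described above can be illustrated with a minimal sketch. It is not the procedure used in the study, whose exact settings are not given here. The function names, the fixed frame length, and the choice to average per-frame dB values are all assumptions made for illustration. Because audiobook recordings are not calibrated, the levels are expressed relative to an arbitrary reference rather than as true SPL; the difference between two levels in dB does not depend on that reference, so the shift is still meaningful.

```python
import math
import statistics


def frame_levels_db(samples, frame_len, ref=1.0):
    """Return the RMS level in dB (re `ref`) of each full, non-silent frame.

    `ref` is an arbitrary reference amplitude (assumption): without a
    calibrated recording the result is a relative level, not true SPL.
    """
    levels = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        rms = math.sqrt(sum(x * x for x in frame) / frame_len)
        if rms > 0:  # skip digital silence, where the logarithm is undefined
            levels.append(20 * math.log10(rms / ref))
    return levels


def level_summary(samples, frame_len):
    """Mean level and SD of level over frames (needs at least two frames)."""
    levels = frame_levels_db(samples, frame_len)
    return statistics.mean(levels), statistics.stdev(levels)


def relative_shift(excerpt, chapter, frame_len):
    """Shift of an excerpt's mean level and SD of level from its chapter."""
    ex_mean, ex_sd = level_summary(excerpt, frame_len)
    ch_mean, ch_sd = level_summary(chapter, frame_len)
    return ex_mean - ch_mean, ex_sd - ch_sd


if __name__ == "__main__":
    # Toy signals (hypothetical data): a chapter with one loud and one
    # quiet frame, and an excerpt spoken at a steady intermediate level.
    chapter = [1.0] * 4 + [0.1] * 4
    excerpt = [0.5] * 8
    shift_mean, shift_sd = relative_shift(excerpt, chapter, frame_len=4)
    print(f"level shift: {shift_mean:.2f} dB, SD shift: {shift_sd:.2f} dB")
```

A negative SD shift in this sketch would correspond to a dialogue excerpt read with less intensity variation than its chapter as a whole.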
Contrary to expectations, a statistical analysis of the results revealed no effects of the character’s gender on either of the response variables. However, the conclusions concerning the secondary aims were more definitive. A general tendency for the SD of SPL to decrease in dialogues in comparison to the rest of the text in a novel was observed. This tendency is independent of all the factors included in the study. It was also observed that male American readers tend to lower their voice intensity when reading dialogues. All these findings may be applied in developing text-to-speech software.