The Vowel as an Acoustic, Emotional, and Biological Stimulus in Music Pedagogy

Authors

DOI:

https://doi.org/10.24193/subbpsyped.2026.1.13

Keywords:

vowels, phonoaesthetics, formant architecture, music pedagogy, biological regulation, emotional resonance

Abstract

This research explores the multidimensional role of vowels as acoustic, emotional, and biological stimuli within the context of music pedagogy. The study argues that vowels are not neutral phonological units but active triggers of affective and physiological processes. Using the formant architecture of the standard Serbian language as a baseline, the analysis demonstrates how specific vowel configurations are instinctively associated with spatial and emotional archetypes. Through the lens of the facial feedback hypothesis, the research shows that the mechanical activation of orofacial muscles during vowel production functions as a somatic stimulus, modulating the performer’s internal affective state. Neurophysiological evidence further confirms that vowels elicit stable cortical responses, while empirical studies highlight their capacity to regulate biomarkers of stress, immunity, and social bonding through collective singing. By integrating insights from acoustic phonetics, biomechanics, and psychophysiology, this work repositions the vowel as a fundamental pedagogical tool – one that simultaneously shapes technical vocalization, authentic emotional expression, and biological regulation. The findings advocate for a holistic approach to music education, particularly for children and adolescents, where vowel articulation fosters systemic balance, resilience, and artistic integrity.

References

Benson, E. A. (2020). Training Contemporary Commercial Singers. Compton Publishing.

Benson, E. A. & Rosen, K. (2023). Anti-Fat Bias in the Singing Voice Studio, Part Two: How to Make a Size Inclusive Voice Studio. Journal of Singing, 79(5), 653–660.

Berlin, B. (2006). Evidence for pervasive sound symbolism in ethnozoological nomenclature. In L. Hinton, J. Nichols, & J. J. Ohala (Eds.), Sound Symbolism (pp. 76–93). Cambridge University Press.

Blacking, J. (1973). How Musical is Man? University of Washington Press.

Bowling, D.L., Gahr, J., Ancochea, G., Hoeschele, M., Canoine, V., Fusani, L. & Fitch, W.T. (2022). Endogenous oxytocin, cortisol, and testosterone in response to group singing. Hormons and Behaviour, doi: 10.1016/j.yhbeh.2021.105105.

Bozeman, K. (2017). The pedagogic use of absolute spectral tone color theory. Journal of Singing, 74(2), 179–183.

Brandner, M., Frank, M. & Sontacchi, A. (2022). Horizontal and Vertical Voice Directivity Characteristics of Sung Vowels in Classical Singing. Acoustics, 4(4), 849–866. https://doi.org/10.3390/acoustics4040051

Canessa‑Pollard, V., Anikin, A. & Reby, D. (2025). Chants across seven traditions share acoustic traits that enhance subjective relaxation. Proceedings of the National Academy of Sciences of the United States of America, 122(35): e2506480122. doi: 10.1073/pnas.2506480122.

Curtis, M. E. & Bharucha, J. J. (2010). The Minor Third Communicates Sadness in Speech, Mirroring Its Use in Music. Emotion, 10(3), 335–348.

Darwin, C. (1872). The Expression of the Emotions in Man and Animals. London: John Murray.

Davitz, J. R. (1964). The communication of emotional meaning. New York: McGraw-Hill.

De Jong, K. J. (1995). The Supraglottal Articulation of Prominence in English: Linguistic Stress as Localized Hyperarticulation. Journal of the Acoustical Society of America, 97: 491–504.

Ekman, P., Friesen, W. V. & Ellsworth, P. (1982). What are the relative contributions of facial behavior and contextual information to the judgment of emotion? In P. Ekman (Ed.), Emotion in the human face (pp. 111–127). Cambridge: Cambridge University Press.

Ekström, A. G. (2022). What's next for size-sound symbolism? Frontiers in Language Sciences, 1, Article 1046637. https://doi.org/10.3389/flang.2022.1046637

Fancourt, D., & Perkins, R. (2018). The effects of singing interventions on cortisol, oxytocin, and cytokine levels. Evidence-Based Complementary and Alternative Medicine, 1–12.

Fitch, W. T. (2000). The evolution of speech: A comparative review. Trends in Cognitive Sciences, 4(7), 258–267.

Flemming, E. (2004). Contrast and perceptual distinctiveness. In B. Hayes, R. Kirchner & D. Steriade (Eds.), Phonetically based phonology (pp. 232–276). Cambridge: Cambridge University Press. doi:10.1017/CBO9780511486401.008

Good, A. & Russo, F. A. (2021). Changes in mood, oxytocin, and cortisol following group and individual singing: A pilot study. Psychology of Music, 50(4). https://doi.org/10.1177/03057356211042668

Habibi, A. & Damasio, A. (2014). Music, feelings, and the human brain. Psychomusicology: Music, Mind, and Brain, 24(1), 92–102.

Helmholtz, H. von (1863). Sensations of Tone as a Psychological Basis for the Theory of Music. London: Longmans, Green and Co.

Howell, I. (2015). Absolute Spectral Tone Color: A Psychoacoustic Concept Applied to Voice Pedagogy. New England Conservatory.

Igna, O. N. & Ma, H. (2025). Vocal pedagogy as a contemporary scientific direction. Education and Pedagogy Journal, 2, 41–58. doi: 10.23951/2782-2575-2025-2-41-58

Jakobson, R. & Halle, M. (1956). Fundamentals of Language. Mouton.

Jespersen, O. (1922). Language; It’s Nature, Development and Origin. London: George Allen & Unwin Ltd.

Kostić, D., Nestorović, M. & Kalić, D. (1964). Akustička fonetika srpskohrvatskog jezika. Beograd: Institut za eksperimentalnu fonetiku i patologiju govora.

Kienast, M. & Sendlmeier, W. F. (2000). Acoustical analysis of spectral and temporal changes in emotional speech. Institute of Communication science, Technical University of Berlin, Germany.

Kreutz, G., Bongard, S., Rohrmann, S., Hodapp, V. & Grebe, D. (2004). Effects of choir singing or listening on secretory immunoglobulin A, cortisol, and emotional state. Journal of Behavioral Medicine, 27(6), 623–635.

Ladefoged, P. (1964). A Phonetic Study of West African Languages – An Auditory-Instrumental Survey. Cambridge: Cambridge University Press.

Lee, S., Yildirim, S., Kazemzadeh, A. & Narayanan, S. S. (2005). An articulatory study of emotional speech production. In: Proceedings of Interspeech, pp. 497–500.

Li, W., Mazlan, C.A.N. & Safian, A.R (2026). 21st-century skills in vocal pedagogy: a scoping review. Frontiers of Education, 11. https://doi.org/10.3389/feduc.2026.1744952

Lieberman, P. (1988). The Biology and Evolution of Language. Harvard University Press.

Lomonosov, M. (1755). Rossiyskaya grammatika [Russian Grammar]. Imperial Academy of Sciences.

Milankovic, V., Petrovic, M. and Petrovic, J. (2003). Tone constellation: a personal spatial scale presentation. In R. Kopiez, A. C. Lehmann, I. Wolther and C. Wolf (Eds.), Proceedings of the 5th Triennial ESCOM Conference (pp. 413–415), Hanover University of Music and Drama, Germany.

Morales-Luque, C., Carrillo-Franco, L., López-González, M.V., González-García, M., Dawid-Milner, M.S. (2025). Mapping the Neurophysiological Link Between Voice and Autonomic Function: A Scoping Review. Biology (Basel), 14(10):1382. doi: 10.3390/biology14101382.

Morton, E. S. (1977). On the occurrence and significance motivation-structural rules in some bird and mammal sounds. American Naturalist, 111, 855–869.

Murray, I. R. and Arnott, J. L. (1993). Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion. Journal of the Acoustical Society of America, 93, 1097–1108.

Nair, A., Schellenberg, M. & Gick, B. (2015). A Case Study on the Efficacy of Ultrasound Biofeedback in Voice Pedagogy. In M. Wolters, J. Livingstone, B. Beattie, R. Smith, M. MacMahon, J. Stuart-Smith, J. M. Scobbie (Eds.), 18th Proceedings of the International Congress of Phonetic Sciences (10-14 August 2015). Glasgow, Scotland UK: University of Glasgow.

Nastase, V., Sokolova, M. & Sayyad Shirabad J. (2007). Do happy words sound happy? A study of the relation between form and meaning for English words expressing emotions. In: Proceedings of the Recent Advances in Natural Language Processing (pp. 406–410).

Nettl, B. (1954). Text-music relationships in Arapaho songs. Southwestern Journal of Anthropology, 10m, 192–199.

Newman, S. S. (1933). Further experiments in phonetic symbolism. American Journal of Psychology, 45(1), 53-75.

Nordstrand, M., Svanfeldt, G., Granstrom, B. & House, D. (2004). Measurements of articulatory variation in expressive speech for a set of Swedish vowels. Speech Communication, 44, 187–196.

Obleser, J., Elbert, T., Lahiri, A., & Eulitz, C. (2003). Cortical representation of vowels reflects acoustic dissimilarity determined by formant frequencies. Brain Research. Cognitive Brain Research, 15(3): 207-13. doi: 10.1016/s0926-6410(02)00193-3.

Oller, D. K. (1980). The Emergency of the Sounds of Speech in Infancy – Chapter 6. In G. H. Yeni-Komshian, J. F. Kavanagh & C. A. Ferguson (Eds.), Child Phonology (pp. 93–112). https://doi.org/10.1016/B978-0-12-770601-6.50011-5.

Paget R. (1930). Human Speech – Some Observations, Experiments and Conclusions as to the Nature, Origin, Purpose and Possible Improvement of Human Speech. London: Kegan Paul, Trench, Trubner & Co.

Petrović, M. (2017). The Importance of Vowels in Music Education. London: iMerc.

Petrović, M., Pekić, S., Soldatović, I. & Savović, S. (2025). Oxytocin and Cortisol in Children in Response to Group Singing. In A. Branković & S. Dabić (Eds.), Program and Book of Abstracts of the 28th International Pedagogical Forum of Performing Arts „In Search of a Model of Sustainable Music Education“ (pp. 63–66). Belgrade: Faculty of Music.

Philippot, P., Chapelle, G. & Blairy, S. (2002). Respiratory feedback in the generation of emotion. Cognition and Emotion, 16(5), 605–627.

Pulvermüller, F., Huss, M., Kherif, F., del Prado Martin F. M., Hauk, O. & Shtyrov, Y. (2006). Motor cortex maps articulatory features of speech sounds. Proceedings of the National Academy of Sciences of the United States of America, 103(20), 7865-70. doi: 10.1073/pnas.0509989103.

Reybrouck, M. (2021). Musical Sense-Making. Enaction, Experience, and Computation. Routledge: Abingdon, UK; New York, NY, USA.

Ringeval, F., Chetouani, M. et al. (2008). Automatic Prosodic Disorders Analysis for Impared Communication Children. Paper presented at the workshop on Child, Computer and Interaction. Chania, Greece.

Rummer, R., Schweppe, J., Schlegelmilch, R. & Grice, M. (2014). Mood is linked to vowel type: The role of articulatory movements. Emotion, 14(2), 246–250.

Sapir, E. (1929). A study in phonetic symbolism. Journal of Experimental Psychology, 12(3), 225–239.

Saus, W., Seither-Preisler, A. & Schneider, P. (2025). Harmonic vowels and neural dynamics: MEG evidence for auditory resonance integration in singing. Frontiers of Neuroscience, 19. https://doi.org/10.3389/fnins.2025.1625403

Scherer, K. R. (1986). Vocal affect expression: a review and a model for future research. Psychological Bulletin, 99, 143–165.

Scherer, K. R. & Wallbott, H. G. (1994). Evidence for universality and cultural variation of differential emotion response patterning. Journal of Personality and Social Psychology, 66(2), 310–328. https://doi.org/10.1037/0022-3514.66.2.310

Spencer, H. (1855). Principles of Psychology, Third edition, Vol 2. New York: Appleton.

Stark, R. E. (1981). Infant vocalization: A comprehensive view. Infant Mental Health, 2(2), 118–128. https://doi.org/10.1002/1097-0355(198122)2:2<118::AID-IMHJ2280020208>3.0.CO;2-5

Sundberg, J. (1987). The Science of the Singing Voice. Dekalb, Illinois: Northern Illinois University Press.

Stevens, K. (1998). Acoustic Phonetics. The MIT Press.

Titze, I. R. (1994). Principles of Voice Production. Englewood Cliffs, NJ: Prentice Hall.

Tsao, F-M., Huei-Mei Liu, H-M. & Kuhl, P. K. (2004). Speech perception in infancy predicts language development in the second year of life: a longitudinal study. Child Development, 75(4):1067-84. doi: 10.1111/j.1467-8624.2004.00726.x.

Ultan, R. (1978). Size-sound symbolism. Universals of Human Language, 2, 525–568.

Vainio, L., Myllylä, I. L. & Vainio, M. (2024). Sound symbolism in manual and vocal responses: phoneme-response interactions associated with grasping as well as vertical and size dimensions of keypresses. Cognitive Processing, 25, 363–378. https://doi.org/10.1007/s10339-024-01188-y

Vainio, L., Wikström, A., Repetto, C. & Vainio, M. (2023). Sound-symbolic association between speech sound and spatial meaning in relation to the concepts of up/down and above/below. Language and Cognition: An Interdisciplinary Journal of Language and Cognitive Science, 15(4), 884–903. https://doi.org/10.1017/langcog.2023.31

Wasz-Höckert, O., Michelsson, K. & Lind, J. (1985). Twenty-five years of Scandinavian cry research. In Lester, B. M. & Boukydis, C. F. Z. (Eds.), Infant Crying: Theoretical and Research Perspectives (pp. 83–104). New York, NY: Plenum Press.

Welch G. F., Biasutti M., MacRitchie J., McPherson G. E. & Himonides E. (2020). Editorial: The impact of music on human development and well-being. Frontiers of Psychology, 11:1246. 10.3389/fpsyg.2020.01246

Yildirim, M., Bulut, C., Kazemzadeh, L., Busso, C., Deng, Z., Lee, S. & Narayanan, S. (2004). An acoustic study of emotions expressed. In: Proceedings of the 8th International Conference on Spoken Language Processing, Jeju Island, Korea.

Downloads

Published

2026-06-02

How to Cite

PETROVIĆ, M. (2026). The Vowel as an Acoustic, Emotional, and Biological Stimulus in Music Pedagogy. Studia Universitatis Babeș-Bolyai Psychologia-Paedagogia, 71(1), 303–320. https://doi.org/10.24193/subbpsyped.2026.1.13

Issue

Section

Articles

Similar Articles

1 2 3 4 > >> 

You may also start an advanced similarity search for this article.