Perceptual scaling of voice identity: common dimensions for different vowels and speakers

Oliver Baumann, Pascal Belin

Research output: Contribution to journalArticleResearchpeer-review

68 Citations (Scopus)

Abstract

THE AIMS OF OUR STUDY WERE: (1) to determine if the acoustical parameters used by normal subjects to discriminate between different speakers vary when comparisons are made between pairs of two of the same or different vowels, and if they are different for male and female voices; (2) to ask whether individual voices can reasonably be represented as points in a low-dimensional perceptual space such that similarly sounding voices are located close to one another. Subjects were presented with pairs of voices from 16 male and 16 female speakers uttering the three French vowels "a", "i" and "u" and asked to give speaker similarity judgments. Multidimensional analyses of the similarity matrices were performed separately for male and female voices and for three types of comparisons: same vowels, different vowels and overall average. The resulting dimensions were then interpreted a posteriori in terms of relevant acoustical measures. For both male and female voices, a two-dimensional perceptual space was found to be most appropriate, with axes largely corresponding to contributions of the larynx (pitch) and supra-laryngeal vocal tract (formants), mirroring the two largely independent components of source and filter in voice production. These perceptual spaces of male and female voices and their corresponding voice samples are available at: http://vnl.psy.gla.ac.uk section Resources.

Original languageEnglish
Pages (from-to)110-120
Number of pages11
JournalPsychological Research
Volume74
Issue number1
DOIs
Publication statusPublished - Jan 2010
Externally publishedYes

Fingerprint

Larynx

Cite this

@article{303146c3bbcc485796df6354cfb0e691,
title = "Perceptual scaling of voice identity: common dimensions for different vowels and speakers",
abstract = "THE AIMS OF OUR STUDY WERE: (1) to determine if the acoustical parameters used by normal subjects to discriminate between different speakers vary when comparisons are made between pairs of two of the same or different vowels, and if they are different for male and female voices; (2) to ask whether individual voices can reasonably be represented as points in a low-dimensional perceptual space such that similarly sounding voices are located close to one another. Subjects were presented with pairs of voices from 16 male and 16 female speakers uttering the three French vowels {"}a{"}, {"}i{"} and {"}u{"} and asked to give speaker similarity judgments. Multidimensional analyses of the similarity matrices were performed separately for male and female voices and for three types of comparisons: same vowels, different vowels and overall average. The resulting dimensions were then interpreted a posteriori in terms of relevant acoustical measures. For both male and female voices, a two-dimensional perceptual space was found to be most appropriate, with axes largely corresponding to contributions of the larynx (pitch) and supra-laryngeal vocal tract (formants), mirroring the two largely independent components of source and filter in voice production. These perceptual spaces of male and female voices and their corresponding voice samples are available at: http://vnl.psy.gla.ac.uk section Resources.",
author = "Oliver Baumann and Pascal Belin",
year = "2010",
month = "1",
doi = "10.1007/s00426-008-0185-z",
language = "English",
volume = "74",
pages = "110--120",
journal = "Psychological Research",
issn = "0340-0727",
publisher = "Springer",
number = "1",

}

Perceptual scaling of voice identity : common dimensions for different vowels and speakers. / Baumann, Oliver; Belin, Pascal.

In: Psychological Research, Vol. 74, No. 1, 01.2010, p. 110-120.

Research output: Contribution to journalArticleResearchpeer-review

TY - JOUR

T1 - Perceptual scaling of voice identity

T2 - common dimensions for different vowels and speakers

AU - Baumann, Oliver

AU - Belin, Pascal

PY - 2010/1

Y1 - 2010/1

N2 - THE AIMS OF OUR STUDY WERE: (1) to determine if the acoustical parameters used by normal subjects to discriminate between different speakers vary when comparisons are made between pairs of two of the same or different vowels, and if they are different for male and female voices; (2) to ask whether individual voices can reasonably be represented as points in a low-dimensional perceptual space such that similarly sounding voices are located close to one another. Subjects were presented with pairs of voices from 16 male and 16 female speakers uttering the three French vowels "a", "i" and "u" and asked to give speaker similarity judgments. Multidimensional analyses of the similarity matrices were performed separately for male and female voices and for three types of comparisons: same vowels, different vowels and overall average. The resulting dimensions were then interpreted a posteriori in terms of relevant acoustical measures. For both male and female voices, a two-dimensional perceptual space was found to be most appropriate, with axes largely corresponding to contributions of the larynx (pitch) and supra-laryngeal vocal tract (formants), mirroring the two largely independent components of source and filter in voice production. These perceptual spaces of male and female voices and their corresponding voice samples are available at: http://vnl.psy.gla.ac.uk section Resources.

AB - THE AIMS OF OUR STUDY WERE: (1) to determine if the acoustical parameters used by normal subjects to discriminate between different speakers vary when comparisons are made between pairs of two of the same or different vowels, and if they are different for male and female voices; (2) to ask whether individual voices can reasonably be represented as points in a low-dimensional perceptual space such that similarly sounding voices are located close to one another. Subjects were presented with pairs of voices from 16 male and 16 female speakers uttering the three French vowels "a", "i" and "u" and asked to give speaker similarity judgments. Multidimensional analyses of the similarity matrices were performed separately for male and female voices and for three types of comparisons: same vowels, different vowels and overall average. The resulting dimensions were then interpreted a posteriori in terms of relevant acoustical measures. For both male and female voices, a two-dimensional perceptual space was found to be most appropriate, with axes largely corresponding to contributions of the larynx (pitch) and supra-laryngeal vocal tract (formants), mirroring the two largely independent components of source and filter in voice production. These perceptual spaces of male and female voices and their corresponding voice samples are available at: http://vnl.psy.gla.ac.uk section Resources.

U2 - 10.1007/s00426-008-0185-z

DO - 10.1007/s00426-008-0185-z

M3 - Article

VL - 74

SP - 110

EP - 120

JO - Psychological Research

JF - Psychological Research

SN - 0340-0727

IS - 1

ER -