Recognition of voice onset time for use in pronunciation modeling

Abe Kazemzadeh; Sungbok Lee; Shrikanth Narayanan

doi:10.1121/1.4785771

Back

Recognition of voice onset time for use in pronunciation modeling

Journal article

Peer reviewed

Recognition of voice onset time for use in pronunciation modeling

Abe Kazemzadeh, Sungbok Lee and Shrikanth Narayanan

The Journal of the Acoustical Society of America, Vol.118(3_Supplement), pp.2026-2026

09/01/2005

DOI: https://doi.org/10.1121/1.4785771

Abstract

This study examines methods for recognizing native and accented voiceless stops based on voice onset time (VOT). These methods are tested on data from the Tball corpus of early elementary school children, which includes both native English speakers and Spanish speakers learning English, and which is transcribed to highlight pronunciation variation. We examine the English voiceless stop series, which have long VOT and aspiration, and the corresponding voiceless stops in Spanish accented English, which have short VOT and little aspiration. The methods tested are : (1) to train hidden Markov models (HMMs) based on native speech and then extract the VOT times by post-processing phone-level alignments, (2) to train HMMs with explicit aspiration models, and (3) to train, for each phoneme, different HMMs for native and accented variants. Error rates of 23%–53% for distinguishing phone VOT characteristics are reported for the first method, 5%–57% for the second method, and 0%–36% for the third. The error rates varied depending on the different phones examined. In general, the /p/ and /k/ phones had results that varied more than /t/. These results are discussed in light of each method’s usefulness and ease of implementation, and possible improvements are proposed.

Metrics

3 Record Views

Details

Title: Recognition of voice onset time for use in pronunciation modeling
Author/Creator: Abe Kazemzadeh - University of Southern California
Sungbok Lee - University of Southern California
Shrikanth Narayanan - University of Southern California
Publication Details: The Journal of the Acoustical Society of America, Vol.118(3_Supplement), pp.2026-2026
Academic Unit: Software Engineering and Data Science
Language: English
Resource Type: Journal article
Record Identifier: 991015166254603691

Recognition of voice onset time for use in pronunciation modeling

Abstract

Related links

Metrics

Details