Nonlinear interactive source-filter models for speech

KOÇ, Turgay; ÇİLOĞLU, TOLGA

doi:10.1016/j.csl.2014.12.002

Nonlinear interactive source-filter models for speech

KOÇ T., ÇİLOĞLU T.

COMPUTER SPEECH AND LANGUAGE, cilt.36, ss.365-394, 2016 (SCI-Expanded, Scopus)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 36
Basım Tarihi: 2016
Doi Numarası: 10.1016/j.csl.2014.12.002
Dergi Adı: COMPUTER SPEECH AND LANGUAGE
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Sayfa Sayıları: ss.365-394
Anahtar Kelimeler: Speech production, Source-filter theory, Source-filter interaction, Speech modeling, ELECTROGLOTTOGRAPHIC SIGNALS, LINEAR PREDICTION, SINGING VOICE, GLOTTAL FLOW, VOCAL-TRACT, AIR-FLOW, PHONATION, WAVE, SIMULATION, TURBULENCE
Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

The linear source-filter model of speech production assumes that the source of the speech sounds is independent of the filter. However, acoustic simulations based on the physical speech production models show that when the fundamental frequency of the source harmonics approaches the first formant of the vocal tract filter, the filter has significant effects on the source due to the nonlinear coupling between them. In this study, two interactive system models are proposed under the quasi steady Bernoulli flow and linear vocal tract assumptions. An algorithm is developed to estimate the model parameters. Glottal flow and the linear vocal tract parameters are found by conventional methods. Rosenberg model is used to synthesize the glottal waveform. A recursive optimization method is proposed to find the parameters of the interactive model. Finally, glottal flow produced by the nonlinear interactive system is computed. The experimental results show that the interactive system model produces fine details of glottal flow source accurately. (C) 2014 Elsevier Ltd. All rights reserved.