Dynamic programming approach to voice transformation

Salor, Ozgul; Demirekler, Mübeccel

doi:10.1016/j.specom.2006.06.003

Salor O., Demirekler M.

SPEECH COMMUNICATION, vol.48, no.10, pp.1262-1272, 2006 (SCI-Expanded, Scopus)

Publication Type: Article / Full Article
Volume: 48 Issue: 10
Publication Date: 2006
DOI: 10.1016/j.specom.2006.06.003
Journal Name: SPEECH COMMUNICATION
Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Page Numbers: pp.1262-1272
Affiliated with Middle East Technical University: No

Abstract

This paper presents a voice transformation algorithm that modifies the speech of a source speaker so that it is perceived as if spoken by a target speaker. A novel method based on a dynamic programming approach is proposed. The system obtains speaker-specific codebooks of line spectral frequencies (LSFs) for both the source and target speakers. These codebooks are used to train a mapping histogram matrix, which is then used to transform LSFs from one speaker to the other. The baseline system uses the maxima of the histogram matrix for LSF transformation. This baseline has two shortcomings: the limited target LSF space and the spectral discontinuities caused by mapping successive frames independently. Both are overcome by applying the dynamic programming approach, which tries to model the long-term behaviour of the target speaker's LSFs while preserving the relationship between successive frames of the source LSFs during transformation. Objective and subjective evaluations show that the dynamic programming approach improves the system's performance in terms of both speech quality and speaker similarity. (c) 2006 Elsevier B.V. All rights reserved.
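
The difference between the baseline and a dynamic programming search can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not give the cost function, so the scoring below (log-probabilities from a histogram row plus log-probabilities from a target codeword transition count matrix) is an assumption, and the names `baseline_map`, `dp_map`, `hist`, `trans` and `eps` are hypothetical. Inputs are codeword indices; decoding the chosen indices back to LSF vectors through the target codebook is left out.

```python
import math


def baseline_map(hist, src_idx):
    """Map each source codeword independently to the histogram maximum.

    hist[i][j] is the count of source codeword i aligned with target
    codeword j in the training data.
    """
    return [max(range(len(hist[s])), key=lambda j: hist[s][j]) for s in src_idx]


def dp_map(hist, trans, src_idx, eps=1e-6):
    """Viterbi-style search for a target codeword sequence.

    trans[j][k] counts how often target codeword j is followed by k.
    This transition model is an assumed stand-in for the paper's
    modelling of target LSF behaviour across frames.
    """
    if not src_idx:
        return []
    n_tgt = len(trans)

    def log_row(row):
        # Smoothed log-probabilities so zero counts stay finite.
        total = sum(row) + eps * len(row)
        return [math.log((c + eps) / total) for c in row]

    log_map = [log_row(r) for r in hist]
    log_trans = [log_row(r) for r in trans]

    # score[k]: best cumulative score of any path ending in codeword k.
    score = list(log_map[src_idx[0]])
    back = []
    for s in src_idx[1:]:
        new_score, ptr = [], []
        for k in range(n_tgt):
            best_j = max(range(n_tgt), key=lambda j: score[j] + log_trans[j][k])
            ptr.append(best_j)
            new_score.append(score[best_j] + log_trans[best_j][k] + log_map[s][k])
        score = new_score
        back.append(ptr)

    # Trace back from the best final codeword.
    k = max(range(n_tgt), key=lambda j: score[j])
    path = [k]
    for ptr in reversed(back):
        k = ptr[k]
        path.append(k)
    path.reverse()
    return path
```

With a toy histogram in which two source codewords each slightly prefer a different target codeword, the baseline jumps between those targets from frame to frame, while the search above can settle on a smoother sequence when the assumed transition counts make such jumps unlikely.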