Language modelling for Turkish as an agglutinative language

Ciloglu T., Comez M., Sahin S.

IEEE 12th Signal Processing and Communications Applications Conference, Kusadasi, Türkiye, 28 - 30 Nisan 2004, ss.461-462, (Tam Metin Bildiri)

Yayın Türü: Bildiri / Tam Metin Bildiri
Doi Numarası: 10.1109/siu.2004.1338563
Basıldığı Şehir: Kusadasi
Basıldığı Ülke: Türkiye
Sayfa Sayıları: ss.461-462
Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

Two types of language models have been considered for Turkish continuous speech recogniton. In one case words are seperated into their stems and their rest, and language models are calculated based on this new set of units. In the other case words are considered as a whole but language models are calculated with respect to the stems of the words. Studies are carried out for bi-gram and tri-gram formalisms.