Language modelling for Turkish as an agglutinative language

Ciloglu T., Comez M., Sahin S.

IEEE 12th Signal Processing and Communications Applications Conference, Kusadasi, Türkiye, 28 - 30 April 2004, pp.461-462

  • DOI Number: 10.1109/siu.2004.1338563
  • City of Publication: Kusadasi
  • Country of Publication: Türkiye
  • Page Numbers: pp.461-462


Two types of language models have been considered for Turkish continuous speech recognition. In the first, words are separated into their stems and the remainder of the word, and language models are computed over this new set of units. In the second, words are treated as whole units, but language models are computed with respect to the stems of the words. Both approaches are studied under bi-gram and tri-gram formalisms.
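
The two unit choices described above can be illustrated with a minimal sketch. It is not the authors' implementation: the `SEGMENTS` table is a hypothetical stand-in for a real morphological analyser, the function names are invented for illustration, and treating the second model as n-grams over the stem sequence is only one possible reading of the abstract.

```python
from collections import Counter

# Hypothetical toy segmentation table (an assumption for illustration);
# a real system would use a Turkish morphological analyser instead.
SEGMENTS = {
    "evde": ("ev", "de"),
    "okula": ("okul", "a"),
    "gitti": ("git", "ti"),
}


def split_word(word):
    """Return (stem, ending); unknown words are treated as bare stems."""
    return SEGMENTS.get(word, (word, ""))


def stem_ending_units(sentence):
    """First model: each word becomes a stem unit plus an optional ending unit."""
    units = []
    for word in sentence.split():
        stem, ending = split_word(word)
        units.append(stem)
        if ending:
            # The "+" prefix marks an ending so it cannot be confused with a stem.
            units.append("+" + ending)
    return units


def stem_sequence(sentence):
    """Second model (as read here): words stay whole, but only stems are counted."""
    return [split_word(word)[0] for word in sentence.split()]


def ngram_counts(units, n):
    """Count n-grams (n=2 for bi-grams, n=3 for tri-grams) over a unit sequence."""
    return Counter(zip(*(units[i:] for i in range(n))))


sentence = "evde okula gitti"
split_bigrams = ngram_counts(stem_ending_units(sentence), 2)
stem_bigrams = ngram_counts(stem_sequence(sentence), 2)
```

In this sketch the first model sees six units for the three-word sentence, so its n-gram context covers fewer words, while the second model sees three stems and ignores the endings entirely. Turning the counts into probabilities would additionally require a smoothing scheme, which the abstract does not specify.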