Language modelling for Turkish as an agglutinative language


Ciloglu T., Comez M., Sahin S.

IEEE 12th Signal Processing and Communications Applications Conference, Kusadasi, Turkey, 28 - 30 April 2004, pp.461-462

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/siu.2004.1338563
  • City: Kusadasi
  • Country: Turkey
  • Page Numbers: pp.461-462
  • Middle East Technical University Affiliated: Yes

Abstract

Two types of language models have been considered for Turkish continuous speech recognition. In the first, words are separated into their stems and the remainder of the word, and language models are calculated over this new set of units. In the second, words are treated as wholes, but language models are calculated with respect to the stems of the words. Both cases are studied with bi-gram and tri-gram formalisms.
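
The two unit choices described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the segmentation table `SEGMENTS`, the `+` marker on remainders, and the helper names are all hypothetical, and the second scheme is shown under one possible reading of the abstract (each word represented by its stem when counting n-grams). A real system would obtain stems from a morphological analyser and would estimate smoothed probabilities rather than raw counts.

```python
from collections import Counter

# Hypothetical toy segmentation table: word -> (stem, remainder).
# Assumption for illustration only; not taken from the paper.
SEGMENTS = {
    "evlerde": ("ev", "lerde"),
    "kitaplar": ("kitap", "lar"),
    "var": ("var", ""),
}


def split_units(words):
    """First scheme: replace each word by its stem followed by its remainder."""
    units = []
    for word in words:
        stem, rest = SEGMENTS.get(word, (word, ""))
        units.append(stem)
        if rest:
            # The "+" prefix marks a remainder unit (an assumed convention).
            units.append("+" + rest)
    return units


def stem_sequence(words):
    """Second scheme (one reading): one token per word, namely its stem."""
    return [SEGMENTS.get(word, (word, ""))[0] for word in words]


def ngram_counts(tokens, n):
    """Count all contiguous n-grams in a token sequence."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


sentence = ["evlerde", "kitaplar", "var"]

units = split_units(sentence)      # stem and remainder as separate units
stems = stem_sequence(sentence)    # whole words, represented by stems

bigrams_units = ngram_counts(units, 2)
trigrams_units = ngram_counts(units, 3)
bigrams_stems = ngram_counts(stems, 2)
trigrams_stems = ngram_counts(stems, 3)
```

In this sketch the first scheme lengthens the token sequence (five units for three words), so an n-gram of fixed order spans fewer words, while the second scheme keeps one token per word but discards the suffix information from the n-gram context.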