A hybrid named entity recognizer for Turkish with applications to different text genres


Küçük D., YAZICI A.

25th International Symposium on Computer and Information Sciences, ISCIS 2010, London, İngiltere, 22 - 24 Eylül 2010, cilt.62 LNEE, ss.113-116 identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Cilt numarası: 62 LNEE
  • Doi Numarası: 10.1007/978-90-481-9794-1_23
  • Basıldığı Şehir: London
  • Basıldığı Ülke: İngiltere
  • Sayfa Sayıları: ss.113-116
  • Anahtar Kelimeler: information extraction, named entity recognition, Turkish
  • Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

In this study, we present a hybrid named entity recognizer for Turkish, which is based on a previously proposed rule based recognizer. Since rule based systems for specic domains require their knowledge sources to be manually revised when ported to other domains, we turn the rule based recognizer into a hybrid one so that it learns from annotated data and improves its knowledge sources accordingly. Both the hybrid recognizer and its predecessor are evaluated on the same corpora and the hybrid recognizer achieves comparably better results. The current study is significant since it presents the first hybrid -manually engineered and learning-named entity recognizer for Turkish texts. © 2011 Springer Science+Business Media B.V.