OrienTel: Turkish telephone speech database

Ciloglu T., Acar D., Tokatli A.

IEEE 12th Signal Processing and Communications Applications Conference, Kusadasi, Türkiye, 28 - 30 Nisan 2004, ss.280-283, (Tam Metin Bildiri)

Yayın Türü: Bildiri / Tam Metin Bildiri
Doi Numarası: 10.1109/siu.2004.1338314
Basıldığı Şehir: Kusadasi
Basıldığı Ülke: Türkiye
Sayfa Sayıları: ss.280-283
Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

This paper describes Turkish telephone speech database created within the framework of Orientel (IST-2000-28373), a 5th framework project. Orientel aims to collect telephone speech data from 21 languages. Turkish database has been successfully completed in 16 months. The work includes recordings, annotations and documentation of 1700 recording sessions. The speaker distribution has been balanced with respect to criteria such as age, sex, dialect, calling environment and network. The database contains digits, numbers, time, date, words and sentences. It is the first Turkish speech database of its size and also of its detailed systematic manner followed in the preparation and validation.