Compiling the first spoken corpus for Turkish youth talk


Efeoglu-Ozcan E., Guler H.

AUSTRALIAN REVIEW OF APPLIED LINGUISTICS, 2025 (ESCI, Scopus)

  • Publication Type: Article / Full Article
  • Publication Date: 2025
  • DOI: 10.1075/aral.25007.efe
  • Journal Name: AUSTRALIAN REVIEW OF APPLIED LINGUISTICS
  • Journal Indexes: Emerging Sources Citation Index (ESCI), Scopus, Periodicals Index Online, Communication & Mass Media Index, EBSCO Education Source, Educational Research Abstracts (ERA), ERIC (Education Resources Information Center), Linguistic Bibliography, Linguistics & Language Behavior Abstracts, MLA - Modern Language Association Database
  • Affiliated with Middle East Technical University: Yes

Abstract

This paper addresses issues related to the design and compilation of the first spoken corpus of youth talk in an under-represented language in corpus linguistics, Turkish. Designed to offer a maximally representative sample of Turkish youth talk, the Corpus of Turkish Youth Language (CoTY) is a 168,748-token specialised corpus within the single register of informal, naturally occurring and spontaneous interaction exclusively among friends. The speakers are Turkish-speaking youth aged 14 to 18 from diverse socio-economic backgrounds in Türkiye. In this paper, the issues that surfaced during corpus design and construction are presented, with a discussion and justification of the methodological choices in relation to the long-term project objectives. The corpus contributes to the field as a valuable resource and tool for cross-linguistic youth language research. As an overarching fundamental goal, the project also aims to expand on the cumulative linguistic and methodological knowledge in spoken corpus design and construction.