Machine Learning-based Silence Detection in Call Center Telephone Conversations

Iheme L. O., Ozan S., Akagunduz E.

International Conference on Artificial Intelligence and Data Processing (IDAP), Malatya, Turkey, 21 - 22 September 2019 identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/idap.2019.8875958
  • City: Malatya
  • Country: Turkey
  • Keywords: Voice Activity Detection, Bag of Audio Words, MFCC, Clustering, Call Center, VOICE ACTIVITY DETECTION
  • Middle East Technical University Affiliated: No


This study presents the development of a voice activity detection (VAD) system tested on call center telephony data obtained from our local site. The concept of bag of audio words (BoAW) combined with a naive Bayes classifier was applied to achieve the task. It was formulated as a binary classification problem with speech as the positive class and silence/background noise as the negative class. All the processing was performed on the Mel-frequency cepstral coefficients (MFCCs) extracted from the audio recordings. The results which are presented as accuracy score and receiver operating characteristics (ROC) indicate an excellent performance of the developed model. The system is to be deployed within our call center to aid data analysis and improve overall efficiency of the center.