Acoustic source separation of convolutive mixtures based on intensity vector statistics


GÜNEL B., HACIHABİBOĞLU H., Kondoz A.

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, cilt.16, sa.4, ss.748-756, 2008 (SCI-Expanded) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 16 Sayı: 4
  • Basım Tarihi: 2008
  • Doi Numarası: 10.1109/tasl.2008.918967
  • Dergi Adı: IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Sayfa Sayıları: ss.748-756
  • Anahtar Kelimeler: array processing, B-format, beamforming, circular statistics, coincident microphone array, convolutive mixtures, intensity, source separation, von Mises, BLIND SOURCE SEPARATION, ALGORITHMS
  • Orta Doğu Teknik Üniversitesi Adresli: Hayır

Özet

Various techniques have previously been proposed for the separation of convolutive mixtures. These techniques can be classified as stochastic, adaptive, and deterministic. Stochastic methods are computationally expensive since they require an iterative process for the calculation of the demixing filters based on a separation criterion that usually assumes that the source signals are statistically independent. Adaptive methods, such as the adaptive beamformers, also exploit signal properties in order to optimize a multichannel filter structure. However, these algorithms need initialization and time to converge. Deterministic methods, on the other hand, provide a closed-form solution based on the deterministic aspects of the problem, such as the channel characteristics and the source directions. This paper presents a technique that exploits the intensity vector statistics to achieve a nearly closed-form solution for the separation of the convolutive mixtures as recorded with a coincident microphone array. No assumptions are made on the signals, but it is assumed that the source directions are known a priori. Directivity functions based on von Mises functions are designed for beamforming depending on the circular statistics of the calculated intensity vectors. Numerical evaluation results were presented for various speech and instrument sounds and source positions in two reverberant rooms.