Factors on the execution times of metropolis resampling and its variations


DÜLGER Ö., OĞUZTÜZÜN M. H. S.

5th International Conference on Computer Science and Engineering, UBMK 2020, Diyarbakır, Türkiye, 9 - 10 Eylül 2020, ss.152-155 identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Doi Numarası: 10.1109/ubmk50275.2020.9219464
  • Basıldığı Şehir: Diyarbakır
  • Basıldığı Ülke: Türkiye
  • Sayfa Sayıları: ss.152-155
  • Anahtar Kelimeler: Graphics processing unit, Metropoiis-C1 resampling, Metropolis resampling, Metropolis-C2 resampling, Noncoalesced global memory access pattern, Particle filter
  • Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

Particle filter is a serial Monte-Carlo estimation method. It is suitable for the applications whose system or measurement model is highly non-linear and uncertainties are large. The standard particle filter encounters degeneracy problem. One of the important solution is resampling. The most common resampling method is Systematic resampling. It requires collective operations over the weights. This causes numerical instability problem for Systematic resampling. Furthermore, interaction between all weights causes less-readily parallelization of Systematic resampling on graphics processing unit (GPU). To overcome these problems, Metropolis resampling is proposed. Since it only uses ratio of two weights rather than collective operations, it does not suffer from numerical instability and it is more suitable for the parallel implementation of it on GPU. Although it is fast in theory, it suffers from non-coalesced global memory access patterns when implemented on GPU. Metropolis- C1 and Metropolis-C2 are proposed to overcome this problem. In this study, we investigate the factors on the execution times of Metropolis, Metropolis-C1 and Metropolis-C2. There are mainly three factors. These are physical resources and limitations of the GPU, non-coalesced global memory access patterns and the number of particles. We discuss how they affect the execution times of these resampling algorithms in detail.