A Framework for Machine Vision based on Neuro-Mimetic Front End Processing and Clustering


Akbas E., Wadhwa A., Eckstein M., Madhow U.

52nd Annual Allerton Conference on Communication, Control, and Computing, Allerton, Illinois, United States, 1-3 October 2014, pp. 311-318

  • Publication Type: Conference Paper / Full-Text Paper
  • DOI: 10.1109/allerton.2014.7028471
  • City of Publication: Illinois
  • Country of Publication: United States
  • Pages: pp. 311-318
  • Affiliated with Middle East Technical University: No

Abstract

Convolutional deep neural nets have emerged as a highly effective approach for machine vision, but there are a number of open issues regarding training (e.g., a large number of model parameters to be learned, and a number of manually tuned algorithm parameters) and interpretation (e.g., geometric interpretations of neurons at various levels of the hierarchy). In this paper, our goal is to explore alternative convolutional architectures which are easier to interpret and simpler to implement. In particular, we investigate a framework that combines a front end based on the known neuroscientific findings about the visual pathway, together with unsupervised feature extraction based on clustering. Supervised classification, using a generic radial basis function (RBF) support vector machine (SVM), is applied at the end. We obtain competitive classification results on standard image databases, beating the state of the art for NORB (uniform-normalized) and approaching it for MNIST.
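
The three stages named in the abstract (a fixed front end, unsupervised feature extraction by clustering, and an RBF kernel for the final classifier) can be illustrated with a toy sketch. This is not the authors' implementation: the mean-removal front end is only a crude stand-in for the neuro-mimetic processing, plain k-means is assumed as the clustering method, the data are random 9-sample patches, and every function name and parameter below is hypothetical. SVM training itself is omitted; only the RBF kernel it would use is shown.

```python
import math
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def front_end(patch):
    """Toy stand-in for the front end (assumption): remove the mean,
    then scale the patch to unit length."""
    mean = sum(patch) / len(patch)
    centered = [x - mean for x in patch]
    norm = math.sqrt(sum(x * x for x in centered)) or 1.0
    return [x / norm for x in centered]


def kmeans(points, k, iters=20, seed=0):
    """Plain k-means (assumed clustering method); returns k cluster centers."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: sq_dist(p, centers[c]))
            groups[nearest].append(p)
        for j, group in enumerate(groups):
            if group:  # an empty cluster keeps its previous center
                centers[j] = [sum(col) / len(group) for col in zip(*group)]
    return centers


def encode(patch, centers):
    """Feature vector: rectified dot product of the patch with each center."""
    return [max(0.0, sum(a * b for a, b in zip(patch, c))) for c in centers]


def rbf_kernel(u, v, gamma=1.0):
    """RBF kernel exp(-gamma * ||u - v||^2), as used by an RBF SVM."""
    return math.exp(-gamma * sq_dist(u, v))


# Usage on random toy data (hypothetical, for illustration only).
rng = random.Random(1)
patches = [[rng.random() for _ in range(9)] for _ in range(200)]
processed = [front_end(p) for p in patches]
centers = kmeans(processed, k=8)
features = [encode(p, centers) for p in processed]
similarity = rbf_kernel(features[0], features[1])
```

In a full system the feature vectors would be passed to an SVM trainer, and the kernel width `gamma` would need tuning; both steps are outside this sketch.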