A Framework for Machine Vision based on Neuro-Mimetic Front End Processing and Clustering

Akbas E., WADHWA A., ECKSTEIN M., MADHOW U.

52nd Annual Allerton Conference on Communication, Control, and Computing Allerton, Illinois, Amerika Birleşik Devletleri, 1 - 03 Ekim 2014, ss.311-318

Yayın Türü: Bildiri / Tam Metin Bildiri
Doi Numarası: 10.1109/allerton.2014.7028471
Basıldığı Şehir: Illinois
Basıldığı Ülke: Amerika Birleşik Devletleri
Sayfa Sayıları: ss.311-318
Orta Doğu Teknik Üniversitesi Adresli: Hayır

Özet

Convolutional deep neural nets have emerged as a highly effective approach for machine vision, but there are a number of open issues regarding training (e.g., a large number of model parameters to be learned, and a number of manually tuned algorithm parameters) and interpretation (e.g., geometric interpretations of neurons at various levels of the hierarchy). In this paper, our goal is to explore alternative convolutional architectures which are easier to interpret and simpler to implement. In particular, we investigate a framework that combines a front end based on the known neuroscientific findings about the visual pathway, together with unsupervised feature extraction based on clustering. Supervised classification, using a generic radial basis function (RBF) support vector machine (SVM), is applied at the end. We obtain competitive classification results on standard image databases, beating the state of the art for NORB (uniform-normalized) and approaching it for MNIST.