Text recognition and correction for automated data collection by mobile devices


Ozarslan S., EREN P. E.

Conference on Imaging and Multimedia Analytics in a Web and Mobile World, San-Francisco, Kostarika, 5 - 06 Şubat 2014, cilt.9027 identifier identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Cilt numarası: 9027
  • Doi Numarası: 10.1117/12.2040668
  • Basıldığı Şehir: San-Francisco
  • Basıldığı Ülke: Kostarika
  • Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

Participatory sensing is an approach which allows mobile devices such as mobile phones to be used for data collection, analysis and sharing processes by individuals. Data collection is the first and most important part of a participatory sensing system, but it is time consuming for the participants. In this paper, we discuss automatic data collection approaches for reducing the time required for collection, and increasing the amount of collected data. In this context, we explore automated text recognition on images of store receipts which are captured by mobile phone cameras, and the correction of the recognized text. Accordingly, our first goal is to evaluate the performance of the Optical Character Recognition (OCR) method with respect to data collection from store receipt images. Images captured by mobile phones exhibit some typical problems, and common image processing methods cannot handle some of them. Consequently, the second goal is to address these types of problems through our proposed Knowledge Based Correction (KBC) method used in support of the OCR, and also to evaluate the KBC method with respect to the improvement on the accurate recognition rate. Results of the experiments show that the KBC method improves the accurate data recognition rate noticeably.