Multi-frame knowledge based text enhancement for mobile phone captured videos

Ozarslan S., EREN P. E.

Conference on Mobile Devices and Multimedia - Enabling Technologies, Algorithms, and Applications, San-Francisco, Costa Rica, 3-5 February 2014, vol. 9030

  • Volume number: 9030
  • DOI: 10.1117/12.2040606
  • City of publication: San-Francisco
  • Country of publication: Costa Rica


In this study, we explore automated text recognition and enhancement using mobile phone captured videos of store receipts. We propose a method that combines Optical Character Recognition (OCR) with our proposed Row Based Multiple Frame Integration (RB-MFI) and Knowledge Based Correction (KBC) algorithms. First, a trained OCR engine performs recognition; the RB-MFI algorithm is then applied to the OCR output. RB-MFI determines and combines the most accurate rows of the text outputs that OCR extracts from multiple frames of the video. The KBC algorithm is then applied to these rows to correct erroneous characters. Experimental results show that the proposed video-based approach, which includes the RB-MFI and KBC algorithms, increases the word recognition rate to 95% and the character recognition rate to 98%.
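
The pipeline described in the abstract (per-frame OCR, then row-level integration across frames, then knowledge-based correction) can be illustrated with a minimal sketch. The abstract does not state how RB-MFI judges which row is "most accurate" or what knowledge KBC draws on, so everything below is an assumption made for illustration: the function names `integrate_rows` and `correct_row` are hypothetical, row selection uses a simple majority vote across frames, and correction uses fuzzy matching against a small vocabulary of expected receipt words. It is not the authors' implementation.

```python
from collections import Counter
from difflib import get_close_matches


def integrate_rows(frames):
    """Merge per-frame OCR output row by row.

    `frames` is a list of frames, each a list of row strings.
    ASSUMPTION: the row text that most frames agree on is taken as the
    most accurate one; the paper's actual criterion may differ.
    """
    n_rows = max(len(frame) for frame in frames)
    merged = []
    for i in range(n_rows):
        # Collect the non-empty candidates for row i from every frame.
        candidates = [f[i] for f in frames if i < len(f) and f[i].strip()]
        if candidates:
            merged.append(Counter(candidates).most_common(1)[0][0])
    return merged


def correct_row(row, vocabulary, cutoff=0.7):
    """Replace each word with its closest vocabulary entry, if close enough.

    ASSUMPTION: the "knowledge" is a plain word list; words with no
    sufficiently similar entry (e.g. prices) are left unchanged.
    """
    fixed = []
    for word in row.split():
        match = get_close_matches(word, vocabulary, n=1, cutoff=cutoff)
        fixed.append(match[0] if match else word)
    return " ".join(fixed)


# Hypothetical OCR output for three video frames of the same receipt.
frames = [
    ["TOTAL 12.50", "M1LK 2.99"],
    ["T0TAL 12.50", "MILK 2.99"],
    ["TOTAL 12.50", "M1LK 2.99"],
]
vocabulary = ["TOTAL", "MILK"]

merged = integrate_rows(frames)
corrected = [correct_row(row, vocabulary) for row in merged]
print(corrected)
```

In this toy input, the vote alone recovers "TOTAL" but keeps the misread "M1LK" because two of three frames agree on it; the vocabulary lookup then repairs it. This shows why a correction stage after multi-frame integration can still be useful.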