A heuristic algorithm for optical character recognition of Arabic script

Atici, AA; YarmanVural, FT

doi:10.1016/s0165-1684(97)00117-5

A heuristic algorithm for optical character recognition of Arabic script

Atici A., YarmanVural F.

SIGNAL PROCESSING, cilt.62, sa.1, ss.87-99, 1997 (SCI-Expanded, Scopus)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 62 Sayı: 1
Basım Tarihi: 1997
Doi Numarası: 10.1016/s0165-1684(97)00117-5
Dergi Adı: SIGNAL PROCESSING
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Sayfa Sayıları: ss.87-99
Anahtar Kelimeler: segmentation, main feature segment, key feature, HMM, optical character recognition, contour following, chain code
Orta Doğu Teknik Üniversitesi Adresli: Hayır

Özet

In this paper, a heuristic method is developed for segmentation, feature extraction and recognition of the Arabic script. The study is part of a large project for transcription of the documents in Ottoman Archives. A geometrical and topological feature analysis method is developed for segmentation and feature extraction stages. Chain code transformation is applied to main strokes of the characters, which are classified by the hidden Markov model (HMM) in the recognition stage. Experimental results indicate that the performance of the proposed method is quite satisfactory, provided that the thinning process does not yield spurious branches. (C) 1997 Elsevier Science B.V.