A hierarchical representation of form documents for identification and retrieval


Duygulu P., Atalay V.

7th Annual Document Recognition and Retrieval Conference, San-Jose, Kostarika, 26 - 27 Ocak 2000, cilt.3967, ss.128-139 identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Cilt numarası: 3967
  • Basıldığı Şehir: San-Jose
  • Basıldığı Ülke: Kostarika
  • Sayfa Sayıları: ss.128-139
  • Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

In this paper, we present a logical representation for form documents to be used for identification and retrieval. A hierarchical structure is proposed to represent the logical structure of a form by using lines. The approach is top-down and no domain knowledge such as the preprinted data or filled-in data is used. Logically same forms are associated to the same hierarchical structure. This representation can handle geometrical modifications and slight variations.