skip to main content
Language:
Search Limited to: Search Limited to: Resource type Show Results with: Show Results with: Search type Index

Fast Binarization of Unevenly Illuminated Document Images Based on Background Estimation for Optical Character Recognition Purposes

J.UCS (Annual print and CD-ROM archive ed.), 2019-01, Vol.25 (6), p.627-646 [Peer Reviewed Journal]

COPYRIGHT 2019 Pensoft Publishers ;ISSN: 0948-695X ;EISSN: 0948-6968 ;DOI: 10.3217/jucs-025-06-0627

Digital Resources/Online E-Resources

Citations Cited by
  • Title:
    Fast Binarization of Unevenly Illuminated Document Images Based on Background Estimation for Optical Character Recognition Purposes
  • Author: Michalak, Hubert ; Okarma, Krzysztof
  • Subjects: binarization ; document image analysis ; OCR
  • Is Part Of: J.UCS (Annual print and CD-ROM archive ed.), 2019-01, Vol.25 (6), p.627-646
  • Description: One of the key operations during the image preprocessing step in Optical Character Recognition (OCR) algorithms is image binarization. Although for uniformly illuminated images, obtained typically by atbed scanners, the use of a single global threshold may be sufficient for further recognition of individual characters, it cannot be applied directly in case of non-uniform lightened document images. Such problem may occur during capturing photos of documents in unknown lighting conditions making a proper text recognition impossible in some parts of the image. Since the application of popular adaptive thresholding methods, e.g. Niblack, Sauvola and their modifications, based on the analysis of the neighbourhood of each pixel is time consuming, a faster solution might be the division of images into blocks or elimination of non-uniform background. Such an approach can be considered as a balance solution filling the gap between global and local adaptive thresholding. The solution proposed in the paper, useful also for various mobile devices due to limited computational requirements, is based on the approximation of lighting distribution of the background using the reduced resolution images. The proposed method allows to obtain very good OCR results being superior in comparison to typical adaptive binarization algorithms both in terms of the resulting OCR accuracy and computational efficiency.
  • Publisher: Verlag der Technischen Universität Graz
  • Language: English
  • Identifier: ISSN: 0948-695X
    EISSN: 0948-6968
    DOI: 10.3217/jucs-025-06-0627
  • Source: DOAJ Directory of Open Access Journals

Searching Remote Databases, Please Wait