
Deep learning for terahertz image denoising in nondestructive historical document analysis

Scientific reports, 2022-12, Vol.12 (1), p.22554-22554, Article 22554 [Peer Reviewed Journal]

The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ISSN: 2045-2322; EISSN: 2045-2322; DOI: 10.1038/s41598-022-26957-7; PMID: 36581647

  • Title:
    Deep learning for terahertz image denoising in nondestructive historical document analysis
  • Author: Dutta, Balaka ; Root, Konstantin ; Ullmann, Ingrid ; Wagner, Fabian ; Mayr, Martin ; Seuret, Mathias ; Thies, Mareike ; Stromer, Daniel ; Christlein, Vincent ; Schür, Jan ; Maier, Andreas ; Huang, Yixing
  • Subjects: Aging ; Deep learning
  • Is Part Of: Scientific reports, 2022-12, Vol.12 (1), p.22554-22554, Article 22554
  • Description: Historical documents contain essential information about the past, including places, people, and events. Many of these valuable cultural artifacts cannot be examined further because aging or external influences have left them too fragile to be opened or turned over, so their rich contents remain hidden. Terahertz (THz) imaging is a nondestructive 3D imaging technique that can reveal the hidden contents without damaging the documents. Because noise and imaging artifacts are prevalent in images reconstructed by standard THz reconstruction algorithms, this work aims to improve THz image quality with deep learning. To overcome the data scarcity problem in training a supervised deep learning model, an unsupervised deep learning network (CycleGAN) is first applied to generate paired noisy THz images from clean images (the clean images are produced by a handwriting generator). With these synthetic noisy-to-clean image pairs, a supervised deep learning model using Pix2pixGAN is trained, which effectively enhances real noisy THz images. After Pix2pixGAN denoising, 99% of the characters written on one side of the Xuan paper can be clearly recognized, while 61% of the characters written on one side of the standard paper are sufficiently recognized. The average perceptual index of the Pix2pixGAN-processed images is 16.83, which is very close to the average perceptual index of 16.19 for clean handwriting images. Our work has important value for THz-imaging-based nondestructive historical document analysis.
  • Publisher: England: Nature Publishing Group
  • Language: English
  • Identifier: ISSN: 2045-2322
    EISSN: 2045-2322
    DOI: 10.1038/s41598-022-26957-7
    PMID: 36581647
  • Source: PubMed Central
    ProQuest Central
    DOAJ Directory of Open Access Journals
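
The two-stage data flow summarized in the Description (synthesize noisy counterparts for clean images, then train a supervised denoiser on the resulting pairs) can be illustrated with a toy sketch. This is not the authors' implementation: `synthesize_noisy` is a hypothetical stand-in that uses additive Gaussian noise where the paper uses a trained CycleGAN generator, and `fit_linear_denoiser` is a hypothetical least-squares placeholder where the paper trains Pix2pixGAN. The image size, noise level, and random "handwriting" masks are likewise assumptions chosen only so that the example runs.

```python
import numpy as np

def synthesize_noisy(clean, rng, noise_std=0.2):
    """Hypothetical stand-in for a trained clean-to-noisy generator (CycleGAN in the paper)."""
    return np.clip(clean + rng.normal(0.0, noise_std, clean.shape), 0.0, 1.0)

def build_pairs(clean_images, rng):
    """Stage 1: turn clean images into (noisy, clean) training pairs."""
    return [(synthesize_noisy(c, rng), c) for c in clean_images]

def fit_linear_denoiser(pairs):
    """Stage 2 placeholder: least-squares gain and offset mapping noisy -> clean.

    Stands in for supervised training on paired data (Pix2pixGAN in the paper).
    """
    x = np.concatenate([n.ravel() for n, _ in pairs])
    y = np.concatenate([c.ravel() for _, c in pairs])
    A = np.stack([x, np.ones_like(x)], axis=1)
    (gain, offset), *_ = np.linalg.lstsq(A, y, rcond=None)
    return gain, offset

rng = np.random.default_rng(0)
# Assumed toy "clean handwriting": sparse binary masks, not real generator output.
clean_images = [(rng.random((32, 32)) > 0.8).astype(float) for _ in range(8)]
pairs = build_pairs(clean_images, rng)
gain, offset = fit_linear_denoiser(pairs)

def mse(a, b):
    return float(np.mean((a - b) ** 2))

noisy_err = np.mean([mse(n, c) for n, c in pairs])
denoised_err = np.mean([mse(gain * n + offset, c) for n, c in pairs])
```

The point of the sketch is the structure only: supervised training needs aligned (noisy, clean) pairs, so the first stage exists purely to manufacture them from clean images when real paired THz data is scarce.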
