Result Details

Software module for automatic enhancement of digitized documents

Created: 2019
Type
software
Language
English
Authors
Hradiš Michal, Ing., Ph.D., UAMT (FEEC), DCGM (FIT)
Kodym Oldřich, Ing., Ph.D., DCGM (FIT)
Description

Tool for text-guided textual document scan quality enhancement. The method works on lines of text that can be input through a PAGE XML or detected automatically by a built-in OCR. By using text input along with the image, the results can be correctly readable even with parts of the original text missing or severely degraded in the source image. The tool includes functionality for cropping the text lines, processing them with our provided models for either text enhancement and inpainting, and for blending the enhanced text lines back into the source document image. We currently provide models for OCR and enhancement of czech newspapers optimized for low-quality scans from micro-films.

This package can be used as a standalone command line tool to process document pages in bulk. Alternatively, the package provides a python class that can be integrated in third-party software.

Keywords

OCR, document, quality enhancement, Generative Adversarial Networks, Convolutional Networks

Location

github.com/DCGM/pero-enhance

License
In order to use the result by another entity, it is always necessary to acquire a license
License Fee
The licensor does not require a license fee for the result
Projects
Advanced content extraction and recognition for printed and handwritten documents for better accessibility and usability, MK, Program na podporu aplikovaného výzkumu a experimentálního vývoje národní a kulturní identity na léta 2016 až 2022 (NAKI II), DG18P02OVV055, start: 2018-03-01, end: 2022-12-31, completed
Research groups
Departments
Back to top