Publication Details

Finetuning Is a Surprisingly Effective Domain Adaptation Baseline in Handwriting Recognition

KOHÚT Jan and HRADIŠ Michal. Finetuning Is a Surprisingly Effective Domain Adaptation Baseline in Handwriting Recognition. In: Document Analysis and Recognition - ICDAR 2023. Lecture Notes in Computer Science, vol. 14190. San José: Springer Nature Switzerland AG, 2023, pp. 269-286. ISBN 978-3-031-41684-2. ISSN 0302-9743. Available from: https://pero.fit.vutbr.cz/publications
Czech title
Efektivní dománová adaptace v rámci rozpoznávání ručně psaného písma
Type
conference paper
Language
english
Authors
Kohút Jan, Ing. (DCGM FIT BUT)
Hradiš Michal, Ing., Ph.D. (DCGM FIT BUT)
URL
Keywords

Handwritten text recognition, OCR, Data augmentation, Finetuning.

Abstract

In many machine learning tasks, a large general dataset and a small specialized dataset are available. In such situations, various domain adaptation methods can be used to adapt a general model to the target dataset. We show that in the case of neural networks trained for handwriting recognition using CTC, simple finetuning with data augmentation works surprisingly well in such scenarios and that it is resistant to overfitting even for very small target domain datasets. We evaluated the behavior of finetuning with respect to augmentation, training data size, and quality of the pre-trained network, both in writer-dependent and writer-independent settings. On a large real-world dataset, finetuning provided an average relative CER improvement of 25 % with 16 text lines for new writers and 50 % for 256 text lines.

Published
2023
Pages
269-286
Journal
Lecture Notes in Computer Science, vol. 14190, no. 1, ISSN 0302-9743
Proceedings
Document Analysis and Recognition - ICDAR 2023
Series
Lecture Notes in Computer Science
Conference
International Conference on Document Analysis and Recognition, San José, California, USA, US
ISBN
978-3-031-41684-2
Publisher
Springer Nature Switzerland AG
Place
San José, US
DOI
EID Scopus
BibTeX
@INPROCEEDINGS{FITPUB12964,
   author = "Jan Koh\'{u}t and Michal Hradi\v{s}",
   title = "Finetuning Is a Surprisingly Effective Domain Adaptation Baseline in Handwriting Recognition",
   pages = "269--286",
   booktitle = "Document Analysis and Recognition - ICDAR 2023",
   series = "Lecture Notes in Computer Science",
   journal = "Lecture Notes in Computer Science",
   volume = 14190,
   number = 1,
   year = 2023,
   location = "San Jos\'{e}, US",
   publisher = "Springer Nature Switzerland AG",
   ISBN = "978-3-031-41684-2",
   ISSN = "0302-9743",
   doi = "10.1007/978-3-031-41685-9\_17",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/12964"
}
Back to top