Thesis Details

Analýza rozložení stran textových dokumentů pomocí hlubokých neuronových sítí

Bachelor's Thesis Student: Endrych David Academic Year: 2018/2019 Supervisor: Kodym Oldřich, Ing., Ph.D.
English title
Convolutional Networks for Document Layout Analysis
Language
Czech
Abstract

The goal of this thesis is to create a tool for analyzig the page layouts of text documents. The problem is solved by convolution neural networks. The architecture chosen in this thesis is the U-Net architecture. The cross entropy error function with weight map is used for train the network model. Paragraph regions are obtained throught connected component analysis. Experiments are evaluated using the Symmetric Best Dice object metric. Experiments have shown that it is better to use all paragraph edges than to focus only on vertical paragraph edges. In addition, experiments show that batche sampling strategies and adaptive resolution help to improve analysis results. The experiments also describe the application of separators, which is useful in analyzing multi-column documents.

Keywords

computer vision, deep neural networks, page layout analysis, image segmentation, U-Net, artificial intelligence

Department
Degree Programme
Information Technology
Files
Status
defended, grade B
Date
11 June 2019
Reviewer
Committee
Herout Adam, prof. Ing., Ph.D. (DCGM FIT BUT), předseda
Bidlo Michal, doc. Ing., Ph.D. (DCSY FIT BUT), člen
Čadík Martin, doc. Ing., Ph.D. (DCGM FIT BUT), člen
Křivka Zbyněk, Ing., Ph.D. (DIFS FIT BUT), člen
Rogalewicz Adam, doc. Mgr., Ph.D. (DITS FIT BUT), člen
Citation
ENDRYCH, David. Analýza rozložení stran textových dokumentů pomocí hlubokých neuronových sítí. Brno, 2019. Bachelor's Thesis. Brno University of Technology, Faculty of Information Technology. 2019-06-11. Supervised by Kodym Oldřich. Available from: https://www.fit.vut.cz/study/thesis/20900/
BibTeX
@bachelorsthesis{FITBT20900,
    author = "David Endrych",
    type = "Bachelor's thesis",
    title = "Anal\'{y}za rozlo\v{z}en\'{i} stran textov\'{y}ch dokument\r{u} pomoc\'{i} hlubok\'{y}ch neuronov\'{y}ch s\'{i}t\'{i}",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2019,
    location = "Brno, CZ",
    language = "czech",
    url = "https://www.fit.vut.cz/study/thesis/20900/"
}
Back to top