Thesis Details

Statistical Analysis of Data from PDF Files (Statistická analýza dat z PDF souborů)

Bachelor's Thesis
Student: Oltmanová Kristína
Academic Year: 2020/2021
Supervisor: Bartík Vladimír, Ing., Ph.D.

Language

Slovak

Abstract

This thesis deals with extracting data from tables in PDF documents and analyzing that data with statistical methods. Its goal is to demonstrate the process of obtaining, processing, and analyzing data from PDF files which, with regard to their programmatic processing, form a finite number of subgroups with common characteristics. The reader is first introduced to the fundamentals of PDF file processing and to the basic mathematical principles required to evaluate the given data statistically. These theoretical principles are then put into practice and implemented in the Python programming language. The resulting web application is built with the Flask Python library and can be run on a local server.

Keywords

control chart, statistical process control, Shewhart control chart, Hotelling control chart, process capability index, PDF table extraction, statistical analysis, Python, Flask, web application

Department

Department of Information Systems FIT BUT

Degree Programme

Information Technology

Status

defended, grade B

Date

18 June 2021

Reviewer

Burgetová Ivana, Ing., Ph.D.

Committee

Kolář Dušan, doc. Dr. Ing. (DIFS FIT BUT), chair
Burgetová Ivana, Ing., Ph.D. (DIFS FIT BUT), member
Fučík Otto, doc. Dr. Ing. (DCSY FIT BUT), member
Hrubý Martin, Ing., Ph.D. (DITS FIT BUT), member
Španěl Michal, Ing., Ph.D. (DCGM FIT BUT), member

Citation

OLTMANOVÁ, Kristína. Statistická analýza dat z PDF souborů. Brno, 2021. Bachelor's Thesis. Brno University of Technology, Faculty of Information Technology. 2021-06-18. Supervised by Bartík Vladimír. Available from: https://www.fit.vut.cz/study/thesis/23695/

BibTeX

@bachelorsthesis{FITBT23695,
    author = "Krist\'{i}na Oltmanov\'{a}",
    type = "Bachelor's thesis",
    title = "Statistick\'{a} anal\'{y}za dat z PDF soubor\r{u}",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2021,
    location = "Brno, CZ",
    language = "slovak",
    url = "https://www.fit.vut.cz/study/thesis/23695/"
}
