Thesis Details
Statistická analýza dat z PDF souborů
This thesis is concerning the process of data extraction from tables from documents in PDF format and their subsequent analysis with the exploitation of statistical methods. The goal of this thesis is to demonstrate the process of obtaining, processing and analyzing data from PDF files, which, in consideration of their program processing, create a finite number of subgroups with common characteristics. Firstly, the reader will become acquainted with the fundamentals of PDF file processing and basic mathematical principles that are required in order to statistically evaluate given data. Obtained theoretical principles are then applied to practical use and programming form in the Python programming language. The resulting web application is programmed using the Flask Python library and is usable on a local server.
control chart, statistical process control, Shewhart control chart, Hotelling control chart, process capability index, PDF table extraction, statistical analysis, Python, Flask, web application
Burgetová Ivana, Ing., Ph.D. (DIFS FIT BUT), člen
Fučík Otto, doc. Dr. Ing. (DCSY FIT BUT), člen
Hrubý Martin, Ing., Ph.D. (DITS FIT BUT), člen
Španěl Michal, Ing., Ph.D. (DCGM FIT BUT), člen
@bachelorsthesis{FITBT23695, author = "Krist\'{i}na Oltmanov\'{a}", type = "Bachelor's thesis", title = "Statistick\'{a} anal\'{y}za dat z PDF soubor\r{u}", school = "Brno University of Technology, Faculty of Information Technology", year = 2021, location = "Brno, CZ", language = "slovak", url = "https://www.fit.vut.cz/study/thesis/23695/" }