Integrace, vizualizace a dolování z dat zemí světa

Master's Thesis
Student: Dušek Vladimír
Academic Year: 2021/2022
Supervisor: Bartík Vladimír, Ing., Ph.D.
English title
Integration, Visualization, and Mining from Data of World Countries

This thesis explores the use of open data about countries around the world, particularly data on progress and quality of life. The goal was to design and implement a web application that presents this data, and to use the data for data mining. Data from open data sources was integrated and processed using the Apache Airflow platform. The API was built with the Python framework FastAPI, and the web application was implemented with the JavaScript library ReactJS. In the application, the indicators are grouped into categories. Each indicator can be displayed for different groups of countries, for different time periods, and in several visualizations. In the data mining part, countries were clustered based on a group of indicators, and the future development of selected indicators was predicted using regression analysis. The final application is available at


Apache Airflow, ETL, FastAPI, ReactJS, PostgreSQL, data analysis, databases, data warehouses, data mining, information systems, data integration, regression, clustering, data visualization, web applications, data processing

