Thesis Details
Sémantická analýza webového obsahu (Semantic Analysis of Web Content)
This thesis covers the semantic web, web page segmentation, and the technologies used in these areas. It also modifies one web page segmentation method, DOM-based segmentation, by means of semantic web technologies. It thus proposes a method of web page segmentation based on semantic analysis of the individual elements of a page's content. An application demonstrating the proposed segmentation method was implemented as part of the thesis, and the results of experiments performed with it are included.
Semantic web, RDF, RDFS, RDF/XML, ontology, DBpedia, DBpedia-Spotlight, Web page segmentation, Python, lxml, Pyspotlight
Burget Radek, doc. Ing., Ph.D. (DIFS FIT BUT), member
Grégr Matěj, Ing., Ph.D. (DIFS FIT BUT), member
Holík Lukáš, doc. Mgr., Ph.D. (DITS FIT BUT), member
Kořenek Jan, doc. Ing., Ph.D. (DCSY FIT BUT), member
Rychlý Marek, RNDr., Ph.D. (DIFS FIT BUT), member
@mastersthesis{FITMT22669,
  author = "Luk\'{a}\v{s} Hubl",
  type = "Master's thesis",
  title = "S\'{e}mantick\'{a} anal\'{y}za webov\'{e}ho obsahu",
  school = "Brno University of Technology, Faculty of Information Technology",
  year = 2020,
  location = "Brno, CZ",
  language = "czech",
  url = "https://www.fit.vut.cz/study/thesis/22669/"
}