Thesis Details

Extrakce dat z popisu zboží

Master's Thesis Student: Sláma Vojtěch Academic Year: 2007/2008 Supervisor: Burget Radek, doc. Ing., Ph.D.
English title
Data Extraction from Product Descriptions
Language
Czech
Abstract

This work concentrates on the design and implementation of an automated support for data extraction from product descriptions. This system will be used for e-shop purposes. The work introduces present approaches to information extraction from HTML documents. It focuses chiefly at wrappers and methods for their induction. The visual approach to information extraction is also mentioned. System requirements and basic principles are described in the design part of the work. Next, a detailed description of a path tracing algorithm in document object model is explained. The last section of the work evaluates the results of experiments made with the implemented system.

Keywords

Information extraction, wrapper, wrapper induction, webshop, e-shop, JavaScript, DOM.

Department
Degree Programme
Information Technology, Field of Study Information Systems
Files
Status
defended, grade B
Date
16 June 2008
Reviewer
Committee
Švéda Miroslav, prof. Ing., CSc. (DIFS FIT BUT), předseda
Burget Radek, doc. Ing., Ph.D. (DIFS FIT BUT), člen
Drahanský Martin, prof. Ing., Dipl.-Ing., Ph.D. (DITS FIT BUT), člen
Matoušek Petr, doc. Ing., Ph.D., M.A. (DIFS FIT BUT), člen
Šafařík Jiří, prof. Ing., CSc. (WBU in Pilsen), člen
Vojnar Tomáš, prof. Ing., Ph.D. (DITS FIT BUT), člen
Citation
SLÁMA, Vojtěch. Extrakce dat z popisu zboží. Brno, 2008. Master's Thesis. Brno University of Technology, Faculty of Information Technology. 2008-06-16. Supervised by Burget Radek. Available from: https://www.fit.vut.cz/study/thesis/7080/
BibTeX
@mastersthesis{FITMT7080,
    author = "Vojt\v{e}ch Sl\'{a}ma",
    type = "Master's thesis",
    title = "Extrakce dat z popisu zbo\v{z}\'{i}",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2008,
    location = "Brno, CZ",
    language = "czech",
    url = "https://www.fit.vut.cz/study/thesis/7080/"
}
Back to top