Publication Details

Monant Medical Misinformation Dataset: Mapping Articles to Fact-Checked Claims

SRBA Ivan, PECHER Branislav, TOMLEIN Matúš, MÓRO Róbert, ŠTEFANCOVÁ Elena, ŠIMKO Jakub and BIELIKOVÁ Mária. Monant Medical Misinformation Dataset: Mapping Articles to Fact-Checked Claims. In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. Madrid: Association for Computing Machinery, 2022, pp. 2949-2959. ISBN 978-1-4503-8732-3. Available from: https://dl.acm.org/doi/10.1145/3477495.3531726
Czech title
Monant datový soubor medicinských dezinformací: mapování článků k tvrzením ověřeným fakty
Type
conference paper
Language
english
Authors
Srba Ivan ()
Pecher Branislav, Ing. (DCGM FIT BUT)
Tomlein Matúš ()
Móro Róbert ()
Štefancová Elena ()
Šimko Jakub, doc. Ing., Ph.D. (DCGM FIT BUT)
Bieliková Mária, prof. Ing., PhD. (DCGM FIT BUT)
URL
Keywords

medical misinformation, dataset, fact-checking, Monant platform

Abstract

False information has a significant negative influence on individuals as well as on the whole society. Especially in the current COVID-19 era, we witness an unprecedented growth of medical misinformation. To help tackle this problem with machine learning approaches, we are publishing a feature-rich dataset of approx. 317k medical news/blog articles and 3.5k fact-checked claims. It also contains 573 manual and more than 51k predicted labels mapping the claims to the articles. They represent claim presence, i.e., whether a claim is contained in the given article, and article stance towards the claim. We provide several baselines for these two tasks and evaluate them on the manually labelled part of the dataset. The dataset enables a number of additional tasks related to medical misinformation, such as misinformation characterization studies or studies of misinformation diffusion between sources.

Published
2022
Pages
2949-2959
Proceedings
Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
Conference
The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, ES
ISBN
978-1-4503-8732-3
Publisher
Association for Computing Machinery
Place
Madrid, ES
DOI
UT WoS
000852715903001
EID Scopus
BibTeX
@INPROCEEDINGS{FITPUB12680,
   author = "Ivan Srba and Branislav Pecher and Mat\'{u}\v{s} Tomlein and R\'{o}bert M\'{o}ro and Elena \v{S}tefancov\'{a} and Jakub \v{S}imko and M\'{a}ria Bielikov\'{a}",
   title = "Monant Medical Misinformation Dataset: Mapping Articles to Fact-Checked Claims",
   pages = "2949--2959",
   booktitle = "Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval",
   year = 2022,
   location = "Madrid, ES",
   publisher = "Association for Computing Machinery",
   ISBN = "978-1-4503-8732-3",
   doi = "10.1145/3477495.3531726",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/12680"
}
Back to top