Result Details

Can LLMs Extract Human-like Fine-grained Evidence for Evidence-based Fact-checking?

JAROLÍM, A.; FAJČÍK, M.; MAKAIOVÁ, L. Can LLMs Extract Human-like Fine-grained Evidence for Evidence-based Fact-checking?. In Proceedings of the Nineteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2025. Recent Advances in Slavonic Natural Language Processing. 2025. no. 2025, p. 25-36. ISBN: 978-80-263-1858-3.
Type
conference paper
Language
English
Authors
Jarolím Antonín, Ing., DCGM (FIT)
Fajčík Martin, Ing., Ph.D., DCGM (FIT)
Makaiová Lucia, Bc., FIT (FIT), DCGM (FIT)
Abstract

Misinformation frequently spreads in user comments under online news articles, highlighting the need for effective methods to detect factually incorrect information. To strongly support or refute claims extracted from such comments, it is necessary to identify relevant documents and pinpoint the exact text spans that justify or contradict each claim. This paper focuses on the latter task --- fine-grained evidence extraction for Czech and Slovak claims. We create new dataset, containing two-way annotated fine-grained evidence created by paid annotators. We evaluate large language models (LLMs) on this dataset to assess their alignment with human annotations. The results reveal that LLMs often fail to copy evidence verbatim from the source text, leading to invalid outputs. Error-rate analysis shows that the llama3.1:8b model achieves a high proportion of correct outputs despite its relatively small size, while the gpt-oss-120b model underperforms despite having many more parameters. Furthermore, the models qwen3:14b, deepseek-r1:32b, and gpt-oss:20b demonstrate an effective balance between model size and alignment with human annotations.

Keywords

Fact-checking; Fine-grained evidence; LLMs

URL
Published
2025
Pages
25–36
Journal
Recent Advances in Slavonic Natural Language Processing, no. 2025, ISSN
Proceedings
Proceedings of the Nineteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2025
Conference
Recent Advances in Natural Language Processing
ISBN
978-80-263-1858-3
EID Scopus
BibTeX
@inproceedings{BUT201605,
  author="Antonín {Jarolím} and Martin {Fajčík} and Lucia {Makaiová}",
  title="Can LLMs Extract Human-like Fine-grained Evidence for Evidence-based Fact-checking?",
  booktitle="Proceedings of the Nineteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2025",
  year="2025",
  journal="Recent Advances in Slavonic Natural Language Processing",
  number="2025",
  pages="25--36",
  isbn="978-80-263-1858-3",
  issn="2336-4289",
  url="https://raslan2025.nlp-consulting.net/"
}
Projects
FactDeMice - Evidence-based Fact-Checking with Fact-consistent translation, Fake Review Detection, and Automatic Misinformative Claim Extraction, TAČR, 8. veřejná soutěž programu SIGMA Dílčí cíl 4: Bilaterální spolupráce, TQ16000028, start: 2025-01-01, end: 2027-12-31, running
Departments
Back to top