Detail výsledku

Can LLMs Extract Human-like Fine-grained Evidence for Evidence-based Fact-checking?

JAROLÍM, A.; FAJČÍK, M.; MAKAIOVÁ, L. Can LLMs Extract Human-like Fine-grained Evidence for Evidence-based Fact-checking?. In Proceedings of the Nineteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2025. Recent Advances in Slavonic Natural Language Processing. 2025. no. 2025, p. 25-36. ISBN: 978-80-263-1858-3.
Typ
článek ve sborníku konference
Jazyk
angličtina
Autoři
Jarolím Antonín, Ing., UPGM (FIT)
Fajčík Martin, Ing., Ph.D., UPGM (FIT)
Makaiová Lucia, Bc., FIT (FIT), UPGM (FIT)
Abstrakt

Misinformation frequently spreads in user comments under online news articles, highlighting the need for effective methods to detect factually incorrect information. To strongly support or refute claims extracted from such comments, it is necessary to identify relevant documents and pinpoint the exact text spans that justify or contradict each claim. This paper focuses on the latter task --- fine-grained evidence extraction for Czech and Slovak claims. We create new dataset, containing two-way annotated fine-grained evidence created by paid annotators. We evaluate large language models (LLMs) on this dataset to assess their alignment with human annotations. The results reveal that LLMs often fail to copy evidence verbatim from the source text, leading to invalid outputs. Error-rate analysis shows that the llama3.1:8b model achieves a high proportion of correct outputs despite its relatively small size, while the gpt-oss-120b model underperforms despite having many more parameters. Furthermore, the models qwen3:14b, deepseek-r1:32b, and gpt-oss:20b demonstrate an effective balance between model size and alignment with human annotations.

Klíčová slova

Fact-checking; Fine-grained evidence; LLMs

URL
Rok
2025
Strany
25–36
Časopis
Recent Advances in Slavonic Natural Language Processing, č. 2025, ISSN
Sborník
Proceedings of the Nineteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2025
Konference
Recent Advances in Natural Language Processing
ISBN
978-80-263-1858-3
EID Scopus
BibTeX
@inproceedings{BUT201605,
  author="Antonín {Jarolím} and Martin {Fajčík} and Lucia {Makaiová}",
  title="Can LLMs Extract Human-like Fine-grained Evidence for Evidence-based Fact-checking?",
  booktitle="Proceedings of the Nineteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2025",
  year="2025",
  journal="Recent Advances in Slavonic Natural Language Processing",
  number="2025",
  pages="25--36",
  isbn="978-80-263-1858-3",
  issn="2336-4289",
  url="https://raslan2025.nlp-consulting.net/"
}
Projekty
FactDeMice - Ověřování faktů na základě textových dokladů s využitím překladu konzistentního s fakty, detekcí falešných recenzí a automatické extrakce zavádějících tvrzení, TAČR, 8. veřejná soutěž programu SIGMA Dílčí cíl 4: Bilaterální spolupráce, TQ16000028, zahájení: 2025-01-01, ukončení: 2027-12-31, řešení
Pracoviště
Nahoru