Detail výsledku

On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices

PECHER, B.; SRBA, I.; BIELIKOVÁ, M. On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. Miami: Association for Computational Linguistics, 2024. p. 522-556. ISBN: 979-8-8917-6164-3.
Typ
článek ve sborníku konference
Jazyk
anglicky
Autoři
Pecher Branislav, Ing., Ph.D.
SRBA, I.
Bieliková Mária, prof. Ing., Ph.D., UPGM (FIT)
Abstrakt

While learning with limited labelled data can effectively deal with a lack of labels, it is also sensitive to the effects of uncontrolled randomness introduced by so-called randomness factors (i.e., non-deterministic decisions such as choice or order of samples). We propose and formalise a method to systematically investigate the effects of individual randomness factors while taking the interactions (dependence) between them into consideration. To this end, our method mitigates the effects of other factors while observing how the performance varies across multiple runs. Applying our method to multiple randomness factors across in-context learning and fine-tuning approaches on 7 representative text classification tasks and meta-learning on 3 tasks, we show that: 1) disregarding interactions between randomness factors in existing works led to inconsistent findings due to incorrect attribution of the effects of randomness factors, such as disproving the consistent sensitivity of in-context learning to sample order even with random sample selection; and 2) besides mutual interactions, the effects of randomness factors, especially sample order, are also dependent on more systematic choices unexplored in existing works, such as number of classes, samples per class or choice of prompt format.

Klíčová slova

NLP in resource-constrained settings, in-context learning, fine-tuning, meta-learning, sensitivity, effects of randomness, stability

Rok
2024
Strany
522–556
Sborník
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Konference
Conference on Empirical Methods in Natural Language Processing
ISBN
979-8-8917-6164-3
Vydavatel
Association for Computational Linguistics
Místo
Miami
DOI
BibTeX
@inproceedings{BUT193223,
  author="PECHER, B. and SRBA, I. and BIELIKOVÁ, M.",
  title="On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices",
  booktitle="Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing",
  year="2024",
  pages="522--556",
  publisher="Association for Computational Linguistics",
  address="Miami",
  doi="10.18653/v1/2024.emnlp-main.32",
  isbn="979-8-8917-6164-3"
}
Pracoviště
Nahoru