Department of Computer Graphics and Multimedia
2026
- ALGASOV, A.; NEPOVINNYKH, E.; ZOLOTAREV, F.; EEROLA, T.; KÄLVIÄINEN, H.; STEWART, C.; OTARASHVILI, L.; HOLMBERG, J. On Combining Animal Re-Identification Models to Address Small Datasets. International journal of computer vision, 2026, vol. 134, no. 3, p. 1-18.
- BELANEC, R.; PECHER, B.; SRBA, I.; BIELIKOVÁ, M. PEFT-Bench: A Parameter-Efficient Fine-Tuning Methods Benchmark. Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers). Morocco: Association for Computational Linguistics, 2026. p. 3035-3054. ISBN: 979-8-89176-380-7.
- BELANEC, R.; SRBA, I.; BIELIKOVÁ, M. PEFT-Factory: Unified Parameter-Efficient Fine-Tuning of Autoregressive Large Language Models. Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 3: System Demonstrations). Morocco: Association for Computational Linguistics, 2026. p. 188-202. ISBN: 979-8-89176-382-1.
- GAO, R.; LIU, X.; XING, B.; YU, Z.; SCHULLER, B.; KÄLVIÄINEN, H. Identity-Free Artificial Emotional Intelligence via Micro-Gesture Understanding. IEEE Transactions on Affective Computing, 2026, no. 04 February 2026, 15 p.
- GURGUROV, D.; TRINLEY, K.; VYKOPAL, I.; VAN GENABITH, J.; OSTERMANN, S.; ZAMPARELLI, R. Multilingual Political Views of Large Language Models: Identification and Steering. Mumbai, India: Association for Computational Linguistics, 2026. p. 279-298. ISBN: 979-8-89176-303-6.
- CHLUBNA, T. vkCompViz: Universal C++ Library for GPU-Based Experiments. Journal of open source software, 2026, vol. 11, no. 117, 5 p.
- KIŠŠ, M.; HRADIŠ, M.; DVOŘÁKOVÁ, M.; JIROUŠEK, V.; KERSCH, F. AnnoPage Dataset: Dataset of Non-Textual Elements in Documents with Fine-Grained Categorization. In Document Analysis and Recognition – ICDAR 2025 Workshops. Cham: Springer Nature Switzerland, 2026. p. 50-66. ISBN: 978-3-032-09370-7.
- KRAVIC, N.; PAJEVIC, I.; HASANOVIC, M.; DE LEON MARTINEZ, S.; NIEDERKROTENTHALER, T.; VORACEK, M.; DERVIC, K. War orphan age at father loss and resilience in late adolescence. Wiener klinische Wochenschrift, 2026, no. January 2026, 8 p.
- PECHER, B.; ČEGIŇ, J.; BELANEC, R.; SRBA, I.; ŠIMKO, J.; BIELIKOVÁ, M. Better as Generators Than Classifiers: Leveraging LLMs and Synthetic Data for Low-Resource Multilingual Classification. Findings of the Association for Computational Linguistics: EACL 2026. Morocco: Association for Computational Linguistics, 2026. p. 2840-2857. ISBN: 979-8-89176-386-9.
- POLOK, A.; KLEMENT, D.; KOCOUR, M.; HAN, J.; LANDINI, F.; YUSUF, B.; WIESNER, M.; KHUDANPUR, S.; ČERNOCKÝ, J.; BURGET, L. DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition. Computer Speech and Language, 2026, vol. 95, no. 1, p. 1-19.
- REPKA, S.; EEROLA, T.; MOTL, D.; VÝRAVSKÝ, J.; ZEMČÍK, P. Unsupervised Mineral Segmentation with Graph Neural Networks and Multi-modal SEM Data. In Lecture Notes in Computer Science. Lecture Notes in Computer Science. Cham: Springer Nature, 2026. p. 25-36. ISBN: 978-3-032-05059-5.
- THORBECKE, I.; VILLATORO-TELLO, E.; ZULUAGA, J.; KUMAR, S.; BURDISSO, S.; RANGAPPA, P.; CAROFILIS, A.; MADIKERI, S.; MOTLÍČEK, P.; PANDIA, K.; HACIOGLU, K.; STOLCKE, A. Unifying Global and Near-Context Biasing in a Single Trie Pass. In Lecture Notes in Artificial Intelligence. Lecture Notes in Computer Science. Cham: Springer Nature, 2026. p. 170-181. ISBN: 978-3-032-02547-0.
- VAŠKO, M.; HEROUT, A.; HRADIŠ, M. Archival Faces: Detection of Faces in Digitized Historical Documents. In Document Analysis and Recognition – ICDAR 2025 Workshops: Wuhan, China, September 20–21, 2025, Proceedings, Part II. Cham: Springer Nature Switzerland, 2026. p. 17-34. ISBN: 978-3-032-09370-7.
- VYKOPAL, I.; KARAMOLEGKOU, A.; KOPČAN, J.; PENG, Q.; JAVŮREK, T.; GREGOR, M.; ŠIMKO, M. Investigating Language and Retrieval Bias in Multilingual Previously Fact-Checked Claim Detection. Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers). Stroudsburg, PA, USA: Association for Computational Linguistics, 2026. p. 5195-5221. ISBN: 979-8-89176-380-7.
- VYKOPAL, I.; PIKULIAK, M.; OSTERMANN, S.; ŠIMKO, M. Assessing Web Search Credibility and Response Groundedness in Chat Assistants. Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers). Stroudsburg, PA, USA: Association for Computational Linguistics, 2026. p. 2539-2560. ISBN: 979-8-89176-380-7.
2025
- AKKIRAJU, B.; POTHULA, A.; KESIRAJU, S.; VUPPALA, A. IIITH-BUT system for IWSLT 2025 low-resource Bhojpuri to Hindi speech translation. Proceedings of the 22nd International Conference on Spoken Language Translation (IWSLT 2025). Vienna, Austria: Association for Computational Linguistics, 2025. p. 333-339. ISBN: 979-8-89176-272-5.
- POLOK, A.; HAN, J.; KLEMENT, D.; CORNELL, S.; ČERNOCKÝ, J.; BURGET, L. BUT System for the MLC-SLM Challenge. ISCA: ISCA, 2025. p. 23-27.
- ANIKINA, T.; ČEGIŇ, J.; ŠIMKO, J.; OSTERMANN, S. A Rigorous Evaluation of LLM Data Generation Strategies for Low-Resource Languages. Suzhou, China: Association for Computational Linguistics, 2025. p. 8293-8314. ISBN: 979-8-89176-332-6.
- ANIKINA, T.; VYKOPAL, I.; KULA, S.; CHIKKALA, K.; SKACHKOVA, N.; YANG, J.; SOLOPOVA, V.; SCHMITT, V.; OSTERMANN, S. dfkinit2b at CheckThat! 2025: Leveraging LLMs and Ensemble of Methods for Multilingual Claim Normalization. Madrid: Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2025), 2025.
- ANTOŠ, D.; ŠVEC, T.; HOŘÍNKOVÁ, J.; BARTEČKOVÁ, E. Borders of Physical Self in Virtual Reality: A Systematic Review of Virtual Hand Position Discrepancy Detection. Frontiers in Psychiatry, 2025, vol. 15, no. 1, p. 1-16.
- ANTTI, N.; KOHOUT, T.; KAŠPÁREK, T. The Asteroid Spectral Imager (ASPECT) on the Milani CubeSat. Space science reviews, 2025, vol. 2025, no. 221, p. 1-27.
- BARAHONA, S.; SILNOVA, A.; MOŠNER, L.; PENG, J.; PLCHOT, O.; ROHDIN, J.; ZHANG, L.; HAN, J.; PALKA, P.; LANDINI, F.; BURGET, L.; STAFYLAKIS, T.; CUMANI, S.; BOBOŠ, D.; HLAVAČEK, M.; KODOVSKY, M.; PAVLIČEK, T. Analysis of ABC Frontend Audio Systems for the NIST-SRE24. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam: International Speech Communication Association, 2025. p. 5763-5767.
- BAŘINA, D. Improved verification limit for the convergence of the Collatz conjecture. Journal of supercomputing, 2025, vol. 81, no. 1, p. 1-14. ISSN: 1573-0484.
- BELANEC, R.; OSTERMANN, S.; SRBA, I.; BIELIKOVÁ, M. Task Prompt Vectors: Effective Initialization through Multi-Task Soft Prompt Transfer. Springer, Berlin, Heidelberg, 2025. p. 77-94. ISBN: 978-3-662-72242-8.
- BEŇOVÁ, I.; GREGOR, M.; GATT, A. CV-Probes: Studying the interplay of lexical and world knowledge in visually grounded verb understanding. 2025. p. 4425-4433. ISBN: 1069-7977.
- BISWAS, S.; KHAN, M. A. A.; ALI, M. H.; ROHDIN, J.; PRAMANIK, S.; KHAN, M. I. A.; CHAKRAVARTY, S. K.; PRAMANIK, B. K. Interpreting Deep Neural Networks in Diabetic Retinopathy Grading: A Comparison with Human Decision Criteria. Life-Basel, 2025, vol. 15, no. 9, 25 p.
- CANCELLIERI, M.; DOČEKAL, M.; PRIDE, D.; GRUENPETER, M.; DOUARD, D.; KNOTH, P. Interoperable verification and dissemination of software assets in repositories using COAR Notify. 2025.
- CAROFILIS, A.; RANGAPPA, P.; MADIKERI, S.; KUMAR, S.; BURDISSO, S.; PRAKASH, J.; VILLATORO-TELLO, E.; MOTLÍČEK, P.; SHARMA, B.; HACIOGLU, K.; VENKATESAN, S.; VYAS, S.; STOLCKE, A. Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering. In Interspeech. Interspeech. Rotterdam, The Netherlands: Isca-Int Speech Communication Assoc, 2025. p. 3618-3622.
- CUMANI, S.; SILNOVA, A.; BARAHONA, S.; MOŠNER, L.; PLCHOT, O.; ROHDIN, J. Analysis of the ABC classification backends for NIST SRE24. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam: International Speech Communication Association, 2025. p. 3978-3982.
- ČADÍK, M. Visual Geo-Localization and Camera Pose Estimation in Natural Environments. Vědecké spisy Vysokého učení technického v Brně. Edice habilitační a inaugurační spisy. VUTIUM, 2025. p. 1-38.
- ČEGIŇ, J.; PECHER, B.; ŠIMKO, J.; SRBA, I.; BIELIKOVÁ, M.; BRUSILOVSKY, P. Use Random Selection for Now: Investigation of Few-Shot Selection Strategies in LLM-based Text Augmentation. Suzhou, China: Association for Computational Linguistics, 2025. p. 5533-5550. ISBN: 979-8-89176-335-7.
- ČEGIŇ, J.; ŠIMKO, J. LLMs vs Established Text Augmentation Techniques for Classification: When do the Benefits Outweight the Costs?. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). Albuquerque, New Mexico: Association for Computational Linguistics, 2025. p. 10476-10496. ISBN: 979-8-8917-6189-6.
- ČIEF, M.; KVETON, B.; KOMPAN, M. Cross-Validated Off-Policy Evaluation. In Proceedings of the AAAI Conference on Artificial Intelligence. Pennsylvania: 2025. p. 16073-16081. ISBN: 978-1-57735-897-8.
- DE LEON MARTINEZ, S.; KANG, J.; MORO, R.; DE RIJKE, M.; KVETON, B.; OOSTERHUIS, H.; BIELIKOVÁ, M. RecGaze: The First Eye Tracking and User Interaction Dataset for Carousel Interfaces. In SIGIR '25: Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: Association for Computing Machinery, 2025. p. 3702-3711. ISBN: 979-8-4007-1592-1.
- DE LEON, J.; DE LEON MARTINEZ, S.; ARTES-RODRIGUEZ, A.; BACA-GARCIA, E.; DE LAS CUEVAS, C. Reflections on the Potential and Risks of AI for Scientific Article Writing after the AI Endorsement by Some Scientific Publishers: Focusing on Scopus AI. Actas españolas de psiquiatría, 2025, vol. 53, no. 2, p. 433-442.
- DOHNAL, F.; ZEMAN, T.; BARTA, J.; PUPÍKOVÁ, J.; KINCL, P.; HUBÁČEK, M.; ŠTOLLER, J.; KLÍMA, O.; BAŘINA, D.; KUDLÁK, A.; RAK, J.; HUDYMA, N.; PAULUS, F. Ukrytí obyvatelstva před nebezpečím. Brno: Univerzita obrany, 2025. 119 p. ISBN: 978-80-7609-023-1.
- DRAHY, V.; MARIK, R.; KÄLVIÄINEN, H. Non-stationary Signal Analysis: Detrending and Anomaly Detection. In Lecture Notes in Computer Science. Lecture Notes in Computer Science. Cham: Springer Nature, 2025. p. 45-59. ISBN: 978-3-031-95910-3.
- FAJČÍK, M.; DOČEKAL, M.; DOLEŽAL, J.; ONDŘEJ, K.; BENEŠ, K.; SMRŽ, P.; POLOK, A.; HRADIŠ, M. BenCzechMark: A Czech-Centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism. Transactions of the Association for Computational Linguistics, 2025, vol. 13, no. 9, p. 1068-1095.
- FOUCHER, V.; DE LEON MARTINEZ, S.; MORO, R. Eye Movements as Indicators of Deception: A Machine Learning Approach. In ETRA '25: Proceedings of the 2025 Symposium on Eye Tracking Research and Applications. New York: ACM, 2025. p. 1-7. ISBN: 979-8-4007-1487-0.
- GAO, R.; LIU, X.; HU, Z.; XING, B.; XIA, B.; YU, Z.; KÄLVIÄINEN, H. FSBench: A Figure Skating Benchmark for Advancing Artistic Sports Understanding. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2025. p. 13595-13605.
- GRAUMAN, K.; WESTBURY, A.; BYRNE, E.; CARTILLIER, V.; CHAVIS, Z.; FURNARI, A.; GIRDHAR, R.; HAMBURGER, J.; JIANG, H.; KUKREJA, D.; LIU, M.; LIU, X.; MARTIN, M.; NAGARAJAN, T.; RADOSAVOVIC, I.; RAMAKRISHNAN, S.; RYAN, F.; SHARMA, J.; WRAY, M.; XU, M.; XU, E.; ZHAO, C.; BANSAL, S.; BATRA, D.; CRANE, S.; DO, T.; DOULATY, M.; ERAPALLI, A.; FEICHTENHOFER, C.; FRAGOMENI, A.; FU, Q.; GEBRESELASIE, A.; GONZALEZ, C.; HILLIS, J.; HUANG, X.; HUANG, Y.; JIA, W.; KHOO, W.; KOLAR, J.; KOTTUR, S.; KUMAR, A.; LANDINI, F.; LI, C.; LI, Y.; LI, Z.; MANGALAM, K.; MODHUGU, R.; MUNRO, J.; MURRELL, T.; NISHIYASU, T.; PRICE, W.; RUIZ PUENTES, P.; RAMAZANOVA, M.; SARI, L.; SOMASUNDARAM, K.; SOUTHERLAND, A.; SUGANO, Y.; TAO, R.; VO, M.; WANG, Y.; WU, X.; YAGI, T.; ZHAO, Z.; ZHU, Y.; ARBELAEZ, P.; CRANDALL, D.; DAMEN, D.; FARINELLA, G.; FUEGEN, C.; GHANEM, B.; KRISHNA, V.; JAWAHAR, C.; JOO, H.; KITANI, K.; LI, H.; NEWCOMBE, R.; OLIVA, A.; PARK, H.; REHG, J.; SATO, Y.; SHI, J.; ZHENG SHOU, M.; TORRALBA, A.; TORRESANI, L.; YAN, M.; MALIK, J. Ego4D: Around the World in 3,600 Hours of Egocentric Video. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025, vol. 47, no. 11, p. 9468-9509.
- GURGUROV, D.; VYKOPAL, I.; GENABITH, J.; OSTERMANN, S. Small Models, Big Impact: Efficient Corpus and Graph-Based Adaptation of Small Multilingual Language Models for Low-Resource Languages. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop). Vienna: Association for Computational Linguistics, 2025. p. 355-395. ISBN: 979-8-89176-254-1.
- HAN, J.; LANDINI, F.; ROHDIN, J.; SILNOVA, A.; DIEZ, M.; ČERNOCKÝ, J.; BURGET, L. Fine-tune Before Structured Pruning: Towards Compact and Accurate Self-Supervised Models for Speaker Diarization. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam, The Netherlands: International Speech Communication Association, 2025. p. 1583-1587.
- HANÁK, J.; NOVÁK, J.; CHUDÝ, P. Cognitive Agent Evaluation for Synthetic Pilot Training. In 2025 AIAA DATC/IEEE 44th Digital Avionics Systems Conference (DASC). Montreal, QC, Canada: IEEE, 2025. p. 1-10. ISBN: 979-8-3315-2519-4.
- HANÁK, J.; NOVÁK, J.; CHUDÝ, P.; BEN-ASHER, J. Cross-Entropy Method for Laser Defense Applications. Journal of Aerospace Information Systems, 2025, vol. 22, no. 1, p. 53-58. ISSN: 2327-3097.
- HEGDE, P.; KESIRAJU, S.; ŠVEC, J.; SEDLÁČEK, Š.; YUSUF, B.; PLCHOT, O.; DEEPAK, K.; ČERNOCKÝ, J. Factors affecting the in-context learning abilities of LLMs for dialogue state tracking. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam, The Netherlands: International Speech Communication Association, 2025. p. 4818-4822.
- HORI, T.; KOCOUR, M.; HAIDER, A.; MCDERMOTT, E.; ZHUANG, X. Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech Recognition. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025. p. 1-5. ISBN: 979-8-3503-6874-1.
- CHEN, X.; LIN, I.; ZHANG, L.; DU, J.; WU, H.; LEE, H.; JANG, J. Codec-Based Deepfake Source Tracing via Neural Audio Codec Taxonomy. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam, The Netherlands: International Speech Communication Association, 2025. p. 1538-1542.
- CHEN, X.; LU, W.; ZHANG, R.; XU, J.; LU, X.; ZHANG, L.; WEI, J. Continual Unsupervised Domain Adaptation for Audio Deepfake Detection. In Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing. Hyderabad, India: Institute of Electrical and Electronics Engineers Inc., 2025. p. 1-5. ISBN: 979-8-3503-6874-1.
- CHIKKALA, K.; ANIKINA, T.; SKACHKOVA, N.; VYKOPAL, I.; AGERRI, R.; GENABITH, J. Automatic Fact-checking in English and Telugu. Shoumen, Bulgaria: INCOMA Ltd., 2025. p. 140-151.
- CHLUBNA, T.; MILET, T.; ZEMČÍK, P. How Color Profile Affects the Visual Quality in Light Field Rendering and Novel View Synthesis. Multimedia Tools and Applications, 2025, vol. 84, no. 14, p. 11079-11095.
- CHLUBNA, T.; MILET, T.; ZEMČÍK, P. Light Field Video Streaming on GPU. Signal Processing-Image Communication, 2025, vol. 2025, no. 138, 12 p.
- CHLUBNA, T.; VLNAS, M.; BAŘINA, D.; MILET, T.; ZEMČÍK, P. Focus-aware compression and image quality metric for 3D displays. Signal processing, 2025, vol. 2026, no. 238, p. 1-14. ISSN: 0165-1684.
- CHLUBNA, T.; VLNAS, M.; MILET, T.; ZEMČÍK, P. Survey of FOSS 3D/2D Graphics Software Blender Usage in Science, Academia, and Industry. The visual computer, 2025, vol. 42, no. 1, p. 1-32.
- CHLUBNA, T.; ZEMČÍK, P. Comparative Survey of Image Compression Methods Across Different Pixel Formats and Bit Depths. Signal Image and Video Processing, 2025, vol. 19, no. 12, 13 p.
- INGROVA, P.; KRALIK, M.; POLCEROVA, L.; PAVLIKOVA, V.; KLÍMA, O.; CUTA, M. Relationships between Sociosexuality and Dermatoglyphic Traits. Anthropological Review, 2025, vol. 88, no. 1, p. 33-60.
- BEŇOVÁ, I.; KOŠECKÁ, J.; GREGOR, M.; TAMAJKA, M.; VESELÝ, M.; ŠIMKO, M. Beyond Image-Text Matching: Verb Understanding in Multimodal Transformers Using Guided Masking. In SOFSEM 2025: Theory and Practice of Computer Science. Lecture Notes in Computer Science. Cham: Springer Nature, 2025. p. 80-93. ISBN: 978-3-031-82669-6.
- JAROLÍM, A.; FAJČÍK, M.; MAKAIOVÁ, L. Can LLMs Extract Human-like Fine-grained Evidence for Evidence-based Fact-checking?. In Proceedings of the Nineteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2025. Recent Advances in Slavonic Natural Language Processing. 2025. no. 2025, p. 25-36. ISBN: 978-80-263-1858-3.
- KANG, J.; DE RIJKE, M.; DE LEON MARTINEZ, S.; OOSTERHUIS, H. Rethinking Click Models in Light of Carousel Interfaces: Theory-Based Categorization and Design of Click Models. In ICTIR '25: Proceedings of the 2025 International ACM SIGIR Conference on Innovative Concepts and Theories in Information Retrieval (ICTIR). New York City: Association for Computing Machinery, 2025. p. 44-55. ISBN: 979-8-4007-1861-8.
- KAREINEN, J.; EEROLA, T.; KRAFT, K.; LENSU, L.; SUIKKANEN, S.; KÄLVIÄINEN, H. Self-Supervised Pretraining for Fine-Grained Plankton Recognition. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. IEEE Computer Society, 2025. p. 2122-2132. ISBN: 9798331599942.
- KAREINEN, J.; SKYTTA, A.; EEROLA, T.; KRAFT, K.; LENSU, L.; SUIKKANEN, S.; LEHTINIEMI, M.; KÄLVIÄINEN, H. Open-Set Plankton Recognition. In Lecture Notes in Computer Science. Lecture Notes in Computer Science. Milan, Italy: Springer Nature, 2025. p. 168-184. ISBN: 978-3-031-91671-7.
- KHURANA, S.; KLEMENT, D.; LAURENT, A.; BOBOS, D.; NOVOSAD, J.; GAZDIK, P.; ZHANG, E.; HUANG, Z.; HUSSEIN, A.; MARXER, R.; MASUYAMA, Y.; AIHARA, R.; HORI, C.; GERMAIN, F.; WICHERN, G.; LE ROUX, J. Factorized RVQ-GAN For Disentangled Speech Tokenization. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam, The Netherlands: International Speech Communication Association, 2025. p. 3514-3518.
- KIŠŠ, M.; HRADIŠ, M. Masked Self-Supervised Pre-Training for Text Recognition Transformers on Large-Scale Datasets. In Document Analysis and Recognition – ICDAR 2025 Workshops. Cham: Springer Nature Switzerland, 2025. p. 53-70. ISBN: 978-3-032-09367-7.
- KOHÚT, J.; DOČEKAL, M.; HRADIŠ, M.; VAŠKO, M. BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction. In Document Analysis and Recognition – ICDAR 2025. Cham: Springer Nature Switzerland, 2025. p. 287-304. ISBN: 978-3-032-04623-9.
- KOHÚT, J.; HRADIŠ, M. Practical Fine-Tuning of Autoregressive Models on Limited Handwritten Texts. In Document Analysis and Recognition – ICDAR 2025. Cham: Springer Nature Switzerland, 2025. p. 22-39. ISBN: 978-3-032-04629-1.
- KOSTELNÍK, M.; HRADIŠ, M.; BENEŠ, K. TextBite: A Historical Czech Document Dataset for Logical Page Segmentation. In Document Analysis and Recognition – ICDAR 2025 Workshops. Cham: Springer Nature Switzerland, 2025. p. 124-140. ISBN: 978-3-032-09367-7.
- KUBÍK, T.; GUIBAULT, F.; ŠPANĚL, M.; LOMBAERT, H. ToothForge: Automatic Dental Shape Generation using Synchronized Spectral Embeddings. In Proceedings of Information Processing in Medical Imaging 2025. Lecture Notes in Computer Science. Kos: Springer Science and Business Media Deutschland GmbH, 2025. p. 313-326. ISBN: 9783031966248.
- KUBÍK, T.; KODYM, O.; ŠILLING, P.; TRÁVNÍČKOVÁ, K.; MOJŽIŠ, T.; MATULA, J. Leveraging Point Transformers for Detecting Anatomical Landmarks in Digital Dentistry. In Lecture Notes in Computer Science. Lecture Notes in Computer Science. Springer Science and Business Media Deutschland GmbH, 2025. no. 15571 LNCS, p. 216-228. ISBN: 9783031889769.
- KUMAR, S.; THORBECKE, I.; BURDISSO, S.; VILLATORO-TELLO, E.; MANJUNATH, K.; HACIOGLU, K.; RANGAPPA, P.; MOTLÍČEK, P.; GANAPATHIRAJU, A.; STOLCKE, A. Performance Evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward. In 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW. Hyderabad, India: IEEE, 2025. p. 1-5. ISBN: 979-8-3315-1932-2.
- LI, D.; XING, B.; LIU, X.; XIA, B.; WEN, B.; KÄLVIÄINEN, H. DEEMO: De-identity Multimodal Emotion Recognition and Reasoning. MM '25: Proceedings of the 33rd ACM International Conference on Multimedia. New York, NY, USA: ACM, 2025. p. 5707-5716. ISBN: 979-8-4007-2035-2.
- LI, J.; MAK, M.; ROHDIN, J.; LEE, K.; HERMANSKY, H. Bayesian Learning for Domain-Invariant Speaker Verification and Anti-Spoofing. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam: International Speech Communication Association, 2025. p. 1123-1127.
- LI, S.; WANG, S.; HAN, J.; ZHANG, K.; WANG, W.; LI, H. REAL-T: Real Conversational Mixtures for Target Speaker Extraction. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam, The Netherlands: International Speech Communication Association, 2025. p. 1923-1927.
- LOJDA, J.; JOYCE, D.; SMRŽ, P.; KATHURIA, S.; STRNADEL, J.; QUINN, C.; ŠIMEK, V.; STAROŇ, P. Portable Simulation Models for Energy Aspects of IoT Devices in the LoLiPoP-IoT Project. 2025 28th Euromicro Conference on Digital System Design (DSD). Salerno: IEEE Computer Society, 2025. p. 368-375. ISBN: 979-8-3315-8499-3.
- LOJDA, J.; STRNADEL, J.; SMRŽ, P.; ŠIMEK, V. Multi-Partner Project: LoLiPoP-IoT - Design and Simulation of Energy-Efficient Devices for the Internet of Things. In 2025 Design, Automation & Test in Europe Conference (DATE) Proceedings. Lyon: Institute of Electrical and Electronics Engineers, 2025. p. 1-7. ISBN: 978-3-9826741-0-0.
- LUONG, H.; LI, H.; ZHANG, L.; LEE, K.; CHNG, E. LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation. In Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing. Hyderabad, India: Institute of Electrical and Electronics Engineers Inc., 2025. p. 1-5. ISBN: 979-8-3503-6874-1.
- MA, X.; ZHANG, R.; WEI, J.; LU, X.; XU, J.; ZHANG, L.; LU, W. Self-distillation-based domain exploration for source speaker verification under spoofed speech from unknown voice conversion. Speech communication, 2025, vol. 167, no. 103153, p. 1-12.
- MADIKERI, S.; MOTLÍČEK, P.; SANCHEZ-CORTES, D.; RANGAPPA, P.; HUGHES, J.; TKACZUK, J.; LARA, A.; KHALIL, D.; ROHDIN, J.; ZHU, D.; KRISHNAN, A.; KLAKOW, D.; AHMADI, Z.; KOVAC, M.; BOBOS, D.; KALOGIROS, C.; ALEXOPOULOS, A.; MARRAUD, D. Autocrime-open multimodal platform for combating organized crime. Forensic Science International: Digital Investigation, 2025, vol. 54, no. 9, p. 1-14.
- MAKAIOVÁ, L.; FAJČÍK, M.; JAROLÍM, A. Examining the Metrics for Document-Level Claim Extraction in Czech and Slovak. In Proceedings of the Nineteenth Workshop on Recent Advances in Slavonic Natural Languages Processing. Recent Advances in Slavonic Natural Language Processing. 2025. no. 2025, p. 15-24. ISBN: 978-80-263-1858-3.
- ROZSÍVAL, M.; MATOUŠEK, P.; KOTALA, J. Poster: Multi-Agent LLM System for Cisco Router Configuration. In 2025 23rd International Symposium on Network Computing and Applications (NCA). Lisbon, Portugal: IEEE, 2025. p. 306-307. ISBN: 979-8-3315-7842-8.
- PÁLKA, P.; LANDINI, F.; KLEMENT, D.; DIEZ SÁNCHEZ, M.; SILNOVA, A.; BURGET, L.; DELCROIX, M. Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization. In Proceedings of 33rd European Signal Processing Conference (EUSIPCO 2025). Palermo: IEEE Signal Processing Society, 2025. p. 31-35. ISBN: 978-9-46-459362-4.
- PEČIVA, J. Vulkan Tutorial - Chapter 1: Devices, Instance and Loader. 2025. p. 1-5.
- PECHER, B.; SRBA, I.; BIELIKOVÁ, M. A Survey on Stability of Learning with Limited Labelled Data and its Sensitivity to the Effects of Randomness. ACM Computing Surveys, 2025, vol. 57, no. 1, p. 1-40.
- PENG, J.; ASHIHARA, T.; DELCROIX, M.; OCHIAI, T.; PLCHOT, O.; ARAKI, S.; ČERNOCKÝ, J. TS-SUPERB: A Target Speech Processing Benchmark for Speech Self-Supervised Learning Models. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025. p. 1-5. ISBN: 979-8-3503-6874-1.
- PENG, J.; MOŠNER, L.; ZHANG, L.; PLCHOT, O.; STAFYLAKIS, T.; BURGET, L.; ČERNOCKÝ, J. CA-MHFA: A Context-Aware Multi-Head Factorized Attentive Pooling for SSL-Based Speaker Verification. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025. p. 1-5. ISBN: 979-8-3503-6874-1.
- POLOK, A.; KLEMENT, D.; WIESNER, M.; KHUDANPUR, S.; ČERNOCKÝ, J.; BURGET, L. Target Speaker ASR with Whisper. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025. p. 1-5. ISBN: 979-8-3503-6874-1.
- POTHULA, A.; AKKIRAJU, B.; BANDARUPALLI, S.; D, C.; KESIRAJU, S.; VUPPALA, A. End-to-End Speech Translation for Low-Resource Languages Using Weakly Labeled Data. In Interspeech 2025. Interspeech. Rotterdam: ISCA, 2025. p. 41-45.
- RANGAPPA, P.; CAROFILIS, A.; PRAKASH, J.; KUMAR, S.; BURDISSO, S.; MADIKERI, S.; VILLATORO-TELLO, E.; SHARMA, B.; MOTLÍČEK, P.; HACIOGLU, K.; VENKATESAN, S.; VYAS, S.; STOLCKE, A. Efficient Data Selection for Domain Adaptation of ASR Using Pseudo-Labels and Multi-Stage Filtering. In Interspeech. Interspeech. Rotterdam, The Netherlands: Isca-Int Speech Communication Assoc, 2025. p. 4928-4932.
- REPKA, S.; REICH, B.; ZOLOTAREV, F.; EEROLA, T.; ZEMČÍK, P. Mineral segmentation using electron microscope images and spectral sampling through multimodal graph neural networks. Pattern Recognition Letters, 2025, vol. 193, no. 193, p. 79-85.
- SANZ-GOMEZ, S.; VERA-VARELA, C.; DE-LA-VEGA-SANCHEZ, D.; BARRIGON, M.; ALACREU-CRESPO, A.; GUIJA, J.; SANCHEZ, A.; DE LEON MARTINEZ, S.; BACA-GARCIA, E.; GINER, L. Impulsivity and aggression in suicide across age and sex: case–control study. BJPsych Open, 2025, vol. 11, no. 5, 9 p.
- SEBUYOYA, R.; SEVCIKOVA, S.; YUSUF, B.; BARTOSIK, M. Integrating isothermal amplification techniques and LNA-based AI-assisted electrochemical bioassay for analysis of KRAS G12V point mutation. Talanta, 2025, vol. 127709, no. 288, p. 1-10.
- SEDLÁČEK, Š.; YUSUF, B.; ŠVEC, J.; HEGDE, P.; KESIRAJU, S.; PLCHOT, O.; ČERNOCKÝ, J. Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam, The Netherlands: International Speech Communication Association, 2025. p. 1748-1752.
- SKOG, K.; KOHOUT, T.; KAŠPÁREK, T.; WOLFMAYR, M. Lossless Hyperspectral Image Compression in Comet Interceptor and Hera Missions with Restricted Bandwith. Remote Sensing, 2025, vol. 17, no. 899, p. 1-18. ISSN: 2072-4292.
- ŠILLING, P.; ŠPANĚL, M. DEMIS: Electron Microscopy Image Stitching using Deep Learning Features and Global Optimisation. Proceedings of the 18th International Joint Conference on Biomedical Engineering Systems and Technologies - BIOIMAGING. Porto: Institute for Systems and Technologies of Information, Control and Communication, 2025. p. 255-256. ISBN: 978-989-758-731-3.
- ŠIMEČKOVÁ, M.; KARAFIÁT, M.; PLCHOT, O. Using machine learning for automatic dialect detection. New methods in Czech dialectology. In Slovanské dialekty v době digitálních technologií. Nářeční prameny a jejich současné zpracování. Praha: Slovanský ústav AV ČR, 2025. p. 297-307. ISBN: 978-80-86420-99-8.
- TIAN, J.; SHI, J.; CHEN, W.; ARORA, S.; MASUYAMA, Y.; MAEKAKU, T.; WU, Y.; PENG, J.; BHARADWAJ, S.; ZHAO, Y.; CORNELL, S.; PENG, Y.; YUE, X.; YANG, C.; NEUBIG, G.; WATANABE, S. ESPnet-SpeechLM: An Open Speech Language Model Toolkit. In Proceedings of the 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies: Long Papers, NAACL-HLT 2025. Hybrid, Albuquerque, New Mexico, USA: Association for Computational Linguistics (ACL), 2025. p. 116-124. ISBN: 9798891761919.
- VAŠKO, M.; HEROUT, A. LossFIQA: A Shortcut Solution to Image Quality Assessment Using Loss for Faces and Beyond. IEEE Access, 2025, vol. 13, no. 7, p. 126915-126924.
- VISHWANATH, U.; BHATTACHARJEE, T.; DEEKSHITHA, G.; UDUPA, S.; THIRUMALA, K.; KEERTHIPRIYA, M.; CHIKKTIMMEGOWDA, D.; BASKAR, D.; BELUR, Y.; VENGALIL, S.; NALINI, A.; GHOSH, P. Comparison of Acoustic and Textual Features for Dysarthria Severity Classification in Amyotrophic Lateral Sclerosis. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Interspeech. Rotterdam, The Netherlands: Isca-Int Speech Communication Assoc, 2025. p. 803-807.
- VLNAS, M.; MILET, T.; ZEMČÍK, P. Low-error Reconstruction of Directional Functions with Spherical Harmonics. IEEE transactions on visualization and computer graphics, 2025, vol. 31, no. 10, p. 8413-8424. ISSN: 1077-2626.
- VYKOPAL, I.; OSTERMANN, S.; ŠIMKO, M. Soft Language Prompts for Language Transfer. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). Albuquerque: Association for Computational Linguistics, 2025. p. 10294-10313. ISBN: 979-8-8917-6189-6.
- VYKOPAL, I.; PIKULIAK, M.; OSTERMANN, S.; ANIKINA, T.; GREGOR, M.; ŠIMKO, M. Large Language Models for Multilingual Previously Fact-Checked Claim Detection. Suzhou, China: Association for Computational Linguistics, 2025. p. 15741-15765. ISBN: 979-8-8917-6335-7.
- XING, B.; YUAN, K.; YU, Z.; LIU, X.; KÄLVIÄINEN, H. AU-TTT: Vision Test-Time Training model for Facial Action Unit Detection. In Proceedings IEEE International Conference on Multimedia and Expo. Proceedings (IEEE International Conference on Multimedia and Expo). IEEE Computer Society, 2025. 11 p. ISBN: 9798331594954.
- YAN, B.; HAMED, I.; SHIMIZU, S.; LODAGALA, V.; CHEN, W.; IAKOVENKO, O.; TALAFHA, B.; HUSSEIN, A.; POLOK, A.; CHANG, K.; KLEMENT, D.; ALTHUBAITI, S.; PENG, P.; WIESNER, M.; SOLORIO, T.; ALI, A.; KHUDANPUR, S.; WATANABE, S. CS-FLEURS: A Massively Multilingual and Code-Switched Speech Dataset. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Interspeech. Rotterdam, The Netherlands: ISCA, 2025. p. 743-747.
- YU, Z.; LIU, X.; DAMER, N.; FAN, D.; SHI, J.; GUO, X.; LIN, X.; WEN, B.; KONG, A.; KÄLVIÄINEN, H.; SCHULLER, B.; CAO, X. SVC 2025 Chairs’ Welcome. SVC 2025 Proceedings of the 1st International Workshop and Challenge on Subtle Visual Computing Co-Located with MM 2025. Association for Computing Machinery, Inc, 2025. ISBN: 9798400718373.
- ZEINALI, H.; LEE, K.; ALAM, J.; BURGET, L. Text-dependent Speaker Verification Challenge 2024: Exploring Shared and User-defined Passphrases. In 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Proceedings of the ... IEEE International Conference on Acoustics, Speech, and Signal Processing. Hyderabad, India: Institute of Electrical and Electronics Engineers Inc., 2025. p. 1-5.
- ZHANG, R.; WEI, J.; LU, X.; ZHANG, L.; JIN, D.; LU, W.; XU, J. Multi-Sinkhorn Teacher Knowledge Aggregation Framework for Adaptive Audio Anti-Spoofing. IEEE Transactions on Audio, Speech, and Language Processing, 2025, no. 33, p. 3850-3865.
- ZHANG, R.; WEI, J.; LU, X.; ZHANG, L.; JIN, D.; XU, J.; LU, W. SHDA: Sinkhorn Domain Attention for Cross-Domain Audio Anti-Spoofing. IEEE Transactions on Information Forensics and Security, 2025, no. 20, p. 6474-6489.
- ZHANG, Y.; TIAN, B.; ZHANG, L.; DUAN, Z. PartialEdit: Identifying Partial Deepfakes in the Era of Neural Speech Editing. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Interspeech. Rotterdam, The Netherlands: ISCA, 2025. p. 5353-5357.