Department of Computer Graphics and Multimedia

Author Title Keyword Year Years Result Type

2026

CHLUBNA, T. vkCompViz: Universal C++ Library for GPU-Based Experiments. Journal of open source software, 2026, vol. 11, no. 117, 5 p. Detail
GURGUROV, D.; TRINLEY, K.; VYKOPAL, I.; VAN GENABITH, J.; OSTERMANN, S.; ZAMPARELLI, R. Multilingual Political Views of Large Language Models: Identification and Steering. Mumbai, India: Association for Computational Linguistics, 2026. p. 279-298. ISBN: 979-8-89176-303-6. Detail
KIŠŠ, M.; HRADIŠ, M.; DVOŘÁKOVÁ, M.; JIROUŠEK, V.; KERSCH, F. AnnoPage Dataset: Dataset of Non-Textual Elements in Documents with Fine-Grained Categorization. In Document Analysis and Recognition – ICDAR 2025 Workshops. Cham: Springer Nature Switzerland, 2026. p. 50-66. ISBN: 978-3-032-09370-7. Detail
POLOK, A.; KLEMENT, D.; KOCOUR, M.; HAN, J.; LANDINI, F.; YUSUF, B.; WIESNER, M.; KHUDANPUR, S.; ČERNOCKÝ, J.; BURGET, L. DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition. COMPUTER SPEECH AND LANGUAGE, 2026, vol. 95, no. 1, p. 1-19. Detail

2025

AKKIRAJU, B.; POTHULA, A.; KESIRAJU, S.; VUPPALA, A. IIITH-BUT system for IWSLT 2025 low-resource Bhojpuri to Hindi speech translation. Proceedings of the 22nd International Conference on Spoken Language Translation (IWSLT 2025). Vienna, Austria: Association for Computational Linguistics, 2025. p. 333-339. ISBN: 979-8-89176-272-5. Detail
Alexander Polok, Jiangyu Han, Dominik Klement, Samuele Cornell, Jan Černocký, Lukáš Burget. BUT System for the MLC-SLM Challenge. ISCA: ISCA, 2025. p. 23-27. Detail
ANIKINA, T.; ČEGIŇ, J.; ŠIMKO, J.; OSTERMANN, S. A Rigorous Evaluation of LLM Data Generation Strategies for Low-Resource Languages. Suzhou, China: Association for Computational Linguistics, 2025. p. 8293-8314. ISBN: 979-8-89176-332-6. Detail
ANIKINA, T.; VYKOPAL, I.; KULA, S.; CHIKKALA, K.; SKACHKOVA, N.; YANG, J.; SOLOPOVA, V.; SCHMITT, V.; OSTERMANN, S. dfkinit2b at CheckThat! 2025: Leveraging LLMs and Ensemble of Methods for Multilingual Claim Normalization. Madrid: Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2025), 2025. Detail
ANTTI, N.; KOHOUT, T.; KAŠPÁREK, T. The Asteroid Spectral Imager (ASPECT) on the Milani CubeSat. SPACE SCIENCE REVIEWS, 2025, vol. 2025, no. 221, p. 1-27. ISSN: 1572-9672. Detail
BARAHONA, S.; SILNOVA, A.; MOŠNER, L.; PENG, J.; PLCHOT, O.; ROHDIN, J.; ZHANG, L.; HAN, J.; PALKA, P.; LANDINI, F.; BURGET, L.; STAFYLAKIS, T.; CUMANI, S.; BOBOŠ, D.; HLAVAČEK, M.; KODOVSKY, M.; PAVLIČEK, T. Analysis of ABC Frontend Audio Systems for the NIST-SRE24. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam: International Speech Communication Association, 2025. p. 5763-5767. Detail
BAŘINA, D. Improved verification limit for the convergence of the Collatz conjecture. JOURNAL OF SUPERCOMPUTING, 2025, vol. 81, no. 1, p. 1-14. ISSN: 1573-0484. Detail
BELANEC, R.; OSTERMANN, S.; SRBA, I.; BIELIKOVÁ, M. Task Prompt Vectors: Effective Initialization through Multi-Task Soft Prompt Transfer. Springer, Berlin, Heidelberg, 2025. p. 77-94. ISBN: 978-3-662-72242-8. Detail
BEŇOVÁ, I.; GREGOR, M.; GATT, A. CV-Probes: Studying the interplay of lexical and world knowledge in visually grounded verb understanding. 2025. p. 4425-4433. ISBN: 1069-7977. Detail
CANCELLIERI, M.; DOČEKAL, M.; PRIDE, D.; GRUENPETER, M.; DOUARD, D.; KNOTH, P. Interoperable verification and dissemination of software assets in repositories using COAR Notify. 2025. Detail
ČEGIŇ, J.; PECHER, B.; ŠIMKO, J.; SRBA, I.; BIELIKOVÁ, M.; BRUSILOVSKY, P. Use Random Selection for Now: Investigation of Few-Shot Selection Strategies in LLM-based Text Augmentation. Suzhou, China: Association for Computational Linguistics, 2025. p. 5533-5550. ISBN: 979-8-89176-335-7. Detail
ČEGIŇ, J.; ŠIMKO, J. LLMs vs Established Text Augmentation Techniques for Classification: When do the Benefits Outweight the Costs?. Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). Albuquerque, New Mexico: Association for Computational Linguistics, 2025. p. 10476-10496. ISBN: 979-8-8917-6189-6. Detail
CHEN, X.; LIN, I.; ZHANG, L.; DU, J.; WU, H.; LEE, H.; JANG, J. Codec-Based Deepfake Source Tracing via Neural Audio Codec Taxonomy. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam, Nizozemí: International Speech Communication Association, 2025. p. 1538-1542. Detail
CHEN, X.; LU, W.; ZHANG, R.; XU, J.; LU, X.; ZHANG, L.; WEI, J. Continual Unsupervised Domain Adaptation for Audio Deepfake Detection. In Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing. Hyderabad, Indická republika: Institute of Electrical and Electronics Engineers Inc., 2025. p. 1-5. ISBN: 979-8-3503-6874-1. Detail
CHIKKALA, K.; ANIKINA, T.; SKACHKOVA, N.; VYKOPAL, I.; AGERRI, R.; GENABITH, J. Automatic Fact-checking in English and Telugu. Shoumen, Bulgaria: INCOMA Ltd., 2025. p. 140-151. Detail
CHLUBNA, T.; MILET, T.; ZEMČÍK, P. How Color Profile Affects the Visual Quality in Light Field Rendering and Novel View Synthesis. MULTIMEDIA TOOLS AND APPLICATIONS, 2025, vol. 84, no. 14, p. 11079-11095. Detail
CHLUBNA, T.; MILET, T.; ZEMČÍK, P. Light Field Video Streaming on GPU. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2025, vol. 2025, no. 138, 12 p. Detail
CHLUBNA, T.; VLNAS, M.; BAŘINA, D.; MILET, T.; ZEMČÍK, P. Focus-aware compression and image quality metric for 3D displays. SIGNAL PROCESSING, 2025, vol. 2026, no. 238, p. 1-14. ISSN: 0165-1684. Detail
CHLUBNA, T.; VLNAS, M.; MILET, T.; ZEMČÍK, P. Survey of FOSS 3D/2D Graphics Software Blender Usage in Science, Academia, and Industry. The visual computer, 2025, vol. 42, no. 1, p. 1-32. Detail
CHLUBNA, T.; ZEMČÍK, P. Comparative Survey of Image Compression Methods Across Different Pixel Formats and Bit Depths. Signal Image and Video Processing, 2025, vol. 19, no. 12, 13 p. Detail
CUMANI, S.; SILNOVA, A.; BARAHONA, S.; MOŠNER, L.; PLCHOT, O.; ROHDIN, J. Analysis of the ABC classification backends for NIST SRE24. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam: International Speech Communication Association, 2025. p. 3978-3982. Detail
FAJČÍK, M.; DOČEKAL, M.; DOLEŽAL, J.; ONDŘEJ, K.; BENEŠ, K.; SMRŽ, P.; POLOK, A.; HRADIŠ, M. BenCzechMark : A Czech-Centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism. Transactions of the Association for Computational Linguistics, 2025, vol. 13, no. 9, p. 1068-1095. Detail
GURGUROV, D.; VYKOPAL, I.; GENABITH, J.; OSTERMANN, S. Small Models, Big Impact: Efficient Corpus and Graph-Based Adaptation of Small Multilingual Language Models for Low-Resource Languages. Vienna: Association for Computational Linguistics, 2025. ISBN: 979-8-89176-254-1. Detail
HAN, J.; LANDINI, F.; ROHDIN, J.; SILNOVA, A.; DIEZ SÁNCHEZ, M.; BURGET, L. Leveraging Self-Supervised Learning for Speaker Diarization. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025. p. 1-5. ISBN: 979-8-3503-6874-1. Detail
HAN, J.; LANDINI, F.; ROHDIN, J.; SILNOVA, A.; DIEZ, M.; ČERNOCKÝ, J.; BURGET, L. Fine-tune Before Structured Pruning: Towards Compact and Accurate Self-Supervised Models for Speaker Diarization. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam, The Netherlands: International Speech Communication Association, 2025. p. 1583-1587. Detail
HANÁK, J.; NOVÁK, J.; CHUDÝ, P.; BEN-ASHER, J. Cross-Entropy Method for Laser Defense Applications. Journal of Aerospace Information Systems, 2025, vol. 22, no. 1, p. 53-58. ISSN: 2327-3097. Detail
HEGDE, P.; KESIRAJU, S.; ŠVEC, J.; SEDLÁČEK, Š.; YUSUF, B.; PLCHOT, O.; DEEPAK, K.; ČERNOCKÝ, J. Factors affecting the in-context learning abilities of LLMs for dialogue state tracking. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam, The Netherlands: International Speech Communication Association, 2025. p. 4818-4822. Detail
HORI, T.; KOCOUR, M.; HAIDER, A.; MCDERMOTT, E.; ZHUANG, X. Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech Recognition. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025. p. 1-5. ISBN: 979-8-3503-6874-1. Detail
Ivana Beňová, Jana Košecká, Michal Gregor, Martin Tamajka, Marcel Veselý, Marián Šimko. Beyond Image-Text Matching: Verb Understanding in Multimodal Transformers Using Guided Masking. In SOFSEM 2025: Theory and Practice of Computer Science. Lecture Notes in Computer Science. CHAM: Springer Nature, 2025. p. 80-93. ISBN: 978-3-031-82669-6. Detail
KHURANA, S.; KLEMENT, D.; LAURENT, A.; BOBOS, D.; NOVOSAD, J.; GAZDIK, P.; ZHANG, E.; HUANG, Z.; HUSSEIN, A.; MARXER, R.; MASUYAMA, Y.; AIHARA, R.; HORI, C.; GERMAIN, F.; WICHERN, G.; LE ROUX, J. Factorized RVQ-GAN For Disentangled Speech Tokenization. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam, The Netherlands: International Speech Communication Association, 2025. p. 3514-3518. Detail
KIŠŠ, M.; HRADIŠ, M. Masked Self-Supervised Pre-Training for Text Recognition Transformers on Large-Scale Datasets. In Document Analysis and Recognition – ICDAR 2025 Workshops. Cham: Springer Nature Switzerland, 2025. p. 53-70. ISBN: 978-3-032-09367-7. Detail
KUBÍK, T.; GUIBAULT, F.; ŠPANĚL, M.; LOMBAERT, H. ToothForge: Automatic Dental Shape Generation using Synchronized Spectral Embeddings. Proceedings of Information Processing in Medical Imaging 2025. Kos: 2025. p. 1-14. Detail
LI, J.; MAK, M.; ROHDIN, J.; LEE, K.; HERMANSKY, H. Bayesian Learning for Domain-Invariant Speaker Verification and Anti-Spoofing. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam: International Speech Communication Association, 2025. p. 1123-1127. Detail
LI, S.; WANG, S.; HAN, J.; ZHANG, K.; WANG, W.; LI, H. REAL-T: Real Conversational Mixtures for Target Speaker Extraction. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam, The Netherlands: International Speech Communication Association, 2025. p. 1923-1927. Detail
LOJDA, J.; STRNADEL, J.; SMRŽ, P.; ŠIMEK, V. Multi-Partner Project: LoLiPoP-IoT - Design and Simulation of Energy-Efficient Devices for the Internet of Things. In 2025 Design, Automation & Test in Europe Conference (DATE) Proceedings. Lyon: Institute of Electrical and Electronics Engineers, 2025. p. 1-7. ISBN: 978-3-9826741-0-0. Detail
LUONG, H.; LI, H.; ZHANG, L.; LEE, K.; CHNG, E. LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation. In Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing. Hyderabad, Indická republika: Institute of Electrical and Electronics Engineers Inc., 2025. p. 1-5. ISBN: 979-8-3503-6874-1. Detail
Michal Rozsíval, Petr Matoušek, Jaromír Kotala. Poster: Multi-Agent LLM System for Cisco Router Configuration. In 2025 23rd International Symposium on Network Computing and Applications (NCA). Lisbon, Portugal: IEEE, 2025. p. 306-307. ISBN: 979-8-3315-7842-8. Detail
PÁLKA, P.; LANDINI, F.; KLEMENT, D.; DIEZ SÁNCHEZ, M.; SILNOVA, A.; BURGET, L.; DELCROIX, M. Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization. Palermo: IEEE Signal Processing Society, 2025. p. 31-35. ISBN: 978-9-46-459362-4. Detail
PENG, J.; ASHIHARA, T.; DELCROIX, M.; OCHIAI, T.; PLCHOT, O.; ARAKI, S.; ČERNOCKÝ, J. TS-SUPERB: A Target Speech Processing Benchmark for Speech Self-Supervised Learning Models. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025. p. 1-5. ISBN: 979-8-3503-6874-1. Detail
PENG, J.; MOŠNER, L.; ZHANG, L.; PLCHOT, O.; STAFYLAKIS, T.; BURGET, L.; ČERNOCKÝ, J. CA-MHFA: A Context-Aware Multi-Head Factorized Attentive Pooling for SSL-Based Speaker Verification. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025. p. 1-5. ISBN: 979-8-3503-6874-1. Detail
POLOK, A.; KLEMENT, D.; WIESNER, M.; KHUDANPUR, S.; ČERNOCKÝ, J.; BURGET, L. Target Speaker ASR with Whisper. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025. p. 1-5. ISBN: 979-8-3503-6874-1. Detail
POTHULA, A.; AKKIRAJU, B.; BANDARUPALLI, S.; D, C.; KESIRAJU, S.; VUPPALA, A. End-to-End Speech Translation for Low-Resource Languages Using Weakly Labeled Data. In Interspeech 2025. Interspeech. Rotterdam: ISCA, 2025. p. 41-45. Detail
SEBUYOYA, R.; SEVCIKOVA, S.; YUSUF, B.; BARTOSIK, M. Integrating isothermal amplification techniques and LNA-based AI-assisted electrochemical bioassay for analysis of KRAS G12V point mutation. TALANTA, 2025, vol. 127709, no. 288, p. 1-10. Detail
SEDLÁČEK, Š.; YUSUF, B.; ŠVEC, J.; HEGDE, P.; KESIRAJU, S.; PLCHOT, O.; ČERNOCKÝ, J. Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam, The Netherlands: International Speech Communication Association, 2025. p. 1748-1752. Detail
ŠILLING, P.; ŠPANĚL, M. DEMIS: Electron Microscopy Image Stitching using Deep Learning Features and Global Optimisation. Proceedings of the 18th International Joint Conference on Biomedical Engineering Systems and Technologies - BIOIMAGING. Porto: Institute for Systems and Technologies of Information, Control and Communication, 2025. p. 255-256. ISBN: 978-989-758-731-3. Detail
SKOG, K.; KOHOUT, T.; KAŠPÁREK, T.; WOLFMAYR, M. Lossless Hyperspectral Image Compression in Comet Interceptor and Hera Missions with Restricted Bandwith. Remote Sensing, 2025, vol. 17, no. 899, p. 1-18. ISSN: 2072-4292. Detail
VLNAS, M.; MILET, T.; ZEMČÍK, P. Low-error Reconstruction of Directional Functions with Spherical Harmonics. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2025, vol. 31, no. 10, p. 8413-8424. ISSN: 1077-2626. Detail
VYKOPAL, I.; OSTERMANN, S.; ŠIMKO, M. Soft Language Prompts for Language Transfer. Albuquerque: Association for Computational Linguistics, 2025. p. 10294-10313. ISBN: 979-8-8917-6189-6. Detail
VYKOPAL, I.; PIKULIAK, M.; OSTERMANN, S.; ANIKINA, T.; GREGOR, M.; ŠIMKO, M. Large Language Models for Multilingual Previously Fact-Checked Claim Detection. Suzhou, China: Association for Computational Linguistics, 2025. p. 15741-15765. ISBN: 979-8-8917-6335-7. Detail
YAN, B.; HAMED, I.; SHIMIZU, S.; LODAGALA, V.; CHEN, W.; IAKOVENKO, O.; TALAFHA, B.; HUSSEIN, A.; POLOK, A.; CHANG, K.; KLEMENT, D.; ALTHUBAITI, S.; PENG, P.; WIESNER, M.; SOLORIO, T.; ALI, A.; KHUDANPUR, S.; WATANABE, S. CS-FLEURS: A Massively Multilingual and Code-Switched Speech Dataset. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Interspeech. Rotterdam, Nizozemí: ISCA, 2025. p. 743-747. Detail
ZHANG, R.; WEI, J.; LU, X.; ZHANG, L.; JIN, D.; LU, W.; XU, J. Multi-Sinkhorn Teacher Knowledge Aggregation Framework for Adaptive Audio Anti-Spoofing. IEEE Transactions on Audio, Speech, and Language Processing, 2025, no. 33, p. 3850-3865. Detail
ZHANG, R.; WEI, J.; LU, X.; ZHANG, L.; JIN, D.; XU, J.; LU, W. SHDA: Sinkhorn Domain Attention for Cross-Domain Audio Anti-Spoofing. IEEE Transactions on Information Forensics and Security, 2025, no. 20, p. 6474-6489. Detail
ZHANG, Y.; TIAN, B.; ZHANG, L.; DUAN, Z. PartialEdit: Identifying Partial Deepfakes in the Era of Neural Speech Editing. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Interspeech. Rotterdam, Nizozemí: ISCA, 2025. p. 5353-5357. Detail