Department of Computer Graphics and Multimedia
2026
- ALGASOV, A.; NEPOVINNYKH, E.; ZOLOTAREV, F.; EEROLA, T.; KÄLVIÄINEN, H.; STEWART, C.; OTARASHVILI, L.; HOLMBERG, J. On Combining Animal Re-Identification Models to Address Small Datasets. International journal of computer vision, 2026, vol. 134, no. 3,
p. 1-18. Detail - CHLUBNA, T. vkCompViz: Universal C++ Library for GPU-Based Experiments. Journal of open source software, 2026, vol. 11, no. 117, 5 p. Detail
- GURGUROV, D.; TRINLEY, K.; VYKOPAL, I.; VAN GENABITH, J.; OSTERMANN, S.; ZAMPARELLI, R. Multilingual Political Views of Large Language Models: Identification and Steering. Mumbai, India: Association for Computational Linguistics, 2026.
p. 279-298. ISBN: 979-8-89176-303-6. Detail - KIŠŠ, M.; HRADIŠ, M.; DVOŘÁKOVÁ, M.; JIROUŠEK, V.; KERSCH, F. AnnoPage Dataset: Dataset of Non-Textual Elements in Documents with Fine-Grained Categorization. In Document Analysis and Recognition – ICDAR 2025 Workshops. Cham: Springer Nature Switzerland, 2026.
p. 50-66. ISBN: 978-3-032-09370-7. Detail - POLOK, A.; KLEMENT, D.; KOCOUR, M.; HAN, J.; LANDINI, F.; YUSUF, B.; WIESNER, M.; KHUDANPUR, S.; ČERNOCKÝ, J.; BURGET, L. DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition. COMPUTER SPEECH AND LANGUAGE, 2026, vol. 95, no. 1,
p. 1-19. Detail - VAŠKO, M.; HEROUT, A.; HRADIŠ, M. Archival Faces: Detection of Faces in Digitized Historical Documents. In Document Analysis and Recognition – ICDAR 2025 Workshops: Wuhan, China, September 20–21, 2025, Proceedings, Part II. Cham: Springer Nature Switzerland, 2026.
p. 17-34. ISBN: 978-3-032-09370-7. Detail
2025
- AKKIRAJU, B.; POTHULA, A.; KESIRAJU, S.; VUPPALA, A. IIITH-BUT system for IWSLT 2025 low-resource Bhojpuri to Hindi speech translation. Proceedings of the 22nd International Conference on Spoken Language Translation (IWSLT 2025). Vienna, Austria: Association for Computational Linguistics, 2025.
p. 333-339. ISBN: 979-8-89176-272-5. Detail - Alexander Polok, Jiangyu Han, Dominik Klement, Samuele Cornell, Jan Černocký, Lukáš Burget. BUT System for the MLC-SLM Challenge. ISCA: ISCA, 2025.
p. 23-27. Detail - ANIKINA, T.; ČEGIŇ, J.; ŠIMKO, J.; OSTERMANN, S. A Rigorous Evaluation of LLM Data Generation Strategies for Low-Resource Languages. Suzhou, China: Association for Computational Linguistics, 2025.
p. 8293-8314. ISBN: 979-8-89176-332-6. Detail - ANIKINA, T.; VYKOPAL, I.; KULA, S.; CHIKKALA, K.; SKACHKOVA, N.; YANG, J.; SOLOPOVA, V.; SCHMITT, V.; OSTERMANN, S. dfkinit2b at CheckThat! 2025: Leveraging LLMs and Ensemble of Methods for Multilingual Claim Normalization. Madrid: Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2025), 2025. Detail
- ANTTI, N.; KOHOUT, T.; KAŠPÁREK, T. The Asteroid Spectral Imager (ASPECT) on the Milani CubeSat. SPACE SCIENCE REVIEWS, 2025, vol. 2025, no. 221,
p. 1-27. ISSN: 1572-9672. Detail - BARAHONA, S.; SILNOVA, A.; MOŠNER, L.; PENG, J.; PLCHOT, O.; ROHDIN, J.; ZHANG, L.; HAN, J.; PALKA, P.; LANDINI, F.; BURGET, L.; STAFYLAKIS, T.; CUMANI, S.; BOBOŠ, D.; HLAVAČEK, M.; KODOVSKY, M.; PAVLIČEK, T. Analysis of ABC Frontend Audio Systems for the NIST-SRE24. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam: International Speech Communication Association, 2025.
p. 5763-5767. Detail - BAŘINA, D. Improved verification limit for the convergence of the Collatz conjecture. JOURNAL OF SUPERCOMPUTING, 2025, vol. 81, no. 1,
p. 1-14. ISSN: 1573-0484. Detail - BELANEC, R.; OSTERMANN, S.; SRBA, I.; BIELIKOVÁ, M. Task Prompt Vectors: Effective Initialization through Multi-Task Soft Prompt Transfer. Springer, Berlin, Heidelberg, 2025.
p. 77-94. ISBN: 978-3-662-72242-8. Detail - BEŇOVÁ, I.; GREGOR, M.; GATT, A. CV-Probes: Studying the interplay of lexical and world knowledge in visually grounded verb understanding. 2025.
p. 4425-4433. ISBN: 1069-7977. Detail - CANCELLIERI, M.; DOČEKAL, M.; PRIDE, D.; GRUENPETER, M.; DOUARD, D.; KNOTH, P. Interoperable verification and dissemination of software assets in repositories using COAR Notify. 2025. Detail
- ČEGIŇ, J.; PECHER, B.; ŠIMKO, J.; SRBA, I.; BIELIKOVÁ, M.; BRUSILOVSKY, P. Use Random Selection for Now: Investigation of Few-Shot Selection Strategies in LLM-based Text Augmentation. Suzhou, China: Association for Computational Linguistics, 2025.
p. 5533-5550. ISBN: 979-8-89176-335-7. Detail - ČEGIŇ, J.; ŠIMKO, J. LLMs vs Established Text Augmentation Techniques for Classification: When do the Benefits Outweight the Costs?. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). Albuquerque, New Mexico: Association for Computational Linguistics, 2025.
p. 10476-10496. ISBN: 979-8-8917-6189-6. Detail - CHEN, X.; LIN, I.; ZHANG, L.; DU, J.; WU, H.; LEE, H.; JANG, J. Codec-Based Deepfake Source Tracing via Neural Audio Codec Taxonomy. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam, Nizozemí: International Speech Communication Association, 2025.
p. 1538-1542. Detail - CHEN, X.; LU, W.; ZHANG, R.; XU, J.; LU, X.; ZHANG, L.; WEI, J. Continual Unsupervised Domain Adaptation for Audio Deepfake Detection. In Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing. Hyderabad, Indická republika: Institute of Electrical and Electronics Engineers Inc., 2025.
p. 1-5. ISBN: 979-8-3503-6874-1. Detail - CHIKKALA, K.; ANIKINA, T.; SKACHKOVA, N.; VYKOPAL, I.; AGERRI, R.; GENABITH, J. Automatic Fact-checking in English and Telugu. Shoumen, Bulgaria: INCOMA Ltd., 2025.
p. 140-151. Detail - CHLUBNA, T.; MILET, T.; ZEMČÍK, P. How Color Profile Affects the Visual Quality in Light Field Rendering and Novel View Synthesis. MULTIMEDIA TOOLS AND APPLICATIONS, 2025, vol. 84, no. 14,
p. 11079-11095. Detail - CHLUBNA, T.; MILET, T.; ZEMČÍK, P. Light Field Video Streaming on GPU. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2025, vol. 2025, no. 138, 12 p. Detail
- CHLUBNA, T.; VLNAS, M.; BAŘINA, D.; MILET, T.; ZEMČÍK, P. Focus-aware compression and image quality metric for 3D displays. SIGNAL PROCESSING, 2025, vol. 2026, no. 238,
p. 1-14. ISSN: 0165-1684. Detail - CHLUBNA, T.; VLNAS, M.; MILET, T.; ZEMČÍK, P. Survey of FOSS 3D/2D Graphics Software Blender Usage in Science, Academia, and Industry. The visual computer, 2025, vol. 42, no. 1,
p. 1-32. Detail - CHLUBNA, T.; ZEMČÍK, P. Comparative Survey of Image Compression Methods Across Different Pixel Formats and Bit Depths. Signal Image and Video Processing, 2025, vol. 19, no. 12, 13 p. Detail
- CUMANI, S.; SILNOVA, A.; BARAHONA, S.; MOŠNER, L.; PLCHOT, O.; ROHDIN, J. Analysis of the ABC classification backends for NIST SRE24. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam: International Speech Communication Association, 2025.
p. 3978-3982. Detail - DOHNAL, F.; ZEMAN, T.; BARTA, J.; PUPÍKOVÁ, J.; KINCL, P.; HUBÁČEK, M.; ŠTOLLER, J.; KLÍMA, O.; BAŘINA, D.; KUDLÁK, A.; RAK, J.; HUDYMA, N.; PAULUS, F. Ukrytí obyvatelstva před nebezpečím. Brno: Univerzita obrany, 2025. 119 s. ISBN: 978-80-7609-023-1. Detail
- DRAHY, V.; MARIK, R.; KÄLVIÄINEN, H. Non-stationary Signal Analysis: Detrending and Anomaly Detection. In Lecture Notes in Computer Science. Lecture Notes in Computer Science. CHAM: Springer Nature, 2025.
p. 45-59. ISBN: 978-3-031-95910-3. Detail - FAJČÍK, M.; DOČEKAL, M.; DOLEŽAL, J.; ONDŘEJ, K.; BENEŠ, K.; SMRŽ, P.; POLOK, A.; HRADIŠ, M. BenCzechMark : A Czech-Centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism. Transactions of the Association for Computational Linguistics, 2025, vol. 13, no. 9,
p. 1068-1095. Detail - GAO, R.; LIU, X.; HU, Z.; XING, B.; XIA, B.; YU, Z.; KÄLVIÄINEN, H. FSBench: A Figure Skating Benchmark for Advancing Artistic Sports Understanding. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2025.
p. 13595-13605. Detail - HAN, J.; LANDINI, F.; ROHDIN, J.; SILNOVA, A.; DIEZ SÁNCHEZ, M.; BURGET, L. Leveraging Self-Supervised Learning for Speaker Diarization. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025.
p. 1-5. ISBN: 979-8-3503-6874-1. Detail - HAN, J.; LANDINI, F.; ROHDIN, J.; SILNOVA, A.; DIEZ, M.; ČERNOCKÝ, J.; BURGET, L. Fine-tune Before Structured Pruning: Towards Compact and Accurate Self-Supervised Models for Speaker Diarization. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam, The Netherlands: International Speech Communication Association, 2025.
p. 1583-1587. Detail - HANÁK, J.; NOVÁK, J.; CHUDÝ, P.; BEN-ASHER, J. Cross-Entropy Method for Laser Defense Applications. Journal of Aerospace Information Systems, 2025, vol. 22, no. 1,
p. 53-58. ISSN: 2327-3097. Detail - HEGDE, P.; KESIRAJU, S.; ŠVEC, J.; SEDLÁČEK, Š.; YUSUF, B.; PLCHOT, O.; DEEPAK, K.; ČERNOCKÝ, J. Factors affecting the in-context learning abilities of LLMs for dialogue state tracking. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam, The Netherlands: International Speech Communication Association, 2025.
p. 4818-4822. Detail - HORI, T.; KOCOUR, M.; HAIDER, A.; MCDERMOTT, E.; ZHUANG, X. Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech Recognition. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025.
p. 1-5. ISBN: 979-8-3503-6874-1. Detail - Ivana Beňová, Jana Košecká, Michal Gregor, Martin Tamajka, Marcel Veselý, Marián Šimko. Beyond Image-Text Matching: Verb Understanding in Multimodal Transformers Using Guided Masking. In SOFSEM 2025: Theory and Practice of Computer Science. Lecture Notes in Computer Science. CHAM: Springer Nature, 2025.
p. 80-93. ISBN: 978-3-031-82669-6. Detail - KAREINEN, J.; EEROLA, T.; KRAFT, K.; LENSU, L.; SUIKKANEN, S.; KÄLVIÄINEN, H. Self-Supervised Pretraining for Fine-Grained Plankton Recognition. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. IEEE Computer Society, 2025.
p. 2122-2132. ISBN: 9798331599942. Detail - KAREINEN, J.; SKYTTA, A.; EEROLA, T.; KRAFT, K.; LENSU, L.; SUIKKANEN, S.; LEHTINIEMI, M.; KÄLVIÄINEN, H. Open-Set Plankton Recognition. In Lecture Notes in Computer Science. Lecture Notes in Computer Science. CHAM: Springer Nature, 2025.
p. 168-184. ISBN: 978-3-031-91671-7. Detail - KHURANA, S.; KLEMENT, D.; LAURENT, A.; BOBOS, D.; NOVOSAD, J.; GAZDIK, P.; ZHANG, E.; HUANG, Z.; HUSSEIN, A.; MARXER, R.; MASUYAMA, Y.; AIHARA, R.; HORI, C.; GERMAIN, F.; WICHERN, G.; LE ROUX, J. Factorized RVQ-GAN For Disentangled Speech Tokenization. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam, The Netherlands: International Speech Communication Association, 2025.
p. 3514-3518. Detail - KIŠŠ, M.; HRADIŠ, M. Masked Self-Supervised Pre-Training for Text Recognition Transformers on Large-Scale Datasets. In Document Analysis and Recognition – ICDAR 2025 Workshops. Cham: Springer Nature Switzerland, 2025.
p. 53-70. ISBN: 978-3-032-09367-7. Detail - KOHÚT, J.; DOČEKAL, M.; HRADIŠ, M.; VAŠKO, M. BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction. In Document Analysis and Recognition – ICDAR 2025. Cham: Springer Nature Switzerland, 2025.
p. 287-304. ISBN: 978-3-032-04623-9. Detail - KOHÚT, J.; HRADIŠ, M.;. Practical Fine-Tuning of Autoregressive Models on Limited Handwritten Texts. Document Analysis and Recognition – ICDAR 2025. Cham: Springer Nature Switzerland, 2025.
p. 22-39. ISBN: 978-3-032-04629-1. Detail - KOSTELNÍK, M.; HRADIŠ, M.; BENEŠ, K. TextBite: A Historical Czech Document Dataset for Logical Page Segmentation. In Document Analysis and Recognition – ICDAR 2025 Workshops. Cham: Springer Nature Switzerland, 2025.
p. 124-140. ISBN: 978-3-032-09367-7. Detail - LI, D.; XING, B.; LIU, X.; XIA, B.; WEN, B.; KÄLVIÄINEN, H. DEEMO: De-identity Multimodal Emotion Recognition and Reasoning. MM '25: Proceedings of the 33rd ACM International Conference on Multimedia. New York, NY, USA: ACM, 2025.
p. 5707-5716. ISBN: 979-8-4007-2035-2. Detail - LI, J.; MAK, M.; ROHDIN, J.; LEE, K.; HERMANSKY, H. Bayesian Learning for Domain-Invariant Speaker Verification and Anti-Spoofing. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam: International Speech Communication Association, 2025.
p. 1123-1127. Detail - LI, S.; WANG, S.; HAN, J.; ZHANG, K.; WANG, W.; LI, H. REAL-T: Real Conversational Mixtures for Target Speaker Extraction. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam, The Netherlands: International Speech Communication Association, 2025.
p. 1923-1927. Detail - LOJDA, J.; JOYCE, D.; SMRŽ, P.; KATHURIA, S.; STRNADEL, J.; QUINN, C.; ŠIMEK, V.; STAROŇ, P. Portable Simulation Models for Energy Aspects of IoT Devices in the LoLiPoP-IoT Project. 2025 28th Euromicro Conference on Digital System Design (DSD). Salerno: IEEE Computer Society, 2025.
p. 368-375. ISBN: 979-8-3315-8499-3. Detail - LOJDA, J.; STRNADEL, J.; SMRŽ, P.; ŠIMEK, V. Multi-Partner Project: LoLiPoP-IoT - Design and Simulation of Energy-Efficient Devices for the Internet of Things. In 2025 Design, Automation & Test in Europe Conference (DATE) Proceedings. Lyon: Institute of Electrical and Electronics Engineers, 2025.
p. 1-7. ISBN: 978-3-9826741-0-0. Detail - LUONG, H.; LI, H.; ZHANG, L.; LEE, K.; CHNG, E. LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation. In Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing. Hyderabad, Indická republika: Institute of Electrical and Electronics Engineers Inc., 2025.
p. 1-5. ISBN: 979-8-3503-6874-1. Detail - Michal Rozsíval, Petr Matoušek, Jaromír Kotala. Poster: Multi-Agent LLM System for Cisco Router Configuration. In 2025 23rd International Symposium on Network Computing and Applications (NCA). Lisbon, Portugal: IEEE, 2025.
p. 306-307. ISBN: 979-8-3315-7842-8. Detail - NOVÁK, J.; CHUDÝ, P.; HANÁK, J. Weight-varying Model Predictive Control for Coupled Cyber-Physical Systems: Aerial Grasping Study. In Machine Learning, Optimization, and Data Science. Lecture Notes in Computer Science. Castiglione della Pescaia: Springer Nature Switzerland AG, 2025.
p. 13-27. ISBN: 978-3-031-82481-4. Detail - PÁLKA, P.; LANDINI, F.; KLEMENT, D.; DIEZ SÁNCHEZ, M.; SILNOVA, A.; BURGET, L.; DELCROIX, M. Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization. Palermo: IEEE Signal Processing Society, 2025.
p. 31-35. ISBN: 978-9-46-459362-4. Detail - PECHER, B.; SRBA, I.; BIELIKOVÁ, M. A Survey on Stability of Learning with Limited Labelled Data and its Sensitivity to the Effects of Randomness. ACM Computing Surveys, 2025, vol. 57, no. 1,
p. 1-40. Detail - PENG, J.; ASHIHARA, T.; DELCROIX, M.; OCHIAI, T.; PLCHOT, O.; ARAKI, S.; ČERNOCKÝ, J. TS-SUPERB: A Target Speech Processing Benchmark for Speech Self-Supervised Learning Models. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025.
p. 1-5. ISBN: 979-8-3503-6874-1. Detail - PENG, J.; MOŠNER, L.; ZHANG, L.; PLCHOT, O.; STAFYLAKIS, T.; BURGET, L.; ČERNOCKÝ, J. CA-MHFA: A Context-Aware Multi-Head Factorized Attentive Pooling for SSL-Based Speaker Verification. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025.
p. 1-5. ISBN: 979-8-3503-6874-1. Detail - POLOK, A.; KLEMENT, D.; WIESNER, M.; KHUDANPUR, S.; ČERNOCKÝ, J.; BURGET, L. Target Speaker ASR with Whisper. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025.
p. 1-5. ISBN: 979-8-3503-6874-1. Detail - POTHULA, A.; AKKIRAJU, B.; BANDARUPALLI, S.; D, C.; KESIRAJU, S.; VUPPALA, A. End-to-End Speech Translation for Low-Resource Languages Using Weakly Labeled Data. In Interspeech 2025. Interspeech. Rotterdam: ISCA, 2025.
p. 41-45. Detail - SEBUYOYA, R.; SEVCIKOVA, S.; YUSUF, B.; BARTOSIK, M. Integrating isothermal amplification techniques and LNA-based AI-assisted electrochemical bioassay for analysis of KRAS G12V point mutation. TALANTA, 2025, vol. 127709, no. 288,
p. 1-10. Detail - SEDLÁČEK, Š.; YUSUF, B.; ŠVEC, J.; HEGDE, P.; KESIRAJU, S.; PLCHOT, O.; ČERNOCKÝ, J. Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam, The Netherlands: International Speech Communication Association, 2025.
p. 1748-1752. Detail - ŠILLING, P.; ŠPANĚL, M. DEMIS: Electron Microscopy Image Stitching using Deep Learning Features and Global Optimisation. Proceedings of the 18th International Joint Conference on Biomedical Engineering Systems and Technologies - BIOIMAGING. Porto: Institute for Systems and Technologies of Information, Control and Communication, 2025.
p. 255-256. ISBN: 978-989-758-731-3. Detail - ŠIMEČKOVÁ, M.; KARAFIÁT, M.; PLCHOT, O. Using machine learning for automatic dialect detection. New methods in Czech dialectology. In Slovanské dialek ty v době dig itál ních technologií. Nářeční prameny a jejich současné zpracování. Praha: Slovanský ústav AV ČR, 2025.
p. 297-307. ISBN: 978-80-86420-99-8. Detail - SKOG, K.; KOHOUT, T.; KAŠPÁREK, T.; WOLFMAYR, M. Lossless Hyperspectral Image Compression in Comet Interceptor and Hera Missions with Restricted Bandwith. Remote Sensing, 2025, vol. 17, no. 899,
p. 1-18. ISSN: 2072-4292. Detail - VAŠKO, M.; HEROUT, A. LossFIQA: A Shortcut Solution to Image Quality Assessment Using Loss for Faces and Beyond. IEEE Access, 2025, vol. 13, no. 7,
p. 126915-126924. Detail - VLNAS, M.; MILET, T.; ZEMČÍK, P. Low-error Reconstruction of Directional Functions with Spherical Harmonics. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2025, vol. 31, no. 10,
p. 8413-8424. ISSN: 1077-2626. Detail - VYKOPAL, I.; PIKULIAK, M.; OSTERMANN, S.; ANIKINA, T.; GREGOR, M.; ŠIMKO, M. Large Language Models for Multilingual Previously Fact-Checked Claim Detection. Suzhou, China: Association for Computational Linguistics, 2025.
p. 15741-15765. ISBN: 979-8-8917-6335-7. Detail - XING, B.; YUAN, K.; YU, Z.; LIU, X.; KÄLVIÄINEN, H. AU-TTT: Vision Test-Time Training model for Facial Action Unit Detection. In Proceedings IEEE International Conference on Multimedia and Expo. Proceedings (IEEE International Conference on Multimedia and Expo). IEEE Computer Society, 2025. 11 p. ISBN: 9798331594954. Detail
- YAN, B.; HAMED, I.; SHIMIZU, S.; LODAGALA, V.; CHEN, W.; IAKOVENKO, O.; TALAFHA, B.; HUSSEIN, A.; POLOK, A.; CHANG, K.; KLEMENT, D.; ALTHUBAITI, S.; PENG, P.; WIESNER, M.; SOLORIO, T.; ALI, A.; KHUDANPUR, S.; WATANABE, S. CS-FLEURS: A Massively Multilingual and Code-Switched Speech Dataset. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Interspeech. Rotterdam, Nizozemí: ISCA, 2025.
p. 743-747. Detail - YU, Z.; LIU, X.; DAMER, N.; FAN, D.; SHI, J.; GUO, X.; LIN, X.; WEN, B.; KONG, A.; KÄLVIÄINEN, H.; SCHULLER, B.; CAO, X. SVC 2025 Chairs’ Welcome. Svc 2025 Proceedings of the 1st International Workshop and Challenge on Subtle Visual Computing Co Located with mm 2025. Association for Computing Machinery, Inc, 2025. ISBN: 9798400718373. Detail
- ZHANG, R.; WEI, J.; LU, X.; ZHANG, L.; JIN, D.; LU, W.; XU, J. Multi-Sinkhorn Teacher Knowledge Aggregation Framework for Adaptive Audio Anti-Spoofing. IEEE Transactions on Audio, Speech, and Language Processing, 2025, no. 33,
p. 3850-3865. Detail - ZHANG, R.; WEI, J.; LU, X.; ZHANG, L.; JIN, D.; XU, J.; LU, W. SHDA: Sinkhorn Domain Attention for Cross-Domain Audio Anti-Spoofing. IEEE Transactions on Information Forensics and Security, 2025, no. 20,
p. 6474-6489. Detail - ZHANG, Y.; TIAN, B.; ZHANG, L.; DUAN, Z. PartialEdit: Identifying Partial Deepfakes in the Era of Neural Speech Editing. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Interspeech. Rotterdam, Nizozemí: ISCA, 2025.
p. 5353-5357. Detail