Ústav počítačové grafiky a multimédií

Autor Název Klíčové slovo Rok Roků Typ výsledku

2025

ANIKINA, T.; VYKOPAL, I.; KULA, S.; CHIKKALA, K.; SKACHKOVA, N.; YANG, J.; SOLOPOVA, V.; SCHMITT, V.; OSTERMANN, S. dfkinit2b at CheckThat! 2025: Leveraging LLMs and Ensemble of Methods for Multilingual Claim Normalization. Madrid: Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2025), 2025. Detail
ANTTI, N.; KOHOUT, T.; KAŠPÁREK, T. The Asteroid Spectral Imager (ASPECT) on the Milani CubeSat. SPACE SCIENCE REVIEWS, 2025, vol. 2025, no. 221, p. 1-27. ISSN: 1572-9672. Detail
BAŘINA, D. Improved verification limit for the convergence of the Collatz conjecture. JOURNAL OF SUPERCOMPUTING, 2025, vol. 81, no. 1, p. 1-14. ISSN: 1573-0484. Detail
FAJČÍK, M.; DOČEKAL, M.; DOLEŽAL, J.; ONDŘEJ, K.; BENEŠ, K.; SMRŽ, P.; POLOK, A.; HRADIŠ, M. BenCzechMark : A Czech-Centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism. Transactions of the Association for Computational Linguistics, 2025, vol. 13, no. 9, p. 1068-1095. Detail
GURGUROV, D.; VYKOPAL, I.; GENABITH, J.; OSTERMANN, S. Small Models, Big Impact: Efficient Corpus and Graph-Based Adaptation of Small Multilingual Language Models for Low-Resource Languages. Vienna: Association for Computational Linguistics, 2025. ISBN: 979-8-89176-254-1. Detail
HAN, J.; LANDINI, F.; ROHDIN, J.; SILNOVA, A.; DIEZ SÁNCHEZ, M.; BURGET, L. Leveraging Self-Supervised Learning for Speaker Diarization. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025. p. 1-5. ISBN: 979-8-3503-6874-1. Detail
HANÁK, J.; NOVÁK, J.; CHUDÝ, P.; BEN-ASHER, J. Cross-Entropy Method for Laser Defense Applications. Journal of Aerospace Information Systems, 2025, vol. 22, no. 1, p. 53-58. ISSN: 2327-3097. Detail
HORI, T.; KOCOUR, M.; HAIDER, A.; MCDERMOTT, E.; ZHUANG, X. Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech Recognition. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025. p. 1-5. ISBN: 979-8-3503-6874-1. Detail
CHLUBNA, T.; MILET, T.; ZEMČÍK, P. How Color Profile Affects the Visual Quality in Light Field Rendering and Novel View Synthesis. MULTIMEDIA TOOLS AND APPLICATIONS, 2025, vol. 84, no. 14, p. 11079-11095. Detail
CHLUBNA, T.; MILET, T.; ZEMČÍK, P. Light Field Video Streaming on GPU. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2025, vol. 2025, no. 138, 12 p. Detail
CHLUBNA, T.; VLNAS, M.; BAŘINA, D.; MILET, T.; ZEMČÍK, P. Focus-aware compression and image quality metric for 3D displays. SIGNAL PROCESSING, 2025, vol. 2026, no. 238, p. 1-14. ISSN: 0165-1684. Detail
CHLUBNA, T.; ZEMČÍK, P. Comparative Survey of Image Compression Methods Across Different Pixel Formats and Bit Depths. Signal Image and Video Processing, 2025, vol. 19, no. 12, 13 p. Detail
KUBÍK, T.; GUIBAULT, F.; ŠPANĚL, M.; LOMBAERT, H. ToothForge: Automatic Dental Shape Generation using Synchronized Spectral Embeddings. Proceedings of Information Processing in Medical Imaging 2025. Kos: 2025. p. 1-14. Detail
LOJDA, J.; STRNADEL, J.; SMRŽ, P.; ŠIMEK, V. Multi-Partner Project: LoLiPoP-IoT - Design and Simulation of Energy-Efficient Devices for the Internet of Things. In 2025 Design, Automation & Test in Europe Conference (DATE) Proceedings. Lyon: Institute of Electrical and Electronics Engineers, 2025. p. 1-7. ISBN: 978-3-9826741-0-0. Detail
PÁLKA, P.; LANDINI, F.; KLEMENT, D.; DIEZ SÁNCHEZ, M.; SILNOVA, A.; DELCROIX, M.; BURGET, L. Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization. Palermo: IEEE Signal Processing Society, 2025. p. 31-35. ISBN: 978-9-46-459362-4. Detail
PENG, J.; ASHIHARA, T.; DELCROIX, M.; OCHIAI, T.; PLCHOT, O.; ARAKI, S.; ČERNOCKÝ, J. TS-SUPERB: A Target Speech Processing Benchmark for Speech Self-Supervised Learning Models. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025. p. 1-5. ISBN: 979-8-3503-6874-1. Detail
PENG, J.; MOŠNER, L.; ZHANG, L.; PLCHOT, O.; STAFYLAKIS, T.; BURGET, L.; ČERNOCKÝ, J. CA-MHFA: A Context-Aware Multi-Head Factorized Attentive Pooling for SSL-Based Speaker Verification. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025. p. 1-5. ISBN: 979-8-3503-6874-1. Detail
POLOK, A.; KLEMENT, D.; WIESNER, M.; KHUDANPUR, S.; ČERNOCKÝ, J.; BURGET, L. Target Speaker ASR with Whisper. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025. p. 1-5. ISBN: 979-8-3503-6874-1. Detail
SKOG, K.; KOHOUT, T.; KAŠPÁREK, T.; WOLFMAYR, M. Lossless Hyperspectral Image Compression in Comet Interceptor and Hera Missions with Restricted Bandwith. Remote Sensing, 2025, vol. 17, no. 899, p. 1-18. ISSN: 2072-4292. Detail
ŠILLING, P.; ŠPANĚL, M. DEMIS: Electron Microscopy Image Stitching using Deep Learning Features and Global Optimisation. Proceedings of the 18th International Joint Conference on Biomedical Engineering Systems and Technologies - BIOIMAGING. Porto: Institute for Systems and Technologies of Information, Control and Communication, 2025. p. 255-256. ISBN: 978-989-758-731-3. Detail
VLNAS, M.; MILET, T.; ZEMČÍK, P. Low-error Reconstruction of Directional Functions with Spherical Harmonics. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2025, vol. 31, no. 10, p. 8413-8424. ISSN: 1077-2626. Detail
VYKOPAL, I.; OSTERMANN, S.; ŠIMKO, M. Soft Language Prompts for Language Transfer. Albuquerque: Association for Computational Linguistics, 2025. p. 10294-10313. ISBN: 979-8-8917-6189-6. Detail

2024

ADAMEC, V.; BERGLOWIEC, P.; SVATOŇ, V.; SCHWARZ, P.; MÜLLER, L. Využití umělé inteligence v systému příjmu tísňových volání v podmínkách České republiky. SPEKTRUM, 2024, roč. 2024, č. 2, s. 3-8. ISSN: 1804-1639. Detail
ALAM, J.; BARAHONA QUIRÓS, S.; BOBOŠ, D.; BURGET, L.; CUMANI, S.; DAHMANE, M.; HAN, J.; HLAVÁČEK, M.; KODOVSKÝ, M.; LANDINI, F.; MOŠNER, L.; PÁLKA, P.; PAVLÍČEK, T.; PENG, J.; PLCHOT, O.; RAJASEKHAR, P.; ROHDIN, J.; SILNOVA, A.; STAFYLAKIS, T.; ZHANG, L. ABC SYSTEM DESCRIPTION FOR NIST SRE 2024. Proceedings of NIST SRE 2024. San Juan: National Institute of Standards and Technology, 2024. p. 1-9. Detail
BENEŠ, K.; KOCOUR, M.; BURGET, L. Hystoc: Obtaining Word Confidences for Fusion of End-To-End ASR Systems. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024. p. 11276-11280. ISBN: 979-8-3503-4485-1. Detail
BHATTACHARJEE, M.; NIGMATULINA, I.; PRASAD, A.; RANGAPPA, P.; MADIKERI, S.; MOTLÍČEK, P.; HELMKE, H.; KLEINERT, M. Contextual Biasing Methods for Improving Rare Word Detection in Automatic Speech Recognition. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024. p. 12652-12656. ISBN: 979-8-3503-4485-1. Detail
BOBÁK, P.; ČMOLÍK, L.; ČADÍK, M. Reinforced Labels: Multi-Agent Deep Reinforcement Learning for Point-Feature Label Placement. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, vol. 30, no. 9, p. 5908-5922. ISSN: 1077-2626. Detail
BURDISSO, S.; RAMIREZ, A.; VILLATORO-TELLO, E.; SÁNCHEZ-VEGA, F.; LÓPEZ-MONROY, P.; MOTLÍČEK, P. DAIC-WOZ: On the Validity of Using the Therapist's prompts in Automatic Depression Detection from Clinical Interviews. Proceedings of the 6th Clinical Natural Language Processing Workshop. Association for Computational Linguistics. Mexico City: Association for Computational Linguistics, 2024. p. 82-90. Detail
ČEGIŇ, J.; PECHER, B.; ŠIMKO, J.; SRBA, I.; BIELIKOVÁ, M. Effects of diversity incentives on sample diversity and downstream model performance in LLM-based text augmentation. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Bangkok: Association for Computational Linguistics, 2024. p. 13148-13171. ISBN: 979-8-8917-6094-3. Detail
DE BENITO GORRON, D.; ŽMOLÍKOVÁ, K.; TORRE TOLEDANO, D. Analysis and interpretation of joint source separation and sound event detection in domestic environments. PLoS One, 2024, vol. 19, no. 7, p. 1-30. ISSN: 1932-6203. Detail
DE LEON MARTINEZ, S. Understanding User Behavior in Carousel Recommendation Systems for Click Modeling and Learning to Rank. Proceedings of the Seventeenth ACM International Conference on Web Search and Data Mining. New York: Association for Computing Machinery, 2024. p. 1139-1141. ISBN: 979-8-4007-0371-3. Detail
DEKEL, S.; KELLER, Y.; ČADÍK, M. Estimating Extreme 3D Image Rotations using Cascaded Attention. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Seattle: IEEE Computer Society, 2024. p. 2588-2598. ISBN: 979-8-3503-5301-3. Detail
ESPUNA, A.; PRASAD, A.; MOTLÍČEK, P.; MADIKERI, S.; SCHUEPBACH, C. Normalising Flows for Speaker and Language Recognition Backend. Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Quebec: International Speech Communication Association, 2024. p. 74-80. Detail
HAN, J.; LANDINI, F.; ROHDIN, J.; DIEZ SÁNCHEZ, M.; BURGET, L.; CAO, Y.; LU, H.; ČERNOCKÝ, J. Diacorrect: Error Correction Back-End for Speaker Diarization. In ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul: IEEE Signal Processing Society, 2024. p. 11181-11185. ISBN: 979-8-3503-4485-1. Detail
HANÁK, J.; NOVÁK, J.; CHUDÝ, P. Cognitive Modeling Approach for Generating Authentic Tactical Agent Behavior. In AIAA/IEEE Digital Avionics Systems Conference - Proceedings. IEEE/AIAA ... Digital Avionics Systems Conference. San Diego: Institute of Electrical and Electronics Engineers, 2024. no. 9, p. 1-15. ISBN: 979-8-3503-4961-0. ISSN: 2155-7195. Detail
HANÁK, J.; NOVÁK, J.; CHUDÝ, P. Tactical Scenario Adaptation for Pilot Training. In AIAA/IEEE Digital Avionics Systems Conference - Proceedings. IEEE/AIAA ... Digital Avionics Systems Conference. San Diego: Institute of Electrical and Electronics Engineers, 2024. no. 9, p. 1-7. ISBN: 979-8-3503-4961-0. ISSN: 2155-7195. Detail
CHLUBNA, T.; MILET, T.; ZEMČÍK, P. Automatic 3D-Display-Friendly Scene Extraction from Video Sequences and Optimal Focusing Distance Identification. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, vol. 83, no. 7, p. 74535-74562. ISSN: 1573-7721. Detail
CHLUBNA, T.; MILET, T.; ZEMČÍK, P. How Capturing Camera Trajectory Distortion Affects User Experience on Looking Glass 3D Display. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, vol. 2024, no. 83, p. 20265-20287. ISSN: 1573-7721. Detail
CHLUBNA, T.; MILET, T.; ZEMČÍK, P. Lightweight All-Focused Light Field Rendering. COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, vol. 244, no. 7, p. 7-8. ISSN: 1077-3142. Detail
CHLUBNA, T.; MILET, T.; ZEMČÍK, P. Out-of-Focus Artifacts Mitigation and Autofocus Methods for 3D Displays. Visual Informatics, 2024, vol. 9, no. 1, p. 31-42. ISSN: 2468-502X. Detail
CHLUBNA, T.; ZEMČÍK, P.; MILET, T. Efficient Random-Access GPU Video Decoding for Light-Field Rendering. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, vol. 2024, no. 102, p. 1-14. ISSN: 1047-3203. Detail
KAPINUS, M.; BERAN, V.; MATERNA, Z.; BAMBUŠEK, D. Augmented Reality Spatial Programming Paradigm Applied to End-User Robot Programming. ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2024, vol. 89, no. 89, p. 1-13. ISSN: 0736-5845. Detail
KAŠPÁREK, T.; CHUDÝ, P. Pulsar Signal Adaptive Surrogate Modeling. Aerospace, 2024, vol. 11, no. 10, p. 1-22. ISSN: 2226-4310. Detail
KIŠŠ, M.; HRADIŠ, M. Self-supervised Pre-training of Text Recognizers. In Barney Smith, E.H., Liwicki, M., Peng, L. (eds) Document Analysis and Recognition - ICDAR 2024. Lecture Notes in Computer Science. Atény: Springer Nature Switzerland AG, 2024. p. 218-235. ISBN: 978-3-031-70545-8. Detail
KLEMENT, D.; DIEZ SÁNCHEZ, M.; LANDINI, F.; BURGET, L.; SILNOVA, A.; DELCROIX, M.; TAWARA, N. Discriminative Training of VBx Diarization. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024. p. 11871-11875. ISBN: 979-8-3503-4485-1. Detail
KRÁL, J.; HRADIŠ, M.; BUŽGA, M.; KUNOVSKÝ, L. Exploring the benefits and challenges of AI-driven large language models in gastroenterology: Think out of the box. BIOMEDICAL PAPERS-OLOMOUC, 2024, vol. 168, no. 4, p. 277-283. ISSN: 1213-8118. Detail
KUBÍK, T.; ŠPANĚL, M. LMVSegRNN and Poseidon3D: Addressing Challenging Teeth Segmentation Cases in 3D Dental Surface Orthodontic Scans. Bioengineering-Basel, 2024, vol. 11, no. 10, p. 1-18. ISSN: 2306-5354. Detail
KUMAR, S.; MADIKERI, S.; NIGMATULINA, I.; VILLATORO-TELLO, E.; MOTLÍČEK, P.; PANDIA, K.; DUBAGUNTA, P.; GANAPATHIRAJU, A. Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul: IEEE Signal Processing Society, 2024. p. 12592-12596. ISBN: 979-8-3503-4485-1. Detail
KUNEŠOVÁ, M.; ZAJÍC, Z.; ŠMÍDL, L.; KARAFIÁT, M. Comparison of wav2vec 2.0 models on three speech processing tasks. International Journal of Speech Technology, 2024, vol. 27, no. 4, p. 847-859. ISSN: 1572-8110. Detail
LANDINI, F.; DIEZ SÁNCHEZ, M.; STAFYLAKIS, T.; BURGET, L. DiaPer: End-to-End Neural Diarization With Perceiver-Based Attractors. IEEE Transactions on Audio Speech and Language Processing, 2024, vol. 32, no. 7, p. 3450-3465. ISSN: 1558-7916. Detail
LEHEČKA, D.; JEBAVÝ, F.; KERSCH, F.; PAVČÍK, F.; JANA, H.; FREMROVÁ, K.; KIŠŠ, M.; LHOTÁK, M.; DVOŘÁKOVÁ, M.; BEŽOVÁ, M.; HRADIŠ, M.; ŽABIČKA, P.; JIROUŠEK, V. Orbis Pictus: Zpřístupnění netextových dat z digitálních knihoven. 2024, roč. 2024, č. 2, s. 22-31. ISSN: 1336-0779. Detail
LOJDA, J.; STRNADEL, J.; SMRŽ, P.; ŠIMEK, V. First Steps Towards Unified Low-Power IoT Design: The "DYNAMIC" Framework. 2024 IEEE East-West Design and Test Symposium, EWDTS 2024 - Proceedings. Yerevan: Institute of Electrical and Electronics Engineers, 2024. p. 1-6. ISBN: 979-8-3315-1576-8. Detail
LOJDA, J.; STRNADEL, J.; ŠIMEK, V.; SMRŽ, P.; HAYES, M.; POPP, R. The LoLiPoP-IoT Project: Long Life Power Platforms for Internet of Things. In Proceedings - 2024 27th Euromicro Conference on Digital System Design, DSD 2024. Paris: Institute of Electrical and Electronics Engineers, 2024. p. 604-611. ISBN: 979-8-3503-8038-5. Detail
MACIEJEWSKI, M.; KLEMENT, D.; HUANG, R.; WIESNER, M.; KHUDANPUR, S. Evaluating the Santa Barbara Corpus: Challenges of the Breadth of Conversational Spoken Language. In Proceedings of Interspeech 2024. Proceedings of Interspeech. Kos: International Speech Communication Association, 2024. no. 9, p. 2155-2160. ISSN: 1990-9772. Detail
MOŠNER, L.; SERIZEL, R.; BURGET, L.; PLCHOT, O.; VINCENT, E.; PENG, J.; ČERNOCKÝ, J. Multi-Channel Extension of Pre-trained Models for Speaker Verification. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Kos: International Speech Communication Association, 2024. no. 9, p. 2135-2139. ISSN: 1990-9772. Detail
MOTLÍČEK, P.; DIKICI, E.; MADIKERI, S.; RANGAPPA, P.; BACKFRIED, G.; ROHDIN, J.; SCHWARZ, P.; KOVÁČ, M.; MALÝ, K.; BOBOŠ, D.; KLAKOW, D.; SERGIDOU, E. ROXSD: The ROXANNE Multimodal and Simulated Dataset for Advancing Criminal Investigations. Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Québec City: International Speech Communication Association, 2024. p. 17-24. Detail
NOVÁK, J.; HANÁK, J.; CHUDÝ, P. Hybrid Modeling Approach for Optimization Based Control of Multirotor Unmanned Aerial Vehicles. In ICAS Proceedings. ICAS Proceedings. Florence: International Council of the Aeronautical Sciences, 2024. no. 10, p. 1-10. ISSN: 2958-4647. Detail
NOVÁK, J.; HANÁK, J.; CHUDÝ, P. Predictive Control Driven Tactical Maneuvering. In ICAS Proceedings. ICAS Proceedings. Florence: International Council of the Aeronautical Sciences, 2024. no. 9, p. 1-12. ISSN: 2958-4647. Detail
NOVÁK, J.; HANÁK, J.; CHUDÝ, P. Reliability-Based Control System Optimization in Uncertain Conditions. In AIAA Aviation Forum and ASCEND, 2024. Las Vegas: American Institute of Aeronautics and Astronautics, 2024. p. 1-15. ISBN: 978-1-62410-716-0. Detail
NOVÁK, J.; CHUDÝ, P. Dynamic Soaring in Uncertain Wind Conditions: Polynomial Chaos Expansion Approach. In Machine Learning, Optimization, and Data Science. Lecture Notes in Computer Science. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Grasmere: Springer Nature Switzerland AG, 2024. no. 14505, p. 104-115. ISBN: 978-3-031-53968-8. ISSN: 0302-9743. Detail
NOVÁK, J.; CHUDÝ, P.; HANÁK, J. Model Predictive Control Driven Aerial Grasping with Soft Operational Constraints. In ICAS Proceedings. ICAS Proceedings. Florence: International Council of the Aeronautical Sciences, 2024. no. 10, p. 1-15. ISSN: 2958-4647. Detail
PECHER, B.; ČEGIŇ, J.; BELANEC, R.; SRBA, I.; ŠIMKO, J.; BIELIKOVÁ, M. Fighting Randomness With Randomness: Mitigating Optimisation Instability of Fine-Tuning Using Ensemble and Noise Regularisation. Findings of the Association for Computational Linguistics: EMNLP 2024. Miami: Association for Computational Linguistics, 2024. p. 11005-11044. ISBN: 979-8-8917-6168-1. Detail
PECHER, B.; SRBA, I.; BIELIKOVÁ, M. On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. Miami: Association for Computational Linguistics, 2024. p. 522-556. ISBN: 979-8-8917-6164-3. Detail
PECHER, B.; SRBA, I.; BIELIKOVÁ, M. A Survey on Stability of Learning with Limited Labelled Data and its Sensitivity to the Effects of Randomness. ACM Computing Surveys, 2024, vol. 57, no. 1, p. 1-40. ISSN: 0360-0300. Detail
PENG, J.; DELCROIX, M.; OCHIAI, T.; ASHIHARA, T.; PLCHOT, O.; ARAKI, S.; ČERNOCKÝ, J. Probing Self-Supervised Learning Models With Target Speech Extraction. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024. p. 535-539. ISBN: 979-8-3503-7451-3. Detail
PENG, J.; DELCROIX, M.; OCHIAI, T.; PLCHOT, O.; ARAKI, S.; ČERNOCKÝ, J. Target Speech Extraction with Pre-Trained Self-Supervised Learning Models. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024. p. 10421-10425. ISBN: 979-8-3503-4485-1. Detail
PEŠÁN, J.; JUŘÍK, V.; KARAFIÁT, M.; ČERNOCKÝ, J. BESST Dataset: A Multimodal Resource for Speech-based Stress Detection and Analysis. In Proceedings of Interspeech 2024. Proceedings of Interspeech. Kos: International Speech Communication Association, 2024. no. 9, p. 1355-1359. ISSN: 1990-9772. Detail
PEŠÁN, J.; JUŘÍK, V.; RŮŽIČKOVÁ, A.; SVOBODA, V.; JANOUŠEK, O.; NĚMCOVÁ, A.; BOJANOVSKÁ, H.; ALDABAGHOVÁ, J.; KYSLÍK, F.; VODIČKOVÁ, K.; SODOMOVÁ, A.; BARTYS, P.; CHUDÝ, P.; ČERNOCKÝ, J. Speech production under stress for machine learning: multimodal dataset of 79 cases and 8 signals. Scientific Data, 2024, vol. 11, no. 1, p. 1-9. ISSN: 2052-4463. Detail
POLOK, A.; KLEMENT, D.; HAN, J.; SEDLÁČEK, Š.; YUSUF, B.; MACIEJEWSKI, M.; WIESNER, M.; BURGET, L. BUT/JHU System Description for CHiME-8 NOTSOFAR-1 Challenge. Proceedings of CHiME 2024 Workshop. Kos Island: International Speech Communication Association, 2024. p. 18-22. Detail
PRASAD, A.; CAROFILIS, A.; VANDERREYDT, G.; KHALIL, D.; MADIKERI, S.; MOTLÍČEK, P.; SCHUEPBACH, C. Fine-Tuning Self-Supervised Models for Language Identification Using Orthonormal Constraint. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024. p. 11921-11925. ISBN: 979-8-3503-4485-1. Detail
PRASAD, A.; MADIKERI, S.; KHALIL, D.; MOTLÍČEK, P.; SCHUEPBACH, C. Speech and Language Recognition with Low-rank Adaptation of Pretrained Models. In Proceedings of Interspeech. Proceedings of Interspeech. Kos Island: International Speech Communication Association, 2024. no. 9, p. 2825-2829. ISSN: 1990-9772. Detail
RANGAPPA, P.; MUSCAT, A.; SANCHEZ-LARA, A.; MOTLÍČEK, P.; ANTONOPOULOU, M.; FOURFOURIS, I.; SKARLATOS, A.; AVGERINOS, N.; TSANGARIS, M.; KOSTKA, K. Detecting Criminal Networks via Non-Content Communication Data Analysis Techniques from the TRACY Project. Proceedings of the15th EAI International Conference on Digital Forensics & Cyber Crime (EAI-ICDF2C24). Dubrovnik: 2024. p. 1-15. Detail
ROHDIN, J.; ZHANG, L.; PLCHOT, O.; STANĚK, V.; MIHOLA, D.; PENG, J.; STAFYLAKIS, T.; BEVERAKI, D.; SILNOVA, A.; BRUKNER, J.; BURGET, L. BUT systems and analyses for the ASVspoof 5 Challenge. Proceedings of ASV spoof 2024 Workshop. Kos Island: International Speech Communication Association, 2024. p. 24-31. Detail
SERGIDOU, E.; YPMA, R.; ROHDIN, J.; WORRING, M.; GERADTS, Z.; BOSMA, W. Fusing linguistic and acoustic information for automated forensic speaker comparison. SCIENCE & JUSTICE, 2024, vol. 64, no. 5, p. 485-497. ISSN: 1355-0306. Detail
STAFYLAKIS, T.; SILNOVA, A.; ROHDIN, J.; PLCHOT, O.; BURGET, L. Challenging margin-based speaker embedding extractors by using the variational information bottleneck. In Proceedings of Interspeech 2024. Proceedings of Interspeech. Kos: International Speech Communication Association, 2024. no. 9, p. 3220-3224. ISSN: 1990-9772. Detail
STRNADEL, J.; LOJDA, J.; SMRŽ, P.; ŠIMEK, V. On SMC-Based Dependability Analysis in LoLiPoP-IoT Project. Steffen, B. (eds) Bridging the Gap Between AI and Reality (AISolA 2024). Lecture Notes in Computer Science. Limenas Hersonissou: Springer Nature Switzerland AG, 2024. p. 420-425. ISBN: 978-3-031-75434-0. ISSN: 0302-9743. Detail
STRNADEL, J.; LOJDA, J.; SMRŽ, P.; ŠIMEK, V. Machine Learning in Context of IoT/Edge Devices and LoLiPoP-IoT Project. In Proceedings of 32nd Austrian Workshop on Microelectronics (Austrochip 2024). Vienna: Institute of Electrical and Electronics Engineers, US, 2024. p. 1-4. ISBN: 979-8-3315-1617-8. Detail
VILLATORO-TELLO, E.; MADIKERI, S.; SHARMA, B.; KHALIL, D.; KUMAR, S.; NIGMATULINA, I.; MOTLÍČEK, P.; GANAPATHIRAJU, A. Probability-Aware Word-Confusion-Network-to-Text Alignment Approach for Intent Classification. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul: IEEE Signal Processing Society, 2024. p. 12617-12621. ISBN: 979-8-3503-4485-1. Detail
VINCENT, J.; KOHOUT, T.; KAŠPÁREK, T. Macroscale Roughness Reveals the Complex History of Asteroids Didymos and Dimorphos. Planetary Science Journal, 2024, vol. 5, no. 10, p. 1-29. ISSN: 2632-3338. Detail
VYKOPAL, I.; PIKULIAK, M.; SRBA, I.; MÓRO, R.; MACKO, D.; BIELIKOVÁ, M. Disinformation Capabilities of Large Language Models. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Bangkok: Association for Computational Linguistics, 2024. p. 14830-14847. ISBN: 979-8-8917-6094-3. Detail
WANG, S.; CHEN, Z.; HAN, B.; WANG, H.; XIANG, X.; ROHDIN, J.; SILNOVA, A.; QIAN, Y.; LI, H. Advancing speaker embedding learning: Wespeaker toolkit for research and production. SPEECH COMMUNICATION, 2024, vol. 162, no. 103104, p. 1-12. ISSN: 0167-6393. Detail
WANNER, L.; ČERNOCKÝ, J.; EGOROVA, E.; KLUSCH, M.; MAVROPOULOS, A. Support of Migrant Reception, Integration, and Social Inclusion by Intelligent Technologies. Information, 2024, vol. 15, no. 11, p. 1-33. ISSN: 2078-2489. Detail
YUSUF, B.; BASKAR, M.; ROSENBERG, A.; RAMABHADRAN, B. Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models. In Proceedings of Interspeech 2024. Proceedings of Interspeech. Kos: International Speech Communication Association, 2024. no. 9, p. 792-796. ISSN: 1990-9772. Detail
YUSUF, B.; ČERNOCKÝ, J.; SARAÇLAR, M. Pretraining End-to-End Keyword Search with Automatically Discovered Acoustic Units. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Kos: International Speech Communication Association, 2024. no. 9, p. 5068-5072. ISSN: 1990-9772. Detail
YUSUF, B.; SARAÇLAR, M. Written Term Detection Improves Spoken Term Detection. IEEE-ACM Transactions on Audio Speech and Language Processing, 2024, vol. 32, no. 06, p. 3213-3223. ISSN: 2329-9290. Detail
ZHANG, L.; STAFYLAKIS, T.; LANDINI, F.; DIEZ SÁNCHEZ, M.; SILNOVA, A.; BURGET, L. Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information?. Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Québec City: International Speech Communication Association, 2024. p. 123-130. Detail
ZHANG, L.; WANG, X.; COOPER, E.; DIEZ SÁNCHEZ, M.; LANDINI, F.; EVANS, N.; YAMAGISHI, J. Spoof Diarization: "What Spoofed When" in Partially Spoofed Audio. In Proceedings of Interspeech 2024. Proceedings of Interspeech. Kos: International Speech Communication Association, 2024. no. 9, p. 502-506. ISSN: 1990-9772. Detail
ZHANG, R.; WEI, J.; LU, X.; LU, W.; JIN, D.; ZHANG, L.; XU, J. Unsupervised Adaptive Speaker Recognition by Coupling-Regularized Optimal Transport. IEEE-ACM Transactions on Audio Speech and Language Processing, 2024, vol. 32, no. 1, p. 3603-3617. ISSN: 2329-9290. Detail