Department of Computer Graphics and Multimedia
2025
- LOJDA Jakub, STRNADEL Josef, SMRŽ Pavel a ŠIMEK Václav. Multi-Partner Project: LoLiPoP-IoT - Design and Simulation of Energy-Efficient Devices for the Internet of Things. In: Lyon: Institute of Electrical and Electronics Engineers, 2025, s. 7. Detail
2024
- PECHER Branislav, SRBA Ivan a BIELIKOVÁ Mária. A Survey on Stability of Learning with Limited Labelled Data and its Sensitivity to the Effects of Randomness. ACM Computing Surveys, roč. 57, č. 1, 2024, s. 1-40. ISSN 0360-0300. Detail
- ALAM Jahangir, BARAHONA Quirós Sara, BOBOŠ Dominik, BURGET Lukáš, CUMANI Sandro, DAHMANE Mohamed, HAN Jiangyu, HLAVÁČEK Miroslav, KODOVSKÝ Martin, LANDINI Federico Nicolás, MOŠNER Ladislav, PÁLKA Petr, PAVLÍČEK Tomáš, PENG Junyi, PLCHOT Oldřich, RAJASEKHAR Gnana Praveen, ROHDIN Johan A., SILNOVA Anna, STAFYLAKIS Themos a ZHANG Lin. ABC System Description for NIST SRE 2024. In: Proceedings of NIST SRE 2024. San Juan: National Institute of Standards and Technology, 2024, s. 1-9. Detail
- WANG Shuai, CHEN Zhengyang, HAN Bing, WANG Hongji, XIANG Xu, ROHDIN Johan A., SILNOVA Anna, QIAN Yanmin a LI Haizhou a kol. Advancing speaker embedding learning: Wespeaker toolkit for research and production. Speech Communication, roč. 162, č. 103104, 2024, s. 1-12. ISSN 0167-6393. Detail
- KAPINUS Michal, BERAN Vítězslav, MATERNA Zdeněk a BAMBUŠEK Daniel. Augmented Reality Spatial Programming Paradigm Applied to End-User Robot Programming. Robotics and Computer-Integrated Manufacturing, roč. 89, č. 89, 2024, s. 1-13. ISSN 0736-5845. Detail
- CHLUBNA Tomáš, MILET Tomáš a ZEMČÍK Pavel. Automatic 3D-Display-Friendly Scene Extraction from Video Sequences and Optimal Focusing Distance Identification. Multimedia Tools and Applications, roč. 83, č. 7, 2024, s. 1-29. ISSN 1573-7721. Detail
- PEŠÁN Jan, JUŘÍK Vojtěch, KARAFIÁT Martin a ČERNOCKÝ Jan. BESST Dataset: A Multimodal Resource for Speech-based Stress Detection and Analysis. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, s. 1355-1359. ISSN 1990-9772. Detail
- ROHDIN Johan A., ZHANG Lin, PLCHOT Oldřich, STANĚK Vojtěch, MIHOLA David, PENG Junyi, STAFYLAKIS Themos, BEVERAKI Dmitriy, SILNOVA Anna, BRUKNER Jan a BURGET Lukáš. BUT systems and analyses for the ASVspoof 5 Challenge. In: Proceedings of ASVspoof 2024 Workshop. Kos Island: International Speech Communication Association, 2024, s. 24-31. Detail
- POLOK Alexander, KLEMENT Dominik, HAN Jiangyu, SEDLÁČEK Šimon, YUSUF Bolaji, MACIEJEWSKI Matthew, WIESNER Matthew a BURGET Lukáš. BUT/JHU System Description for CHiME-8 NOTSOFAR-1 Challenge. In: Proceedings of CHiME 2024 Workshop. Kos Island: International Speech Communication Association, 2024, s. 18-22. Detail
- HANÁK Jiří, NOVÁK Jiří a CHUDÝ Peter. Cognitive Modeling Approach for Generating Authentic Tactical Agent Behavior. In: AIAA/IEEE Digital Avionics Systems Conference - Proceedings. San Diego: Institute of Electrical and Electronics Engineers, 2024, s. 1-15. ISBN 979-8-3503-4961-0. ISSN 2155-7195. Detail
- KUNEŠOVÁ Marie, ZAJÍC Zbyněk, ŠMÍDL Luboš a KARAFIÁT Martin. Comparison of wav2vec 2.0 models on three speech processing tasks. International Journal of Speech Technology, roč. 27, č. 4, 2024, s. 847-859. ISSN 1572-8110. Detail
- BHATTACHARJEE Mrinmoy, NIGMATULINA Iuliia, PRASAD Amrutha, RANGAPPA Pradeep, MADIKERI Srikanth, MOTLÍČEK Petr, HELMKE Hartmut a KLEINERT Matthias. Contextual Biasing Methods for Improving Rare Word Detection in Automatic Speech Recognition. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, s. 12652-12656. ISBN 979-8-3503-4485-1. Detail
- HANÁK Jiří, NOVÁK Jiří, CHUDÝ Peter a BEN-ASHER Joseph Z. Cross-Entropy Method for Laser Defense Applications. Journal of Aerospace Information Systems, roč. 22, č. 1, 2024, s. 53-58. ISSN 2327-3097. Detail
- HAN Jiangyu, LANDINI Federico Nicolás, ROHDIN Johan A., DIEZ Sánchez Mireia, BURGET Lukáš, CAO Yuhang, LU Heng a ČERNOCKÝ Jan. Diacorrect: Error Correction Back-End for Speaker Diarization. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul: IEEE Signal Processing Society, 2024, s. 11181-11185. ISBN 979-8-3503-4485-1. Detail
- LANDINI Federico Nicolás, DIEZ Sánchez Mireia, STAFYLAKIS Themos a BURGET Lukáš. DiaPer: End-to-End Neural Diarization With Perceiver-Based Attractors. IEEE Transactions on Audio, Speech, and Language Processing, roč. 32, č. 7, 2024, s. 3450-3465. ISSN 1558-7916. Detail
- KLEMENT Dominik, DIEZ Sánchez Mireia, LANDINI Federico Nicolás, BURGET Lukáš, SILNOVA Anna, DELCROIX Marc a TAWARA Naohiro. Discriminative Training of VBx Diarization. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, s. 11871-11875. ISBN 979-8-3503-4485-1. Detail
- VYKOPAL Ivan, PIKULIAK Matúš, SRBA Ivan, MÓRO Róbert, MACKO Dominik a BIELIKOVÁ Mária. Disinformation Capabilities of Large Language Models. In: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Bangkok: Association for Computational Linguistics, 2024, s. 14830-14847. ISBN 979-8-8917-6094-3. Detail
- ZHANG Lin, STAFYLAKIS Themos, LANDINI Federico Nicolás, DIEZ Sánchez Mireia, SILNOVA Anna a BURGET Lukáš. Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information? In: Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Québec City: International Speech Communication Association, 2024, s. 123-130. Detail
- NOVÁK Jiří a CHUDÝ Peter. Dynamic Soaring in Uncertain Wind Conditions: Polynomial Chaos Expansion Approach. In: Machine Learning, Optimization, and Data Science. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Grasmere: Springer Nature Switzerland AG, 2024, s. 104-115. ISBN 978-3-031-53968-8. ISSN 0302-9743. Detail
- ČEGIŇ Ján, PECHER Branislav, ŠIMKO Jakub, SRBA Ivan a BIELIKOVÁ Mária. Effects of diversity incentives on sample diversity and downstream model performance in LLM-based text augmentation. In: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Bangkok: Association for Computational Linguistics, 2024, s. 13148-13171. ISBN 979-8-8917-6094-3. Detail
- CHLUBNA Tomáš, ZEMČÍK Pavel a MILET Tomáš. Efficient Random-Access GPU Video Decoding for Light-Field Rendering. Journal of Visual Communication and Image Representation, roč. 102, 2024, s. 1-14. ISSN 1047-3203. Detail
- DEKEL Shay, KELLER Yosi a ČADÍK Martin. Estimating Extreme 3D Image Rotations using Cascaded Attention. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Seattle: IEEE Computer Society, 2024, s. 2588-2598. ISBN 979-8-3503-5301-3. Detail
- PECHER Branislav, ČEGIŇ Ján, BELANEC Róbert, SRBA Ivan, ŠIMKO Jakub a BIELIKOVÁ Mária. Fighting Randomness With Randomness: Mitigating Optimisation Instability of Fine-Tuning Using Ensemble and Noise Regularisation. In: Findings of the Association for Computational Linguistics: EMNLP 2024. Miami: Association for Computational Linguistics, 2024, s. 11005-11044. ISBN 979-8-8917-6168-1. Detail
- PRASAD Amrutha, CAROFILIS Andrés, VANDERREYDT Geoffroy, KHALIL Driss, MADIKERI Srikanth, MOTLÍČEK Petr a SCHUEPBACH Christof. Fine-Tuning Self-Supervised Models for Language Identification Using Orthonormal Constraint. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, s. 11921-11925. ISBN 979-8-3503-4485-1. Detail
- LOJDA Jakub, STRNADEL Josef, SMRŽ Pavel a ŠIMEK Václav. First Steps Towards Unified Low-Power IoT Design: The "DYNAMIC" Framework. In: 2024 IEEE East-West Design and Test Symposium, EWDTS 2024 - Proceedings. Yerevan: Institute of Electrical and Electronics Engineers, 2024, s. 1-6. ISBN 979-8-3315-1576-8. Detail
- CHLUBNA Tomáš, MILET Tomáš a ZEMČÍK Pavel. How Capturing Camera Trajectory Distortion Affects User Experience on Looking Glass 3D Display. Multimedia Tools and Applications, roč. 83, 2024, s. 20265-20287. ISSN 1573-7721. Detail
- NOVÁK Jiří, HANÁK Jiří a CHUDÝ Peter. Hybrid Modeling Approach for Optimization Based Control of Multirotor Unmanned Aerial Vehicles. In: ICAS Proceedings. Florence: International Council of the Aeronautical Sciences, 2024, s. 1-10. ISSN 2958-4647. Detail
- BENEŠ Karel, KOCOUR Martin a BURGET Lukáš. Hystoc: Obtaining Word Confidences for Fusion of End-To-End ASR Systems. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, s. 11276-11280. ISBN 979-8-3503-4485-1. Detail
- STAFYLAKIS Themos, SILNOVA Anna, ROHDIN Johan A., PLCHOT Oldřich a BURGET Lukáš. Challenging margin-based speaker embedding extractors by using the variational information bottleneck. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, s. 3220-3224. ISSN 1990-9772. Detail
- ČIEF Matej. Learning Action Embeddings for Off-Policy Evaluation. In: ECIR 2024: Advances in Information Retrieval. Glasgow: Springer Nature Switzerland AG, 2024, s. 108-122. Detail
- CHLUBNA Tomáš, MILET Tomáš a ZEMČÍK Pavel. Lightweight All-Focused Light Field Rendering. Computer Vision and Image Understanding, roč. 244, č. 7, 2024, s. 7-8. ISSN 1077-3142. Detail
- KUBÍK Tibor a ŠPANĚL Michal. LMVSegRNN and Poseidon3D: Addressing Challenging Teeth Segmentation Cases in 3D Dental Surface Orthodontic Scans. Bioengineering, roč. 11, č. 10, 2024, s. 1-18. ISSN 2306-5354. Detail
- STRNADEL Josef, LOJDA Jakub, SMRŽ Pavel a ŠIMEK Václav. Machine Learning in Context of IoT/Edge Devices and LoLiPoP-IoT Project. In: Proceedings of 32nd Austrian Workshop on Microelectronics (Austrochip 2024). Vienna: Institute of Electrical and Electronics Engineers, 2024, s. 1-4. ISBN 979-8-3315-1617-8. Detail
- NOVÁK Jiří, CHUDÝ Peter a HANÁK Jiří. Model Predictive Control Driven Aerial Grasping with Soft Operational Constraints. In: ICAS Proceedings. Florence: International Council of the Aeronautical Sciences, 2024, s. 1-15. ISSN 2958-4647. Detail
- MOŠNER Ladislav, SERIZEL Romain, BURGET Lukáš, PLCHOT Oldřich, VINCENT Emmanuel, PENG Junyi a ČERNOCKÝ Jan. Multi-Channel Extension of Pre-trained Models for Speaker Verification. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Kos: International Speech Communication Association, 2024, s. 2135-2139. ISSN 1990-9772. Detail
- ESPUNA Fontcuberta Aleix, PRASAD Amrutha, MOTLÍČEK Petr, MADIKERI Srikanth a SCHUEPBACH Christof. Normalising Flows for Speaker and Language Recognition Backend. In: Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Québec City: International Speech Communication Association, 2024, s. 74-80. Detail
- PECHER Branislav, SRBA Ivan a BIELIKOVÁ Mária. On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices. In: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. Miami: Association for Computational Linguistics, 2024, s. 522-556. ISBN 979-8-8917-6164-3. Detail
- STRNADEL Josef, LOJDA Jakub, SMRŽ Pavel a ŠIMEK Václav. On SMC-Based Dependability Analysis in LoLiPoP-IoT Project. In: Steffen, B. (eds) Bridging the Gap Between AI and Reality (AISolA 2024). Lecture Notes in Computer Science, roč. 15217. Limenas Hersonissou: Springer Nature Switzerland AG, 2024, s. 420-445. ISBN 978-3-031-75434-0. ISSN 0302-9743. Detail
- ČIEF Matej a KOMPAN Michal. Pessimistic Off-Policy Optimization for Learning to Rank. In: 27th European Conference on Artificial Intelligence. Frontiers in Artificial Intelligence and Applications. Santiago de Compostela, 2024, s. 1896-1903. ISBN 978-1-64368-548-9. Detail
- NOVÁK Jiří, HANÁK Jiří a CHUDÝ Peter. Predictive Control Driven Tactical Maneuvering. In: ICAS Proceedings. Florence: International Council of the Aeronautical Sciences, 2024, s. 1-12. ISSN 2958-4647. Detail
- YUSUF Bolaji, ČERNOCKÝ Jan a SARAÇLAR Murat. Pretraining End-to-End Keyword Search with Automatically Discovered Acoustic Units. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Kos: International Speech Communication Association, 2024, s. 5068-5072. ISSN 1990-9772. Detail
- PENG Junyi, DELCROIX Marc, OCHIAI Tsubasa, ASHIHARA Takanori, PLCHOT Oldřich, ARAKI Shoko a ČERNOCKÝ Jan. Probing Self-Supervised Learning Models With Target Speech Extraction. In: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings. Seoul: IEEE Signal Processing Society, 2024, s. 535-539. ISBN 979-8-3503-7451-3. Detail
- KAŠPÁREK Tomáš a CHUDÝ Peter. Pulsar Signal Adaptive Surrogate Modeling. Aerospace, roč. 11, č. 10, 2024, s. 1-22. ISSN 2226-4310. Detail
- BOBÁK Petr, ČMOLÍK Ladislav a ČADÍK Martin. Reinforced Labels: Multi-Agent Deep Reinforcement Learning for Point-Feature Label Placement. IEEE Transactions on Visualization and Computer Graphics, roč. 30, č. 9, 2024, s. 5908-5922. ISSN 1077-2626. Detail
- NOVÁK Jiří, HANÁK Jiří a CHUDÝ Peter. Reliability-Based Control System Optimization in Uncertain Conditions. In: AIAA Aviation Forum and ASCEND, 2024. Las Vegas: American Institute of Aeronautics and Astronautics, 2024, s. 1-15. ISBN 978-1-62410-716-0. Detail
- MOTLÍČEK Petr, DIKICI Erinç, MADIKERI Srikanth, RANGAPPA Pradeep, BACKFRIED Gerhard, ROHDIN Johan A., SCHWARZ Petr, KOVÁČ Marek, MALÝ Květoslav, BOBOŠ Dominik, KLAKOW Dietrich a SERGIDOU Eleni Konstantina a kol. ROXSD: The ROXANNE Multimodal and Simulated Dataset for Advancing Criminal Investigations. In: Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Québec City: International Speech Communication Association, 2024, s. 17-24. Detail
- KIŠŠ Martin a HRADIŠ Michal. Self-supervised Pre-training of Text Recognizers. In: Barney Smith, E.H., Liwicki, M., Peng, L. (eds) Document Analysis and Recognition - ICDAR 2024. Lecture Notes in Computer Science, roč. 14807. Athens: Springer Nature Switzerland AG, 2024, s. 218-235. ISBN 978-3-031-70545-8. Detail
- KUBÍK Tibor, ŠILLING Petr a ŠPANĚL Michal. Souhrnná výzkumná zpráva k projektu TESCAN 3DIM - Automatizace zpracování obrazových a 3D dat pomocí hlubokého učení. Brno: TESCAN 3DIM, s.r.o., 2024. Detail
- YUSUF Bolaji, BASKAR Karthick Murali, ROSENBERG Andrew a RAMABHADRAN Bhuvana. Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, s. 792-796. ISSN 1990-9772. Detail
- PRASAD Amrutha, MADIKERI Srikanth, KHALIL Driss, MOTLÍČEK Petr a SCHUEPBACH Christof. Speech and Language Recognition with Low-rank Adaptation of Pretrained Models. In: Proceedings of Interspeech. Kos Island: International Speech Communication Association, 2024, s. 2825-2829. ISSN 1990-9772. Detail
- PEŠÁN Jan, JUŘÍK Vojtěch, RŮŽIČKOVÁ Alexandra, SVOBODA Vojtěch, JANOUŠEK Oto, NĚMCOVÁ Andrea, BOJANOVSKÁ Hana, ALDABAGHOVÁ Jasmína, KYSLÍK Filip, VODIČKOVÁ Kateřina, SODOMOVÁ Adéla, BARTYS Patrik, CHUDÝ Peter a ČERNOCKÝ Jan. Speech production under stress for machine learning: multimodal dataset of 79 cases and 8 signals. Nature Scientific Data, roč. 11, č. 1, 2024, s. 1-9. ISSN 2052-4463. Detail
- ZHANG Lin, WANG Xin, COOPER Erica, DIEZ Sánchez Mireia, LANDINI Federico Nicolás, EVANS Nicholas a YAMAGISHI Junichi. Spoof Diarization: "What Spoofed When" in Partially Spoofed Audio. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, s. 502-506. ISSN 1990-9772. Detail
- WANNER Leo, ČERNOCKÝ Jan, EGOROVA Ekaterina, KLUSCH Matthias a MAVROPOULOS Athanasios a kol. Support of Migrant Reception, Integration, and Social Inclusion by Intelligent Technologies. Information, roč. 15, č. 11, 2024, s. 1-33. ISSN 2078-2489. Detail
- HANÁK Jiří, NOVÁK Jiří a CHUDÝ Peter. Tactical Scenario Adaptation for Pilot Training. In: AIAA/IEEE Digital Avionics Systems Conference - Proceedings. San Diego: Institute of Electrical and Electronics Engineers, 2024, s. 1-7. ISBN 979-8-3503-4961-0. ISSN 2155-7195. Detail
- PENG Junyi, DELCROIX Marc, OCHIAI Tsubasa, PLCHOT Oldřich, ARAKI Shoko a ČERNOCKÝ Jan. Target Speech Extraction with Pre-Trained Self-Supervised Learning Models. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, s. 10421-10425. ISBN 979-8-3503-4485-1. Detail
- LOJDA Jakub, STRNADEL Josef, ŠIMEK Václav, SMRŽ Pavel, HAYES Michael a POPP Ralf. The LoLiPoP-IoT Project: Long Life Power Platforms for Internet of Things. In: Proceedings - 2024 27th Euromicro Conference on Digital System Design, DSD 2024. Paris: Institute of Electrical and Electronics Engineers, 2024, s. 604-611. ISBN 979-8-3503-8038-5. Detail
- DE Leon Martinez Santiago Jose. Understanding User Behavior in Carousel Recommendation Systems for Click Modeling and Learning to Rank. In: Proceedings of the Seventeenth ACM International Conference on Web Search and Data Mining. New York: Association for Computing Machinery, 2024, s. 1139-1141. ISBN 979-8-4007-0371-3. Detail
- NOVÁK Jiří, CHUDÝ Peter a HANÁK Jiří. Weight-varying Model Predictive Control for Coupled Cyber-Physical Systems: Aerial Grasping Study. In: Machine Learning, Optimization, and Data Science. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Castiglione della Pescaia: Springer Nature Switzerland AG, 2024, s. 1-15. ISSN 0302-9743. Detail
- YUSUF Bolaji a SARAÇLAR Murat. Written Term Detection Improves Spoken Term Detection. IEEE/ACM Transactions on Audio, Speech and Language Processing, roč. 32, č. 6, 2024, s. 3213-3223. ISSN 2329-9290. Detail