Ing. Kateřina Žmolíková, Ph.D.

Researcher


izmolikova@fit.vut.cz
Office: L230.1
BUT (VUT) personal number: 143646

Publications

  • 2023

    ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, OCHIAI Tsubasa, ČERNOCKÝ Jan, KINOSHITA Keisuke and YU Dong. Neural Target Speech Extraction: An overview. IEEE Signal Processing Magazine, vol. 40, no. 3, 2023, pp. 8-29. ISSN 1558-0792.

  • 2022

    ŠVEC Ján, ŽMOLÍKOVÁ Kateřina, KOCOUR Martin, DELCROIX Marc, OCHIAI Tsubasa, MOŠNER Ladislav and ČERNOCKÝ Jan. Analysis of impact of emotions on target speech extraction and speech separation. In: Proceedings of The 17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022). Bamberg: IEEE Signal Processing Society, 2022, pp. 1-5. ISBN 978-1-6654-6867-1.

    DELCROIX Marc, KINOSHITA Keisuke, OCHIAI Tsubasa, ŽMOLÍKOVÁ Kateřina, SATO Hiroshi and NAKATANI Tomohiro. Listen only to me! How well can target speech extraction handle false alarms? In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, pp. 216-220. ISSN 1990-9772.

    KOCOUR Martin, ŽMOLÍKOVÁ Kateřina, ONDEL Yang Lucas Antoine Francois, ŠVEC Ján, DELCROIX Marc, OCHIAI Tsubasa, BURGET Lukáš and ČERNOCKÝ Jan. Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, pp. 4955-4959. ISSN 1990-9772.

    DE Benito Gorron Diego, ŽMOLÍKOVÁ Kateřina and TORRE Toledano Doroteo. Source Separation for Sound Event Detection in domestic environments using jointly trained models. In: Proceedings of The 17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022). Bamberg: IEEE Signal Processing Society, 2022, pp. 1-5. ISBN 978-1-6654-6867-1.

  • 2021

    ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, RAJ Desh, WATANABE Shinji and ČERNOCKÝ Jan. Auxiliary Loss Function for Target Speech Extraction and Recognition with Weak Supervision Based on Speaker Characteristics. In: Proceedings of 2021 Interspeech. Brno: International Speech Communication Association, 2021, pp. 1464-1468. ISSN 1990-9772.

    LANDINI Federico Nicolás, LOZANO Díez Alicia, BURGET Lukáš, DIEZ Sánchez Mireia, SILNOVA Anna, ŽMOLÍKOVÁ Kateřina, GLEMBEK Ondřej, MATĚJKA Pavel, STAFYLAKIS Themos and BRUMMER Johan Nikolaas Langenhoven. BUT System Description for The Third DIHARD Speech Diarization Challenge. In: Proceedings available at Dihard Challenge Github. On-line by LDC and University of Pennsylvania, 2021, pp. 1-5.

    ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, BURGET Lukáš, NAKATANI Tomohiro and ČERNOCKÝ Jan. Integration of Variational Autoencoder and Spatial Clustering for Adaptive Multi-Channel Neural Speech Separation. In: 2021 IEEE Spoken Language Technology Workshop, SLT 2021 - Proceedings. Shenzhen - virtual: IEEE Signal Processing Society, 2021, pp. 889-896. ISBN 978-1-7281-7066-4.

    VYDANA Hari K., KARAFIÁT Martin, ŽMOLÍKOVÁ Kateřina, BURGET Lukáš and ČERNOCKÝ Jan. Jointly Trained Transformers Models for Spoken Language Translation. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, pp. 7513-7517. ISBN 978-1-7281-7605-5.

    DELCROIX Marc, ŽMOLÍKOVÁ Kateřina, OCHIAI Tsubasa, KINOSHITA Keisuke and NAKATANI Tomohiro. Speaker activity driven neural speech extraction. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Toronto: IEEE Signal Processing Society, 2021, pp. 6099-6103. ISBN 978-1-7281-7605-5.

  • 2020

    ŽMOLÍKOVÁ Kateřina, KOCOUR Martin, LANDINI Federico Nicolás, BENEŠ Karel, KARAFIÁT Martin, VYDANA Hari K., LOZANO Díez Alicia, PLCHOT Oldřich, BASKAR Murali K., ŠVEC Ján, MOŠNER Ladislav, MALENOVSKÝ Vladimír, BURGET Lukáš, YUSUF Bolaji, NOVOTNÝ Ondřej, GRÉZL František, SZŐKE Igor and ČERNOCKÝ Jan. BUT System for CHiME-6 Challenge. In: Proceedings of CHiME 2020 Virtual Workshop. Barcelona: University of Sheffield, 2020, pp. 1-3.

    LANDINI Federico Nicolás, WANG Shuai, DIEZ Sánchez Mireia, BURGET Lukáš, MATĚJKA Pavel, ŽMOLÍKOVÁ Kateřina, MOŠNER Ladislav, SILNOVA Anna, PLCHOT Oldřich, NOVOTNÝ Ondřej, ZEINALI Hossein and ROHDIN Johan A. BUT System for the Second DIHARD Speech Diarization Challenge. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020, pp. 6529-6533. ISBN 978-1-5090-6631-5.

    DELCROIX Marc, OCHIAI Tsubasa, ŽMOLÍKOVÁ Kateřina, KINOSHITA Keisuke, TAWARA Naohiro, NAKATANI Tomohiro and ARAKI Shoko. Improving Speaker Discrimination of Target Speech Extraction With Time-Domain Speakerbeam. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020, pp. 691-695. ISBN 978-1-5090-6631-5.

  • 2019

    DELCROIX Marc, ŽMOLÍKOVÁ Kateřina, OCHIAI Tsubasa, KINOSHITA Keisuke, ARAKI Shoko and NAKATANI Tomohiro. Compact Network for Speakerbeam Target Speaker Extraction. In: Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019, pp. 6965-6969. ISBN 978-1-5386-4658-8.

    DELCROIX Marc, ŽMOLÍKOVÁ Kateřina, OCHIAI Tsubasa, KINOSHITA Keisuke, ARAKI Shoko and NAKATANI Tomohiro. Evaluation of SpeakerBeam target speech extraction in real noisy and reverberant conditions. The Journal of the Acoustical Society of Japan, vol. 2019, no. 2, pp. 1-2. ISSN 0369-4232.

    ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, KINOSHITA Keisuke, OCHIAI Tsubasa, NAKATANI Tomohiro, BURGET Lukáš and ČERNOCKÝ Jan. SpeakerBeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures. IEEE Journal of Selected Topics in Signal Processing, vol. 13, no. 4, 2019, pp. 800-814. ISSN 1932-4553.

  • 2018

    DIEZ Sánchez Mireia, LANDINI Federico Nicolás, BURGET Lukáš, ROHDIN Johan A., SILNOVA Anna, ŽMOLÍKOVÁ Kateřina, NOVOTNÝ Ondřej, VESELÝ Karel, GLEMBEK Ondřej, PLCHOT Oldřich, MOŠNER Ladislav and MATĚJKA Pavel. BUT system for DIHARD Speech Diarization Challenge 2018. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, pp. 2798-2802. ISSN 1990-9772.

    ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, KINOSHITA Keisuke, HIGUCHI Takuya, NAKATANI Tomohiro and ČERNOCKÝ Jan. Optimization of Speaker-aware Multichannel Speech Extraction with ASR Criterion. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 6702-6706. ISBN 978-1-5386-4658-8.

    DELCROIX Marc, ŽMOLÍKOVÁ Kateřina, KINOSHITA Keisuke, OGAWA Atsunori and NAKATANI Tomohiro. Single Channel Target Speaker Extraction and Recognition with Speaker Beam. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 5554-5558. ISBN 978-1-5386-4658-8.

    DELCROIX Marc, ŽMOLÍKOVÁ Kateřina, KINOSHITA Keisuke, ARAKI Shoko, OGAWA Atsunori and NAKATANI Tomohiro. SpeakerBeam: A New Deep Learning Technology for Extracting Speech of a Target Speaker Based on the Speaker's Voice Characteristics. NTT Technical Review, vol. 16, no. 11, 2018, pp. 19-24. ISSN 1348-3447.

  • 2017

    HIGUCHI Takuya, KINOSHITA Keisuke, DELCROIX Marc, ŽMOLÍKOVÁ Kateřina and NAKATANI Tomohiro. Deep clustering-based beamforming for separation with unknown number of sources. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 1183-1187. ISSN 1990-9772.

    ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, KINOSHITA Keisuke, HIGUCHI Takuya, OGAWA Atsunori and NAKATANI Tomohiro. Learning Speaker Representation for Neural Network Based Multichannel Speaker Extraction. In: Proceedings of ASRU 2017. Okinawa: IEEE Signal Processing Society, 2017, pp. 8-15. ISBN 978-1-5090-4788-8.

    ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, KINOSHITA Keisuke, HIGUCHI Takuya, OGAWA Atsunori and NAKATANI Tomohiro. Speaker-aware neural network based beamformer for speaker extraction in speech mixtures. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 2655-2659. ISSN 1990-9772.

    ŽMOLÍKOVÁ Kateřina. Summary report of project "Speech enhancement front-end for robust automatic speech recognition with large amount of training data" for Year 2017. Brno: NTT Corporation, 2017.

    KARAFIÁT Martin, VESELÝ Karel, ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, WATANABE Shinji, BURGET Lukáš, ČERNOCKÝ Jan and SZŐKE Igor. Training Data Augmentation and Data Selection. New Era for Robust Speech Recognition: Exploiting Deep Learning. Computer Science, Artificial Intelligence. Heidelberg: Springer International Publishing, 2017, pp. 245-260. ISBN 978-3-319-64679-4.

  • 2016

    ŽMOLÍKOVÁ Kateřina, KARAFIÁT Martin, VESELÝ Karel, DELCROIX Marc, WATANABE Shinji, BURGET Lukáš and ČERNOCKÝ Jan. Data selection by sequence summarizing neural network in mismatch condition training. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, pp. 2354-2358. ISBN 978-1-5108-3313-5.

    VESELÝ Karel, WATANABE Shinji, ŽMOLÍKOVÁ Kateřina, KARAFIÁT Martin, BURGET Lukáš and ČERNOCKÝ Jan. Sequence Summarizing Neural Network for Speaker Adaptation. In: Proceedings of the 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016, pp. 5315-5319. ISBN 978-1-4799-9988-0.
