Faculty of Information Technology, BUT

Publications

  • 2018

    DIEZ Sánchez Mireia, LANDINI Federico Nicolás, BURGET Lukáš, ROHDIN Johan A., SILNOVA Anna, ŽMOLÍKOVÁ Kateřina, NOVOTNÝ Ondřej, VESELÝ Karel, GLEMBEK Ondřej, PLCHOT Oldřich, MOŠNER Ladislav and MATĚJKA Pavel. BUT system for DIHARD Speech Diarization Challenge 2018. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, pp. 2798-2802. ISSN 1990-9772.
    Detail

    ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, KINOSHITA Keisuke, HIGUCHI Takuya, NAKATANI Tomohiro and ČERNOCKÝ Jan. Optimization of Speaker-aware Multichannel Speech Extraction with ASR Criterion. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 6702-6706. ISBN 978-1-5386-4658-8.
    Detail

    DELCROIX Marc, ŽMOLÍKOVÁ Kateřina, KINOSHITA Keisuke, OGAWA Atsunori and NAKATANI Tomohiro. Single Channel Target Speaker Extraction and Recognition with Speaker Beam. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 5554-5558. ISBN 978-1-5386-4658-8.
    Detail

  • 2017

    HIGUCHI Takuya, KINOSHITA Keisuke, DELCROIX Marc, ŽMOLÍKOVÁ Kateřina and NAKATANI Tomohiro. Deep clustering-based beamforming for separation with unknown number of sources. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 1183-1187. ISSN 1990-9772.
    Detail

    ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, KINOSHITA Keisuke, HIGUCHI Takuya, OGAWA Atsunori and NAKATANI Tomohiro. Learning Speaker Representation for Neural Network Based Multichannel Speaker Extraction. In: Proceedings of ASRU 2017. Okinawa: IEEE Signal Processing Society, 2017, pp. 8-15. ISBN 978-1-5090-4788-8.
    Detail

    ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, KINOSHITA Keisuke, HIGUCHI Takuya, OGAWA Atsunori and NAKATANI Tomohiro. Speaker-aware neural network based beamformer for speaker extraction in speech mixtures. In: Proceedings of Interspeech 2017. Stocholm: International Speech Communication Association, 2017, pp. 2655-2659. ISSN 1990-9772.
    Detail

    ŽMOLÍKOVÁ Kateřina. Summary report of project "Speech enhancement front-end for robust automatic speech recognition with large amount of training data" for Year 2017. Brno: NTT Corporation, 2017.
    Detail

    KARAFIÁT Martin, VESELÝ Karel, ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, WATANABE Shinji, BURGET Lukáš, ČERNOCKÝ Jan and SZŐKE Igor. Training Data Augmentation and Data Selection. New Era for Robust Speech Recognition: Exploiting Deep Learning. Computer Science, Artificial Intelligence. Heidelberg: Springer International Publishing, 2017, pp. 245-260. ISBN 978-3-319-64679-4.
    Detail

  • 2016

    ŽMOLÍKOVÁ Kateřina, KARAFIÁT Martin, VESELÝ Karel, DELCROIX Marc, WATANABE Shinji, BURGET Lukáš and ČERNOCKÝ Jan. Data selection by sequence summarizing neural network in mismatch condition training. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, pp. 2354-2358. ISBN 978-1-5108-3313-5.
    Detail

    VESELÝ Karel, WATANABE Shinji, ŽMOLÍKOVÁ Kateřina, KARAFIÁT Martin, BURGET Lukáš and ČERNOCKÝ Jan. Sequence Summarizing Neural Network for Speaker Adaptation. In: Proceedings of the 41th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016, pp. 5315-5319. ISBN 978-1-4799-9988-0.
    Detail

Back to top