Project Details
DARPA Robust Automatic Transcription of Speech (RATS) - RATS Patrol II
Project Period: 23. 2. 2015 – 31. 3. 2017
Project Type: contract
Partner: Raytheon BBN Technologies Corp
speech recognition, speaker recognition, language recognition, keyword spotting,
robustness, noise, transmission channels
Existing speech signal processing technologies are inadequate for most noisy or
degraded speech signals that are important to military intelligence. The Robust
Automatic Transcription of Speech (RATS) program is creating algorithms and
software for performing the following tasks on potentially speech-containing
signals received over communication channels that are extremely noisy and/or
highly distorted: Speech Activity Detection, Language Identification, Speaker
Identification and Key Word Spotting.
Burget Lukáš, doc. Ing., Ph.D. (DCGM)
Černocký Jan, prof. Dr. Ing. (DCGM)
Fér Radek, Ing.
Glembek Ondřej, Ing., Ph.D.
Heřmanský Hynek, prof. Ing., Dr. Eng. (DCGM)
Karafiát Martin, Ing., Ph.D. (DCGM)
Kobes Michal
Novotný Ondřej, Ing., Ph.D.
Ogawa Tetsuji
Ondel Lucas Antoine Francois, Mgr., Ph.D. (SSDIT)
Plchot Oldřich, Ing., Ph.D. (DCGM)
Popková Anna, Ing.
Silnova Anna, M.Sc., Ph.D. (DCGM)
Skácel Miroslav, Ing. (EÚ OEI)
Veselý Karel, Ing., Ph.D. (DCGM)
2016
- LI, R.; MALLIDI, S.; PLCHOT, O.; BURGET, L.; DEHAK, N. Exploiting Hidden-Layer Responses of Deep Neural Networks for Language Recognition. In Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016.
p. 3265-3269. ISBN: 978-1-5108-3313-5. Detail - MATĚJKA, P.; GLEMBEK, O.; NOVOTNÝ, O.; PLCHOT, O.; GRÉZL, F.; BURGET, L.; ČERNOCKÝ, J. Analysis Of DNN Approaches To Speaker Identification. In Proceedings of the 41th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016.
p. 5100-5104. ISBN: 978-1-4799-9988-0. Detail - NOVOTNÝ, O.; MATĚJKA, P.; GLEMBEK, O.; PLCHOT, O.; GRÉZL, F.; BURGET, L.; ČERNOCKÝ, J. Analysis of the DNN-Based SRE Systems in Multi-language Conditions. In Proceedings of SLT 2016. San Diego: IEEE Signal Processing Society, 2016.
p. 199-204. ISBN: 978-1-5090-4903-5. Detail - NOVOTNÝ, O.; MATĚJKA, P.; PLCHOT, O.; GLEMBEK, O.; BURGET, L.; ČERNOCKÝ, J. Analysis of Speaker Recognition Systems in Realistic Scenarios of the SITW 2016 Challenge. In Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016.
p. 828-832. ISBN: 978-1-5108-3313-5. Detail - PEŠÁN, J.; BURGET, L.; ČERNOCKÝ, J. Sequence Summarizing Neural Networks for Spoken Language Recognition. In Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016.
p. 3285-3289. ISBN: 978-1-5108-3313-5. Detail - PLCHOT, O.; BURGET, L.; ARONOWITZ, H.; MATĚJKA, P. Audio Enhancing With DNN Autoencoder For Speaker Recognition. In Proceedings of the 41th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016.
p. 5090-5094. ISBN: 978-1-4799-9988-0. Detail - PLCHOT, O.; MATĚJKA, P.; FÉR, R.; GLEMBEK, O.; NOVOTNÝ, O.; PEŠÁN, J.; VESELÝ, K.; ONDEL YANG, L.; KARAFIÁT, M.; GRÉZL, F.; KESIRAJU, S.; BURGET, L.; BRUMMER, J.; SWART, A.; CUMANI, S.; MALLIDI, S.; LI, R. BAT System Description for NIST LRE 2015. In Proceedings of Odyssey 2016, The Speaker and Language Recognition Workshop. Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland. Bilbao: International Speech Communication Association, 2016. no. 06,
p. 166-173. ISSN: 2312-2846. Detail
2015
- CUMANI, S.; PLCHOT, O.; FÉR, R. Exploiting i-vector posterior covariances for short-duration language recognition. In Proceedings of Interspeech 2015. Proceedings of Interspeech. Dresden: International Speech Communication Association, 2015. no. 09,
p. 1002-1006. ISBN: 978-1-5108-1790-6. ISSN: 1990-9772. Detail - FÉR, R.; MATĚJKA, P.; GRÉZL, F.; PLCHOT, O.; ČERNOCKÝ, J. Multilingual Bottleneck Features for Language Recognition. In Proceedings of Interspeech 2015. Proceedings of Interspeech. Dresden: International Speech Communication Association, 2015. no. 09,
p. 389-393. ISBN: 978-1-5108-1790-6. ISSN: 1990-9772. Detail - PEŠÁN, J.; BURGET, L.; HEŘMANSKÝ, H.; VESELÝ, K. DNN derived filters for processing of modulation spectrum of speech. In Proceedings of Interspeech 2015. Proceedings of Interspeech. Dresden: International Speech Communication Association, 2015. no. 09,
p. 1908-1911. ISBN: 978-1-5108-1790-6. ISSN: 1990-9772. Detail
2017
- Summary report for project "Robust Automatic Speech Transcription" in Year 2017, summary research report, 2017
Authors: MATĚJKA, P.
2016
- Summary report for project "Robust Automatic Speech Transcription" in Year 2016, summary research report, 2016
Authors: MATĚJKA, P.
2015
- Summary report for project "Robust Automatic Speech Transcription" in Year 2015, summary research report, 2015
Authors: MATĚJKA, P.; PLCHOT, O.; NOVOTNÝ, O.; FÉR, R.
2016
- ABC NIST SRE 2016 SYSTEM DESCRIPTION, report, 2016
Authors: BRUMMER, J.; SWART, A.; PRIETO, J.; GARCIA PERERA, L.; MATĚJKA, P.; PLCHOT, O.; DIEZ SÁNCHEZ, M.; SILNOVA, A.; JIANG, X.; NOVOTNÝ, O.; ROHDIN, J.; GLEMBEK, O.; GRÉZL, F.; BURGET, L.; ONDEL YANG, L.; PEŠÁN, J.; ČERNOCKÝ, J.; KENNY, P.; ALAM, J.; BHATTACHARYA, G.; ZEINALI, H.