Peer-reviewed and preprint work on speech recognition, spoken language understanding, and speech-and-audio LLMs. Full list available via Google Scholar.
2025
TSD
2025
I. Thorbecke, E. Villatoro-Tello, J. P. Zuluaga, S. Kumar, S. Burdisso, P. Rangappa, A. Carofilis, S. Madikeri, P. Motlicek, K. Pandia, K. Hacioglu, A. Stolcke.
Single-pass trie unifies global vocabulary biasing with utterance-level context biasing for transducer ASR.
ICASSP
2025
P. Rangappa, S. Madikeri, J. P. Zuluaga, J. Villatoro-Tello, P. Motlicek.
A domain classifier plus pseudo-label filtering cuts ASR fine-tuning compute by ~40% at matched WER.
ICASSP
2025
S. Madikeri, J. P. Zuluaga, P. Rangappa, J. Villatoro-Tello, P. Motlicek.
Streaming ASR atop a frozen self-supervised backbone, without sacrificing non-streaming accuracy.
2024
EMNLP
2024
S. Kumar, S. Madikeri, J. P. Zuluaga, I. Thorbecke, E. Villatoro-Tello, S. Burdisso, P. Motlicek, K. S, A. Ganapathiraju.
A single transducer framework that jointly handles ASR, speaker change, and named-entity tagging — one model, one decoding pass.
EMNLP Findings
2024
I. Thorbecke, J. P. Zuluaga, E. Villatoro-Tello, S. Kumar, P. Rangappa, S. Burdisso, P. Motlicek, K. S, A. Ganapathiraju.
Distill Whisper into a streaming transducer in hours instead of days — practical recipe for low-latency, low-resource ASR.
arXiv
2024
I. Nigmatulina, S. Madikeri, E. Villatoro-Tello, P. Motlicek, J. P. Zuluaga, K. Pandia, A. Ganapathiraju.
Aho-Corasick keyword spotting fused with an LM during transducer decoding — boosts rare-word recognition without retraining.
Contextual Biasing for Keyword Recognition on Low-Resource Air Traffic Control Data
Interspeech
2024
M. Kocour, K. Veselý, I. Szoke, S. Kesiraju, J. P. Zuluaga, A. Blatt, M. Motlíček, J. Černocký.
Contextual biasing with deep fusion for domain-specific keyword recognition in ATC.
JMLR
2024
M. Ravanelli, T. Parcollet, A. Moumen, S. de Langen, C. Subakan, P. Plantinga, Y. Liao, S. Cornell, D. Roman, S. Moradi, D. Chander, D. Petermann, Y. Wang, J. P. Zuluaga, et al.
Co-authored the 1.0 release of SpeechBrain — a PyTorch toolkit for conversational AI.
2023
EMNLP
2023
J. P. Zuluaga, Z. Huang, X. Niu, R. Paturi, S. Srinivasan, P. Mathur, B. Thompson, M. Federico.
First end-to-end speech translation system that handles speaker turns and overlapped speech on a single channel.
Interspeech
2023
F. Mai, J. P. Zuluaga, T. Parcollet, P. Motlicek.
Replaces Conformer attention with HyperMixer, matching accuracy at a fraction of the compute.
Interspeech
2023
I. Nigmatulina, S. Madikeri, E. Villatoro-Tello, P. Motlicek, J. P. Zuluaga, K. Pandia, A. Ganapathiraju.
Adds contextual biasing to a GPU CTC/attention decoder for online ASR with minimal latency overhead.
ICASSP
2023
E. Villatoro-Tello, S. Madikeri, J. P. Zuluaga, B. Sharma, S. Sarfjoo, I. Nigmatulina, P. Motlicek, A. Ivanov, A. Ganapathiraju.
Systematic comparison of text, acoustic, and lattice features for SLU — lattices recover information lost in 1-best ASR.
arXiv
2023
J. P. Zuluaga, A. Prasad, I. Nigmatulina, S. Sarfjoo, P. Motlicek, M. Kleinert, H. Helmke, O. Ohneiser, Q. Zhan.
End-to-end virtual pilot agent — ASR, intent parsing, and TTS — for training and evaluating air traffic controllers.
arXiv
2023
J. P. Zuluaga, K. Vesely, A. Blatt, P. Motlicek, et al.
Postmortem of the ATCO2 project — what worked, what didn't, and what's next for low-resource ASR in safety-critical domains.
Interspeech
2023
J. P. Zuluaga, S. Sarfjoo, A. Prasad, I. Nigmatulina, P. Motlicek, K. Ondrej, O. Ohneiser, H. Helmke.
Accent classification benchmark on Common Voice using large self-supervised models — **Best Student Paper nominee**.
arXiv
2023
J. P. Zuluaga, K. Vesely, I. Szoke, P. Motlicek, M. Kocour, M. Rigault, K. Choukri, A. Prasad, S. Sarfjoo, I. Nigmatulina, C. Cevenini, P. Kolcarek, A. Tart, J. Cernocky, D. Klakow.
5,000 hours of Air Traffic Control communications — the largest open ATC speech corpus.
2022
IEEE SLT
2022
J. P. Zuluaga, A. Prasad, I. Nigmatulina, S. Sarfjoo, P. Motlicek, M. Kleinert, H. Helmke, O. Ohneiser, Q. Zhan.
Systematic study of self-supervised pretraining under domain shift — 20–40% relative WER cut on Air Traffic Control.
IEEE SLT
2022
J. P. Zuluaga, S. Sarfjoo, A. Prasad, I. Nigmatulina, P. Motlicek, K. Vesely, M. Kocour, I. Szoke.
Joint speaker-role and speaker-change detection for ATC using BERT — 27% DER reduction over audio-only baselines.
ICASSP
2022
J. P. Zuluaga, I. Nigmatulina, A. Prasad, P. Motlicek, K. Vesely, M. Kocour, I. Szoke.
CASE @ EMNLP
2022
S. Burdisso, J. P. Zuluaga, E. Villatoro-Tello, M. Fajcik, M. Singh, P. Smrz, P. Motlicek.
Few-shot prompt-based causal relation extraction — winner-tier system on the Causal News Corpus 2022 shared task.
CASE @ EMNLP
2022
M. Fajcik, M. Singh, J. P. Zuluaga, E. Villatoro-Tello, S. Burdisso, P. Motlicek, P. Smrz.
Generative LM for joint cause-effect-signal triple extraction — companion submission to the same Causal News shared task.
arXiv
2022
J. P. Zuluaga, A. Prasad, I. Nigmatulina, P. Motlicek, M. Kleinert, H. Helmke, O. Ohneiser, Q. Zhan.
Speech + NLP pipeline that drives a pseudo-pilot simulator for ATCO training — built on top of the ATCO2 corpus.
2021
Interspeech
2021
J. P. Zuluaga, I. Nigmatulina, A. Prasad, P. Motlicek, K. Vesely, M. Kocour, I. Szoke.
Boosting of Contextual Information in ASR for Air-Traffic Call-Sign Recognition
Interspeech
2021
I. Nigmatulina, R. Braun, J. P. Zuluaga, P. Motlicek.
arXiv
2021
A. Prasad, J. P. Zuluaga, P. Motlicek, S. Sarfjoo, I. Nigmatulina, K. Vesely, M. Kocour, I. Szoke.
arXiv
2021
I. Nigmatulina, R. Braun, J. P. Zuluaga, P. Motlicek.
Domain-Adversarial Based Model with Phonological Knowledge for Cross-Lingual Speech Recognition
Electronics
2021
Q. Xu, Y. Li, J. Shen, J. Liu, Y. Yang, J. P. Zuluaga.
2020
Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications
MDPI Proceedings
2020
J. P. Zuluaga, M. Kocour, K. Vesely, I. Szoke, P. Motlicek, M. Rigault, K. Choukri, A. Tart, J. Cernocky.
Automatic Speech Recognition Benchmark for Air-Traffic Communications
Interspeech
2020
J. P. Zuluaga, P. Motlicek, Q. Zhan, K. Vesely, R. Braun.
arXiv
2020
S. Madikeri, S. Tong, J. P. Zuluaga, A. Vyas, P. Motlicek, H. Bourlard.
PyTorch package for LF-MMI acoustic model training, bridging Kaldi and modern PyTorch workflows.
CMBBE
2020
J. P. Zuluaga, Z. Al Masry, K. Benaggoune, S. Meraghni, N. Zerhouni.
2019
A Survey of Breast Cancer Screening Techniques: Thermography and Electrical Impedance Tomography
Journal of Medical Engineering & Technology
2019
J. P. Zuluaga, Z. Al Masry, K. Benaggoune, S. Meraghni, N. Zerhouni.
A Portable Breast Cancer Detection System Based on Smartphone with Infrared Camera
Vibroengineering Procedia
2019
J. P. Zuluaga, Z. Al Masry, N. Zerhouni.