Publications – Romain SERIZEL

Publications HAL de romain serizel

2024

Journal articles

titre: Evaluating and predicting the audibility of acoustic alarms in the workplace using experimental methods and deep learning
auteur: François Effa, Jean-Pierre Arz, Romain Serizel, Nicolas Grimault
article: Applied Acoustics, 2024, 219, pp.109955. ⟨10.1016/j.apacoust.2024.109955⟩
typdoc: Journal articles
DOI: DOI : 10.1016/j.apacoust.2024.109955
Accès au texte intégral et bibtex

Conference papers

titre: Mixture of Mixups for Multi-label Classification of Rare Anuran Sounds
auteur: Ilyass Moummad, Nicolas Farrugia, Romain Serizel, Jeremy Froidevaux, Vincent Lostanlen
article: EUSIPCO 2024, Aug 2024, Lyon, France
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: RoboVox: A Single/Multi-channel Far-field Speaker Recognition Benchmark for a Mobile Robot
auteur: Mohammad Mohammadamini, Driss Matrouf, Michael Rouvier, Jean-Francois Bonastre, Romain Serizel, Theophile Gonos
article: LREC_COLING, ELRA, May 2024, Turino, Italy
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Unsupervised speech enhancement with diffusion-based generative models
auteur: Berné Nortier, Mostafa Sadeghi, Romain Serizel
article: International Conference on Acoustics Speech and Signal Processing (ICASSP), IEEE, Apr 2024, Seoul (Korea), South Korea. ⟨10.48550/arXiv.2309.10450⟩
typdoc: Conference papers
DOI: DOI : 10.48550/arXiv.2309.10450
Accès au texte intégral et bibtex

titre: A weighted-variance variational autoencoder model for speech enhancement
auteur: Ali Golmakani, Mostafa Sadeghi, Xavier Alameda-Pineda, Romain Serizel
article: ICASSP 2024 – International Conference on Acoustics Speech and Signal Processing, IEEE, Apr 2024, Seoul (Korea), South Korea. pp.1-5
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Posterior sampling algorithms for unsupervised speech enhancement with recurrent variational autoencoder
auteur: Mostafa Sadeghi, Romain Serizel
article: International Conference on Acoustics Speech and Signal Processing (ICASSP), IEEE, Apr 2024, Seoul (Korea), South Korea. ⟨10.48550/arXiv.2309.10439⟩
typdoc: Conference papers
DOI: DOI : 10.48550/arXiv.2309.10439
Accès au texte intégral et bibtex

titre: Diffusion-based speech enhancement with a weighted generative-supervised learning loss
auteur: Jean-Eudes Ayilo, Mostafa Sadeghi, Romain Serizel
article: International Conference on Acoustics Speech and Signal Processing (ICASSP), IEEE, Apr 2024, Seoul (Korea), South Korea. ⟨10.48550/arXiv.2309.10457⟩
typdoc: Conference papers
DOI: DOI : 10.48550/arXiv.2309.10457
Accès au texte intégral et bibtex

2023

Conference papers

titre: From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion
auteur: Robin San Roman, Yossi Adi, Antoine Deleforge, Romain Serizel, Gabriel Synnaeve, Alexandre Défossez
article: NeurIPS 2023 – Conference on Neural Information Processing Systems, Dec 2023, New Orleans, United States. ⟨10.48550/arXiv.2308.02560⟩
typdoc: Conference papers
DOI: DOI : 10.48550/arXiv.2308.02560
Accès au texte intégral et bibtex

titre: Pretraining Representations for Bioacoustic Few-Shot Detection using Supervised Contrastive Learning
auteur: Ilyass Moummad, Romain Serizel, Nicolas Farrugia
article: Detection and Classification of Acoustic Scenes and Events 2023, Sep 2023, TAMPERE, Finland
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Monitoring environmental impact of DCASE systems: Why and how ?
auteur: Constance Douwes, Francesca Ronchini, Romain Serizel
article: Detection and Classification of Acoustic Scene and Events (DCASE) Workshop, Sep 2023, Tampere (Finlande), Finland
typdoc: Conference papers
Accès au bibtex

titre: Post-Processing Independent Evaluation of Sound Event Detection Systems
auteur: Janek Ebbers, Reinhold Haeb-Umbach, Romain Serizel
article: DCASE 2023 – 8th Workshop on Detection and Classification of Acoustic Scenes and Events, Sep 2023, Tampere, Finland. ⟨10.48550/arXiv.2306.15440⟩
typdoc: Conference papers
DOI: DOI : 10.48550/arXiv.2306.15440
Accès au texte intégral et bibtex

titre: BinauRec: A dataset to test the influence of the use of room impulse responses on binaural speech enhancement
auteur: Louis Delebecque, Romain Serizel
article: EUSIPCO23, EURASIP, Sep 2023, Helsiinki, Finland. ⟨10.23919/EUSIPCO58844.2023.10289772⟩
typdoc: Conference papers
DOI: DOI : 10.23919/EUSIPCO58844.2023.10289772
Accès au texte intégral et bibtex

titre: Self-supervised learning with diffusion-based multichannel speech enhancement for speaker verification under noisy conditions
auteur: Sandipana Dowerah, Ajinkya Kulkarni, Romain Serizel, Denis Jouvet
article: INTERSPEECH 2023, Aug 2023, Dublin (Ireland), Ireland. pp.3849-3853, ⟨10.21437/Interspeech.2023-1890⟩
typdoc: Conference papers
DOI: DOI : 10.21437/Interspeech.2023-1890
Accès au texte intégral et bibtex

titre: Performance above all ? energy consumption vs. performance for machine listening, a study on dcase task 4 baseline
auteur: Romain Serizel, Samuele Cornell, Nicolas Turpault
article: ICASSP 2023 – 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Jun 2023, Rhodes Island, France. pp.1-5, ⟨10.1109/ICASSP49357.2023.10095938⟩
typdoc: Conference papers
DOI: DOI : 10.1109/ICASSP49357.2023.10095938
Accès au texte intégral et bibtex

titre: CONVOLUTIONAL NEURAL NETWORK FOR AUDIBILITY ASSESSMENT OF ACOUSTIC ALARMS
auteur: François Effa, Romain Serizel, Jean-Pierre Arz, Nicolas Grimault
article: International Conference on Acoustics, Speech and Signal Processing Search form Search ICASSP IEEE 2023, Jun 2023, Rhodes, Greece
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Fast and efficient speech enhancement with variational autoencoders
auteur: Mostafa Sadeghi, Romain Serizel
article: International Conference on Acoustics Speech and Signal Processing (ICASSP), IEEE, Jun 2023, Rhodes island, Greece
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Lightweight Annotation and Class Weight Training for Automatic Estimation of Alarm Audibility in Noise
auteur: François Effa, Romain Serizel, Jean-Pierre Arz, Nicolas Grimault
article: ICASSP 2023 – 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, Jun 2023, Rhodes Island, Greece. pp.1-5, ⟨10.1109/ICASSP49357.2023.10094730⟩
typdoc: Conference papers
DOI: DOI : 10.1109/ICASSP49357.2023.10094730
Accès au texte intégral et bibtex

titre: SPICE+: Evaluation of automatic audio captioning systems with pre-trained language models
auteur: Félix Gontier, Romain Serizel, Christophe Cerisara
article: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023), Jun 2023, Rhodes Island, Greece
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Audio-visual speech enhancement with a deep kalman filter generative model
auteur: Ali Golmakani, Mostafa Sadeghi, Romain Serizel
article: International Conference on Acoustics Speech and Signal Processing (ICASSP), IEEE, Jun 2023, Rhodes island, Greece
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Joint optimization of diffusion probabilistic-based multichannel speech enhancement with far-field speaker verification
auteur: Sandipana Dowerah, Romain Serizel, Denis Jouvet, M Mohammadamini, Driss Matrouf
article: IEEE SLT 2022, Jan 2023, Doha, Qatar
typdoc: Conference papers
Accès au texte intégral et bibtex

Reports

titre: Robovox: Far-Field Speaker Recognition By A Mobile Robot (Evaluation Plan)
auteur: Mohammad Mohammadamini, Mickael Rouvier, Driss Matrouf, Jean-François Bonastre, Romain Serizel, Denis Jouvet, Théophile Gonos
article: Avignon Université. 2023
typdoc: Reports
Accès au texte intégral et bibtex

titre: Supervised contrastive learning for pre-training bioacoustic few-shot systems
auteur: Ilyass Moummad, Romain Serizel, Nicolas Farrugia
article: IMT Atlantique; LORIA. 2023
typdoc: Reports
Accès au texte intégral et bibtex

2022

Conference papers

titre: Integrating isolated examples with weakly-supervised sound event detection: a direct approach
auteur: Mohammad Abdollahi, Romain Serizel, Alain Rakotomamonjy, Gilles Gasso
article: 7th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), Nov 2022, Nancy, France
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: How to Leverage DNN-based speech enhancement for multi-channel speaker verification?
auteur: Sandipana Dowerah, Romain Serizel, Denis Jouvet, Mohammad Mohammadamini, Driss Matrouf
article: 4th International Conference on Advances in Signal Processing and Artificial Intelligence (ASPAI’ 2022), Oct 2022, Corfu, Greece
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Barlow Twins self-supervised learning for robust speaker recognition
auteur: Mohammad Mohammadamini, Driss Matrouf, Jean-François A Bonastre, Sandipana Dowerah, Romain Serizel, Denis Jouvet
article: Interspeech 2022 – Human and Humanizing Speech Technology, Sep 2022, Incheon, South Korea. ⟨10.21437/Interspeech.2022-11301⟩
typdoc: Conference papers
DOI: DOI : 10.21437/Interspeech.2022-11301
Accès au texte intégral et bibtex

titre: A Comprehensive Exploration of Noise Robustness and Noise Compensation in ResNet and TDNN-based Speaker Recognition Systems
auteur: Mohammad Mohammadamini, Driss Matrouf, Jean-François Bonastre, Sandipana Dowerah, Romain Serizel, Denis Jouvet
article: EUSIPCO 2022 – 30th European Signal Processing Conference, Aug 2022, Belgrade, Serbia
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Learning noise robust ResNet-based speaker embedding for speaker recognition
auteur: Mohammad Mohammadamini, Driss Matrouf, Jean-François Bonastre, Sandipana Dowerah, Romain Serizel, Denis Jouvet
article: Odyssey 2022 : The Speaker and Language Recognition Workshop, Jun 2022, Beijing, China
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Threshold independent evaluation of sound event detection scores
auteur: Janek Ebbers, Reinhold Haeb-Umbach, Romain Serizel
article: ICASSP 2022 – IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022, Singapore, Singapore. ⟨10.1109/ICASSP43922.2022.9747556⟩
typdoc: Conference papers
DOI: DOI : 10.1109/ICASSP43922.2022.9747556
Accès au texte intégral et bibtex

titre: A benchmark of state-of-the-art sound event detection systems evaluated on synthetic soundscapes
auteur: Francesca Ronchini, Romain Serizel
article: ICASSP 2022 – IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022, Singapore/Virtual, Singapore. ⟨10.1109/ICASSP43922.2022.9747577⟩
typdoc: Conference papers
DOI: DOI : 10.1109/ICASSP43922.2022.9747577
Accès au texte intégral et bibtex

titre: Evaluation de l’audibilité ressentie des alarmes sonores dans le bruit
auteur: Jean-Pierre Arz, François Effa, Nicolas Grimault, Romain Serizel
article: 16ème Congrès Français d’Acoustique, CFA2022, Société Française d’Acoustique; Laboratoire de Mécanique et d’Acoustique, Apr 2022, Marseille, France
typdoc: Conference papers
Accès au bibtex

titre: Modélisation de la détection d’alarmes sonores dans le bruit
auteur: François Effa, Jean-Pierre Arz, Nicolas Grimault, Ossen El Sawaf, Romain Serizel
article: 16ème Congrès Français d’Acoustique, CFA2022, Société Française d’Acoustique; Laboratoire de Mécanique et d’Acoustique, Apr 2022, Marseille, France
typdoc: Conference papers
Accès au bibtex

Habilitation à diriger des recherches

titre: Contributions to speech processing and ambient sound analysis
auteur: Romain Serizel
article: Computer Science [cs]. Université de Lorraine, 2022
typdoc: Habilitation à diriger des recherches
Accès au texte intégral et bibtex

Proceedings

titre: Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2022)
auteur: Mathieu Lagrange, Annamaria Mesaros, Thomas Pellegrini, Gael Richard, Romain Serizel, Dan Stowell
article: Tampere University, pp.1-225, 2022, 978-952-03-2677-7
typdoc: Proceedings
Accès au texte intégral et bibtex

Preprints, Working Papers, …

titre: Towards an efficient computation of masks for multichannel speech enhancement
auteur: Louis Delebecque, Romain Serizel, Nicolas Furnon
article: 2022
typdoc: Preprints, Working Papers, …
Accès au texte intégral et bibtex

titre: Le comportement des systèmes de reconnaissance du locuteur de l’état de l’art face aux variabiliés acoustiques
auteur: Mohammad Mohammadamini, Driss Matrouf, Jean-François Bonatsre, Sandipana Dowerah, Romain Serizel, Denis Jouvet
article: 2022
typdoc: Preprints, Working Papers, …
Accès au texte intégral et bibtex

2021

Journal articles

titre: DNN-based mask estimation for distributed speech enhancement in spatially unconstrained microphone arrays
auteur: Nicolas Furnon, Romain Serizel, Slim Essid, Irina Illina
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2021, 29, pp.2310 – 2323. ⟨10.1109/TASLP.2021.3092838⟩
typdoc: Journal articles
DOI: DOI : 10.1109/TASLP.2021.3092838
Accès au texte intégral et bibtex

Conference papers

titre: Automated audio captioning by fine-tuning bart with audioset tags
auteur: Félix Gontier, Romain Serizel, Christophe Cerisara
article: DCASE 2021 – 6th Workshop on Detection and Classification of Acoustic Scenes and Events, Nov 2021, Virtual, Spain
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: The impact of non-target events in synthetic soundscapes for sound event detection
auteur: Francesca Ronchini, Romain Serizel, Nicolas Turpault, Samuele Cornell
article: DCASE 2021 – Detection and Classification of Acoustic Scenes and Events, Nov 2021, Barcelona/Virtual, Spain
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes
auteur: Nicolas Furnon, Romain Serizel, Slim Essid, Irina Illina
article: EUSIPCO 2021 – 29th European Signal Processing Conference, IEEE, Aug 2021, Dublin / Virtual, Ireland. ⟨10.23919/EUSIPCO54536.2021.9616358⟩
typdoc: Conference papers
DOI: DOI : 10.23919/EUSIPCO54536.2021.9616358
Accès au texte intégral et bibtex

titre: Compensate multiple distortions for speaker recognition systems
auteur: Mohammad Mohammadamini, Driss Matrouf, Jean-Francois Bonastre, Romain Serizel, Sandipana Dowerah, Denis Jouvet
article: EUSIPCO 2021 – 29th European Signal Processing Conference, Aug 2021, Dublin / Virtual, Ireland. ⟨10.23919/EUSIPCO54536.2021.9615983⟩
typdoc: Conference papers
DOI: DOI : 10.23919/EUSIPCO54536.2021.9615983
Accès au texte intégral et bibtex

titre: Sound Event Detection and Separation: a Benchmark on Desed Synthetic Soundscapes
auteur: Nicolas Turpault, Romain Serizel, Scott Wisdom, Hakan Erdogan, John R Hershey, Eduardo Fonseca, Prem Seetharaman, Justin Salamon
article: ICASSP 2021 – 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto/Virtual, Canada. ⟨10.1109/ICASSP39728.2021.9414789⟩
typdoc: Conference papers
DOI: DOI : 10.1109/ICASSP39728.2021.9414789
Accès au texte intégral et bibtex

titre: What’s All the FUSS About Free Universal Sound Separation Data?
auteur: Scott Wisdom, Hakan Erdogan, Daniel P W Ellis, Romain Serizel, Nicolas Turpault, Eduardo Fonseca, Justin Salamon, Prem Seetharaman, John R Hershey
article: ICASSP 2021 – 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto/Virtual, Canada. ⟨10.1109/ICASSP39728.2021.9414774⟩
typdoc: Conference papers
DOI: DOI : 10.1109/ICASSP39728.2021.9414774
Accès au texte intégral et bibtex

titre: Distributed speech separation in spatially unconstrained microphone arrays
auteur: Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid
article: ICASSP 2021 – 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto / Virtual, Canada. ⟨10.1109/ICASSP39728.2021.9414758⟩
typdoc: Conference papers
DOI: DOI : 10.1109/ICASSP39728.2021.9414758
Accès au texte intégral et bibtex

titre: Improving Sound Event Detection Metrics: Insights from DCASE 2020
auteur: Giacomo Ferroni, Nicolas Turpault, Juan Azcarreta, Francesco Tuveri, Romain Serizel, Çagdaş Bilen, Sacha Krstulović
article: ICASSP 2021 – 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto/Virtual, Canada. ⟨10.1109/ICASSP39728.2021.9414711⟩
typdoc: Conference papers
DOI: DOI : 10.1109/ICASSP39728.2021.9414711
Accès au texte intégral et bibtex

titre: UIAI System for Short-Duration Speaker Verification Challenge 2020
auteur: Md Sahidullah, Achintya Kumar Sarkar, Ville Vestman, Xuechen Liu, Romain Serizel, Tomi Kinnunen, Zheng-Hua Tan, Emmanuel Vincent
article: SLT 2021 – IEEE Spoken Language Technology Workshop, IEEE, Jan 2021, Shenzhen / Virtual, China. ⟨10.1109/SLT48900.2021.9383596⟩
typdoc: Conference papers
DOI: DOI : 10.1109/SLT48900.2021.9383596
Accès au texte intégral et bibtex

titre: Foreground-Background Ambient Sound Scene Separation
auteur: Michel Olvera, Emmanuel Vincent, Romain Serizel, Gilles Gasso
article: EUSIPCO 2020 – 28th European Signal Processing Conference, Jan 2021, Amsterdam / Virtual, Netherlands. ⟨10.23919/Eusipco47968.2020.9287436⟩
typdoc: Conference papers
DOI: DOI : 10.23919/Eusipco47968.2020.9287436
Accès au texte intégral et bibtex

Preprints, Working Papers, …

titre: SAMbA: Speech enhancement with Asynchronous ad-hoc Microphone Arrays
auteur: Nicolas Furnon, Romain Serizel, Slim Essid, Irina Illina
article: 2021
typdoc: Preprints, Working Papers, …
DOI: DOI : 10.48550/arXiv.2307.16582
Accès au texte intégral et bibtex

titre: MULTICHANNEL SPEECH ENHANCEMENT FOR SPEAKER VERIFICATION IN NOISY AND REVERBERANT ENVIRONMENTS
auteur: Sandipana Dowerah, Romain Serizel, Denis Jouvet, Mohammad Mohammadamini, Driss Matrouf
article: 2021
typdoc: Preprints, Working Papers, …
Accès au bibtex

titre: Analysis of weak labels for sound event tagging
auteur: Nicolas Turpault, Romain Serizel, Emmanuel Vincent
article: 2021
typdoc: Preprints, Working Papers, …
Accès au texte intégral et bibtex

2020

Journal articles

titre: Joint NN-Supported Multichannel Reduction of Acoustic Echo, Reverberation and Noise
auteur: Guillaume Carbajal, Romain Serizel, Emmanuel Vincent, Eric Humbert
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2020, ⟨10.1109/TASLP.2020.3008974⟩
typdoc: Journal articles
DOI: DOI : 10.1109/TASLP.2020.3008974
Accès au texte intégral et bibtex

Conference papers

titre: Improving Sound Event Detection In Domestic Environments Using Sound Separation
auteur: Nicolas Turpault, Scott Wisdom, Hakan Erdogan, John R Hershey, Romain Serizel, Eduardo Fonseca, Prem Seetharaman, Justin Salamon
article: DCASE Workshop 2020 – Detection and Classification of Acoustic Scenes and Events, Nov 2020, Tokyo / Virtual, Japan
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Training Sound Event Detection On A Heterogeneous Dataset
auteur: Nicolas Turpault, Romain Serizel
article: DCASE Workshop, Nov 2020, Tokyo, Japan
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: DNN-Based Distributed Multichannel Mask Estimation for Speech Enhancement in Microphone Arrays
auteur: Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid
article: ICASSP 2020 – 45th International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Sound event detection in synthetic domestic environments
auteur: Romain Serizel, Nicolas Turpault, Ankit Shah, Justin Salamon
article: ICASSP 2020 – 45th International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Limitations of weak labels for embedding and tagging
auteur: Nicolas Turpault, Romain Serizel, Emmanuel Vincent
article: ICASSP 2020 – 45th International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: A brief introduction to multichannel noise reduction with deep neural networks
auteur: Romain Serizel
article: SpiN 2020 – 12th Speech in Noise Workshop, Jan 2020, Toulouse, France
typdoc: Conference papers
Accès au texte intégral et bibtex

2019

Journal articles

titre: Audio-Based Search and Rescue with a Drone: Highlights from the IEEE Signal Processing Cup 2019 Student Competition
auteur: Antoine Deleforge, Diego Di Carlo, Martin Strauss, Romain Serizel, Lucio Marcenaro
article: IEEE Signal Processing Magazine, 2019, 36 (5), pp.138-144. ⟨10.1109/MSP.2019.2924687⟩
typdoc: Journal articles
DOI: DOI : 10.1109/MSP.2019.2924687
Accès au texte intégral et bibtex

titre: CRNN-based multiple DoA estimation using acoustic intensity features for Ambisonics recordings
auteur: Lauréline Perotin, Romain Serizel, Emmanuel Vincent, Alexandre Guérin
article: IEEE Journal of Selected Topics in Signal Processing, 2019, Special Issue on Acoustic Source Localization and Tracking in Dynamic Real-life Scenes, 13 (1), pp.22-33. ⟨10.1109/jstsp.2019.2900164⟩
typdoc: Journal articles
DOI: DOI : 10.1109/jstsp.2019.2900164
Accès au texte intégral et bibtex

Conference papers

titre: Regression versus classification for neural network based audio source localization
auteur: Lauréline Perotin, Alexandre Défossez, Emmanuel Vincent, Romain Serizel, Alexandre Guérin
article: WASPAA 2019 – IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, IEEE, Oct 2019, New Paltz, United States
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Sound event detection in domestic environments with weakly labeled data and soundscape synthesis
auteur: Nicolas Turpault, Romain Serizel, Ankit Parag Shah, Justin Salamon
article: Workshop on Detection and Classification of Acoustic Scenes and Events, Oct 2019, New York City, United States
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Sound Event Detection from Partially Annotated Data: Trends and Challenges
auteur: Romain Serizel, Nicolas Turpault
article: IcETRAN conference, Jun 2019, Srebrno Jezero, Serbia
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Semi-supervised triplet loss based learning of ambient audio embeddings
auteur: Nicolas Turpault, Romain Serizel, Emmanuel Vincent
article: ICASSP 2019, May 2019, Brighton, United Kingdom
typdoc: Conference papers
Accès au texte intégral et bibtex

Reports

titre: Joint NN-Supported Multichannel Reduction of Acoustic Echo, Reverberation and Noise: Supporting Document
auteur: Guillaume Carbajal, Romain Serizel, Emmanuel Vincent, Eric Humbert
article: [Research Report] RR-9303, INRIA Nancy; Invoxia SAS. 2019
typdoc: Reports
Accès au texte intégral et bibtex

Preprints, Working Papers, …

titre: The Speed Submission to DIHARD II: Contributions & Lessons Learned
auteur: Md Sahidullah, Jose Patino, Samuele Cornell, Ruiqing Yin, Sunit Sivasankaran, Hervé Bredin, Pavel Korshunov, Alessio Brutti, Romain Serizel, Emmanuel Vincent, Nicholas Evans, Sébastien Marcel, Stefano Squartini, Claude Barras
article: 2019
typdoc: Preprints, Working Papers, …
Accès au texte intégral et bibtex

2018

Journal articles

titre: Rank-1 Constrained Multichannel Wiener Filter for Speech Recognition in Noisy Environments
auteur: Ziteng Wang, Emmanuel Vincent, Romain Serizel, Yonghong Yan
article: Computer Speech and Language, 2018, 49, pp.37-51. ⟨10.1016/j.csl.2017.11.003⟩
typdoc: Journal articles
DOI: DOI : 10.1016/j.csl.2017.11.003
Accès au texte intégral et bibtex

Conference papers

titre: Large-Scale Weakly Labeled Semi-Supervised Sound Event Detection in Domestic Environments
auteur: Romain Serizel, Nicolas Turpault, Hamid Eghbal-Zadeh, Ankit Parag Shah
article: Workshop on Detection and Classification of Acoustic Scenes and Events, Nov 2018, Woking, United Kingdom
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: CRNN-based joint azimuth and elevation localization with the Ambisonics intensity vector
auteur: Lauréline Perotin, Romain Serizel, Emmanuel Vincent, Alexandre Guérin
article: IWAENC 2018 – 16th International Workshop on Acoustic Signal Enhancement, Sep 2018, Tokyo, Japan
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition
auteur: Mathieu Fontaine, Fabian-Robert Stöter, Antoine Liutkus, Umut Şimşekli, Romain Serizel, Roland Badeau
article: LVA/ICA 2018 – 14th International Conference on Latent Variable Analysis and Signal Separation, Jul 2018, Surrey, United Kingdom. pp.13-23, ⟨10.1007/978-3-319-93764-9_2⟩
typdoc: Conference papers
DOI: DOI : 10.1007/978-3-319-93764-9_2
Accès au texte intégral et bibtex

titre: Multiple-input neural network-based residual echo suppression
auteur: Guillaume Carbajal, Romain Serizel, Emmanuel Vincent, Eric Humbert
article: ICASSP 2018 – IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Canada. pp.1-5
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Multichannel speech separation with recurrent neural networks from high-order ambisonics recordings
auteur: Lauréline Perotin, Romain Serizel, Emmanuel Vincent, Alexandre Guérin
article: 43rd IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2018), Apr 2018, Calgary, Canada
typdoc: Conference papers
Accès au texte intégral et bibtex

2017

Journal articles

titre: Feature Learning with Matrix Factorization Applied to Acoustic Scene Classification
auteur: Victor Bisot, Romain Serizel, Slim Essid, Gael Richard
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2017, 25 (6), pp.1216 – 1229. ⟨10.1109/TASLP.2017.2690570⟩
typdoc: Journal articles
DOI: DOI : 10.1109/TASLP.2017.2690570
Accès au texte intégral et bibtex

Conference papers

titre: Nonnegative Feature Learning Methods for Acoustic Scene Classification
auteur: Victor Bisot, Romain Serizel, Slim Essid, Gael Richard
article: DCASE 2017 – Workshop on Detection and Classification of Acoustic Scenes and Events, Nov 2017, Munich, Germany
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Leveraging deep neural networks with nonnegative representations for improved environmental sound classification
auteur: Victor Bisot, Romain Serizel, Slim Essid, Gael Richard
article: IEEE International Workshop on Machine Learning for Signal Processing MLSP, Sep 2017, Tokyo, Japan
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Supervised Group Nonnegative Matrix Factorisation With Similarity Constraints And Applications To Speaker Identification
auteur: Romain Serizel, Victor Bisot, Slim Essid, Gael Richard
article: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Mar 2017, New Orleans, United States
typdoc: Conference papers
Accès au texte intégral et bibtex

Book sections

titre: Acoustic Features for Environmental Sound Analysis
auteur: Romain Serizel, Victor Bisot, Slim Essid, Gael Richard
article: Tuomas Virtanen; Mark D. Plumbley; Dan Ellis. Computational Analysis of Sound Scenes and Events, Springer International Publishing AG, pp.71-101, 2017, 978-3-319-63449-4. ⟨10.1007/978-3-319-63450-0_4⟩
typdoc: Book sections
DOI: DOI : 10.1007/978-3-319-63450-0_4
Accès au texte intégral et bibtex

titre: Multiview approaches to event detection and scene analysis
auteur: Slim Essid, Sanjeel Parekh, Ngoc Q. K. Duong, Romain Serizel, Alexey Ozerov, Fabio Antonacci, Augusto Sarti
article: Tuomas Virtanen; Mark D. Plumbley; Dan Ellis. Computational Analysis of Sound Scenes and Events, Springer, pp.243-276, 2017, 978-3319634494. ⟨10.1007/978-3-319-63450-0_9⟩
typdoc: Book sections
DOI: DOI : 10.1007/978-3-319-63450-0_9
Accès au texte intégral et bibtex

titre: Multiview Approaches to Event Detection and Scene Analysis
auteur: Slim Essid, Sanjeel Parekh, Ngoc Q. K. Duong, Romain Serizel, Alexey Ozerov, Fabio Antonacci, Augusto Sarti
article: Computational Analysis of Sound Scenes and Events, Springer International Publishing AG, 2017
typdoc: Book sections
Accès au bibtex

2016

Journal articles

titre: Deep-neural network approaches for speech recognition with heterogeneous groups of speakers including children
auteur: Romain Serizel, Diego Giuliani
article: Natural Language Engineering, 2016, 1, pp.0 – 0
typdoc: Journal articles
Accès au texte intégral et bibtex

Conference papers

titre: Machine listening techniques as a complement to video image analysis in forensics
auteur: Romain Serizel, Victor Bisot, Slim Essid, Gael Richard
article: IEEE International Conference on Image Processing, Sep 2016, Phoenix, AZ, United States. pp.948-952, ⟨10.1109/ICIP.2016.7532497⟩
typdoc: Conference papers
DOI: DOI : 10.1109/ICIP.2016.7532497
Accès au texte intégral et bibtex

titre: Mini-batch stochastic approaches for accelerated multiplicative updates in nonnegative matrix factorisation with beta-divergence
auteur: Romain Serizel, Slim Essid, Gael Richard
article: IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2016), Sep 2016, Salerne, Italy
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: SUPERVISED NONNEGATIVE MATRIX FACTORIZATION FOR ACOUSTIC SCENE CLASSIFICATION
auteur: Victor Bisot, Romain Serizel, Slim Essid, Gael Richard
article: IEEE international evaluation campaign on detection and classification of acousitc scenes and events (DCASE 2016), Sep 2016, Budapest, Hungary
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Acoustic scene classification with matrix factorization for unsupervised feature learning
auteur: Victor Bisot, Romain Serizel, Slim Essid, Gael Richard
article: ICASSP, Mar 2016, Shangai, China
typdoc: Conference papers
Accès au bibtex

titre: Group Non-Negative Matrix Factorisation With Speaker And Session Similarity Constraints For Speaker Identification
auteur: Romain Serizel, Slim Essid, Gael Richard
article: IEEE International Conference on Acoustics, Speech, and Signal Processing, Mar 2016, Shangai, China
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identification
auteur: Romain Serizel, Slim Essid, Gael Richard
article: ICASSP, Mar 2016, Shangai, China. pp.5470 – 5474
typdoc: Conference papers
Accès au bibtex

2015

Conference papers

titre: A brief introduction to deep neural networks and their application to automatic speech recognition
auteur: Romain Serizel
article: Séminaire de l’équipe Perception, Feb 2015, Grenoble, France
typdoc: Conference papers
Accès au bibtex

2014

Journal articles

titre: Low-rank Approximation Based Multichannel Wiener Filter Algorithms for Noise Reduction with Application in Cochlear Implants
auteur: Romain Serizel, Marc Moonen, Bas van Dijk, Jan Wouters
article: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2014, 22, pp.785 – 799. ⟨10.1109/TASLP.2014.2304240⟩
typdoc: Journal articles
DOI: DOI : 10.1109/TASLP.2014.2304240
Accès au texte intégral et bibtex

Conference papers

titre: Vocal tract length normalisation approaches to DNN-based children’s and adults’ speech recognition
auteur: Romain Serizel, Diego Giuliani
article: 2014 IEEE Spoken Language Technology Workshop (SLT 2014), Dec 2014, South Lake Tahoe, CA, United States. pp.135-140, ⟨10.1109/SLT.2014.7078563⟩
typdoc: Conference papers
DOI: DOI : 10.1109/SLT.2014.7078563
Accès au texte intégral et bibtex

titre: Deep neural network adaptation for children’s and adults’ speech recognition
auteur: Romain Serizel, Diego Giuliani
article: Italian Computational Linguistics Conference (CLiC-it), Dec 2014, Pise, Italy
typdoc: Conference papers
Accès au texte intégral et bibtex

2013

Journal articles

titre: A Speech Distortion Weighting Based Approach to Integrated Active Noise Control and Noise Reduction in Hearing Aids
auteur: Romain Serizel, Marc Moonen, Jan Wouters, Søren Holdt Jensen
article: Signal Processing, 2013, 93 (9), pp.2440-2452
typdoc: Journal articles
Accès au texte intégral et bibtex

titre: Binaural Integrated Active Noise Control and Noise Reduction in Hearing Aids
auteur: Romain Serizel, Marc Moonen, Jan Wouters, Søren Holdt Jensen
article: IEEE Transactions on Audio, Speech and Language Processing, 2013, 21 (5), pp.1113-1118
typdoc: Journal articles
Accès au texte intégral et bibtex

Conference papers

titre: Rank-1 Approximation Based Multichannel Wiener Filtering Algorithms For Noise Reduction In Cochlear Implants
auteur: Romain Serizel, Marc Moonen, Bas Van Dijk, Jan Wouters
article: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2013, Vancouver, Canada
typdoc: Conference papers
Accès au texte intégral et bibtex

2012

Journal articles

titre: A Zone of Quiet Based Approach to Integrated Active Noise Control and Noise Reduction for Speech Enhancement in Hearing Aids
auteur: Romain Serizel, Marc Moonen, Jan Wouters, Søren Holdt Jensen
article: IEEE Transactions on Audio, Speech and Language Processing, 2012, 20 (6), pp.1685 – 1697
typdoc: Journal articles
Accès au texte intégral et bibtex

2011

Journal articles

titre: Output SNR analysis of integrated active noise control and noise reduction in hearing aids under a single speech source scenario
auteur: Romain Serizel, Marc Moonen, Jan Wouters, Søren Holdt Jensen
article: Signal Processing, 2011, 91 (8), pp.1719-1729
typdoc: Journal articles
Accès au texte intégral et bibtex

2010

Conference papers

titre: Output SNR analysis of integrated active noise control and noise reduction in hearing aids under a single speech source scenario
auteur: Romain Serizel, Marc Moonen, Jan Wouters, Søren Holdt Jensen
article: European Signal Processing Conference (EUSIPCO), Aug 2010, Aalborg, Denmark
typdoc: Conference papers
Accès au texte intégral et bibtex

2009

Conference papers

titre: A Zone of Quiet Based Approach to Integrated Active Noise Control and Noise Reduction in Hearing Aids
auteur: Romain Serizel, Marc Moonen, Jan Wouters, Søren Holdt Jensen
article: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2009, New Paltz, United States
typdoc: Conference papers
Accès au texte intégral et bibtex

titre: A Weighted Approach for Integrated Active Noise Control and Noise Reduction in Hearing Aids
auteur: Romain Serizel, Marc Moonen, Jan Wouters, Søren Holdt Jensen
article: European Signal Processing Conference (EUSIPCO), Aug 2009, Glasgow, United Kingdom
typdoc: Conference papers
Accès au texte intégral et bibtex

2008

Journal articles

titre: Accuracy Constraint Determination in Fixed-Point System Design
auteur: Daniel Menard, Romain Serizel, Romuald Rocher, Olivier Sentieys
article: EURASIP Journal on Embedded Systems, 2008, 2008 (1), ⟨10.1155/2008/242584⟩
typdoc: Journal articles
DOI: DOI : 10.1155/2008/242584
Accès au bibtex

Conference papers

titre: Combined Active Noise Control and noise reduction in Hearing Aids
auteur: Romain Serizel, Marc Moonen, Jan Wouters, Søren Holdt Jensen
article: International Workshop on Acoustic Echo and Noise Control (IWAENC), Sep 2008, Seattle, United States
typdoc: Conference papers
Accès au texte intégral et bibtex

2007

Conference papers

titre: Noise model for Accuracy Constraint Determination in Fixed-Point Systems
auteur: Daniel Menard, Romain Serizel, Romuald Rocher, Olivier Sentieys
article: Workshop on Design and Architectures for Signal and Image Processing DASIP 2007, Nov 2007, Grenoble, France
typdoc: Conference papers
Accès au bibtex