Publications

The up-to-date list of my publications can be found on Google Scholar .

2023

  1. Efficient Large-scale Audio Tagging via Transformer-to-CNN Knowledge Distillation
    Florian Schmid, Khaled Koutini, and Gerhard Widmer
    In 2023 IEEE international conference on acoustics, speech and signal processing (ICASSP), Jun 2023

2022

  1. Learning General Audio Representations With Large-Scale Training of Patchout Audio Transformers
    Khaled Koutini, Shahed Masoudian, Florian Schmid, Hamid Eghbal-zadeh, Jan Schlüter, and Gerhard Widmer
    NeurIPS challenge, Holistic Evaluation of Audio Representations (HEAR). Proceedings of Machine Learning Research, Jun 2022
  2. Efficient Training of Audio Transformers with Patchout
    Khaled Koutini, Jan Schlüter, Hamid Eghbal-zadeh, and Gerhard Widmer
    In Interspeech 2022, 23nd Annual Conference of the International Speech Communication Association, Jun 2022
  3. CP-JKU Submission to DCASE22: Distilling Knowledge for Low-Complexity Convolutional Neural Networks From a Patchout Audio Transformer
    Florian Schmid, Shahed Masoudian, Khaled Koutini, and Gerhard Widmer
    Jun 2022
  4. Knowledge Distillation from Transformers for Low-Complexity Acoustic Scene Classification
    Florian Schmid, Shahed Masoudian, Khaled Koutini, and Gerhard Widmer
    In Proceedings of the Detection and Classification of Acoustic Scenes and Events 2022 Workshop, Nov 2022

2021

  1. Receptive Field Regularization Techniques for Audio Classification and Tagging with Deep Convolutional Neural Networks
    Khaled Koutini, Hamid Eghbal-zadeh, and Gerhard Widmer
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, Nov 2021
  2. Over-Parameterization and Generalization in Audio Classification
    Khaled Koutini, Hamid Eghbal-zadeh, Florian Henkel, Jan Schlüter, and Gerhard Widmer
    In The International Conference of Machine Learning ICML Workshop on Overparameterization Pitfalls and Opportunities, Nov 2021
  3. CPJKU Submission to DCASE21: Cross-Device Audio Scene Classification with Wide Sparse Frequency-Damped CNNs
    Khaled Koutini, Schlüter Jan, and Gerhard Widmer
    Jun 2021

2020

  1. CP-JKU Submissions to DCASE’20: Low-Complexity Cross-Device Acoustic Scene Classification with RF-Regularized CNNs
    Khaled Koutini, Florian Henkel, Hamid Eghbal-zadeh, and Gerhard Widmer
    Jun 2020
  2. Receptive-Field Regularized CNNs for Music Classification and Tagging
    Khaled Koutini, Hamid Eghbal-Zadeh, Verena Haunschmid, Paul Primus, Shreyan Chowdhury, and Gerhard Widmer
    CoRR, Jun 2020
  3. Low-Complexity Models for Acoustic Scene Classification Based on Receptive Field Regularization and Frequency Damping
    Khaled Koutini, Florian Henkel, Hamid Eghbal-zadeh, and Gerhard Widmer
    In Proceedings of the Detection and Classification of Acoustic Scenes and Events 2020 Workshop, Nov 2020

2019

  1. The Receptive Field as a Regularizer in Deep Convolutional Neural Networks for Acoustic Scene Classification
    Khaled Koutini, Hamid Eghbal-zadeh, Matthias Dorfer, and Gerhard Widmer
    In 27th European Signal Processing Conference, EUSIPCO 2019, A Coruña, Spain, September 2-6, 2019, Nov 2019
  2. Receptive-Field-Regularized CNN Variants for Acoustic Scene Classification
    Khaled Koutini, Hamid Eghbal-zadeh, and Gerhard Widmer
    In Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop , Oct 2019
  3. Acoustic Scene Classification with Reject Option Based on ResNets
    Bernhard Lehner, Khaled Koutini, Christopher Schwarzlmüller, Thomas Gallien, and Gerhard Widmer
    Jun 2019
  4. Exploiting Parallel Audio Recordings to Enforce Device Invariance in CNN-based Acoustic Scene Classification
    Paul Primus, Hamid Eghbal-zadeh, David Eitelsebner, Khaled Koutini, Andreas Arzt, and Gerhard Widmer
    In Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop , Oct 2019
  5. Emotion and Theme Recognition in Music with Frequency-Aware RF-Regularized CNNs
    Khaled Koutini, Shreyan Chowdhury, Verena Haunschmid, Hamid Eghbal-Zadeh, and Gerhard Widmer
    In Proceedings of the MediaEval 2019 Workshop, Sophia Antipolis, France, 27-30 October 2019, Oct 2019
  6. CP-JKU submissions to DCASE’19: Acoustic Scene Classification and Audio Tagging with Receptive-Field-Regularized CNNs
    Khaled Koutini, Hamid Eghbal-zadeh, and Gerhard Widmer
    Jun 2019

2018

  1. Iterative Knowledge Distillation in R-CNNs for Weakly-Labeled Semi-Supervised Sound Event Detection
    Khaled Koutini, Hamid Eghbal-zadeh, and Gerhard Widmer
    In Proceedings of the Detection and Classification of Acoustic Scenes and Events 2018 Workshop, Nov 2018

2017

  1. Classifying Short Acoustic Scenes with I-Vectors and CNNs: Challenges and Optimisations for the 2017 DCASE ASC Task
    Bernhard Lehner, Hamid Eghbal-zadeh, Matthias Dorfer, Filip Korzeniowski, Khaled Koutini, and Gerhard Widmer
    Jun 2017
  2. MediaEval 2017 AcousticBrainz Genre Task: Multilayer Perceptron Approach
    Khaled Koutini, Alina Imenina, Matthias Dorfer, Alexander Gruber, and Markus Schedl
    In Proceedings of the MediaEval 2017 Workshop co-located with the Conference and Labs of the Evaluation Forum (CLEF 2017), Dublin, Ireland, September 13-15, 2017, Jun 2017