Slim Essid's research page

ADASP reseach group | S²A team | LTCI lab | Télécom Paris | Institut Polytechnique de Paris

me.jpg

Research interests

Machine learning and artificial intelligence for temporal data analysis, especially:

  • multimodal and multiview deep learning;
  • representation learning, in particular self-supervised learning;
  • structured prediction;

with applications to:

  • machine listening, music content analysis (MIR) and speech processing;
  • multimedia content analysis, especially joint audiovisual data analysis;
  • multimodal perception, human behaviour analysis and affective computing, including EEG data analysis.

For more information about my research activities check my publications. You can also read about the research projects I have been involved in, including those of the PhD students and post-docs I have advised.

Short bio

Slim Essid is Full Professor of Télécom Paris and the coordinator of the Audio Data Analysis and Signal Processing (ADASP) group. He received the state engineering degree from the École Nationale d’Ingénieurs de Tunis in 2001; the M.Sc. (D.E.A.) degree in digital communication systems from the École Nationale Supérieure des Télécommunications, Paris, France, in 2002; the Ph.D. degree from the Université Pierre et Marie Curie (UPMC), in 2005; and the habilitation (HDR) degree from UPMC in 2015.

Over the past 15 years, he has been involved in various French and European research projects. He has collaborated with 14 post-docs and has graduated 15 PhD students; he is currently co-advising 10 others. He has published over 150 peer-reviewed conference and journal papers with more than 100 distinct co-authors. On a regular basis he serves as a reviewer for various machine learning, signal processing, audio and multimedia conferences and journals, for instance various IEEE transactions, and as an expert for research funding agencies.

Selected recent publications

  1. benigmim_cvpr-24.png
    COLLABORATING FOUNDATION MODELS FOR DOMAIN GENERALIZED SEMANTIC SEGMENTATION
    Y. Benigmim , S. Roy , S. Essid, V. Kalogeiton , and S. Lathuilière
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024) , 2024
    Accepted
  2. buisson_taslp-24.png
    SELF-SUPERVISED LEARNING OF MULTI-LEVEL AUDIO REPRESENTATIONS FOR MUSIC SEGMENTATION
    M. Buisson , B. Mcfee , S. Essid, and H. Crayencour
    IEEE/ACM Transactions on Audio, Speech and Language Processing, Mar 2024
    Accepted
  3. letzelter_neurips-23.png
    RESILIENT MULTIPLE CHOICE LEARNING: A LEARNED SCORING SCHEME WITH APPLICATION TO AUDIO SCENE ANALYSIS
    V. Letzelter , M. Fontaine , P. Perez , G. Richard , S. Essid, and M. Chen
    In Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023) , Dec 2023
  4. zaiem_jstsp-23.png
    PRETEXT TASKS SELECTION FOR MULTITASK SELF-SUPERVISED AUDIO REPRESENTATION LEARNING
    S. Zaiem , T. Parcollet , S. Essid, and A. Heba
    IEEE Journal of Selected Topics in Signal Processing, Dec 2022
  5. furnon_taslp-21.png
    DNN-BASED MASK ESTIMATION FOR DISTRIBUTED SPEECH ENHANCEMENT IN SPATIALLY UNCONSTRAINED MICROPHONE ARRAYS
    N. Furnon , R. Serizel , S. Essid, and I. Illina
    IEEE/ACM Transactions on Audio, Speech and Language Processing, Dec 2021
  6. parekh_taslp-19.png
    WEAKLY SUPERVISED REPRESENTATION LEARNING FOR AUDIO-VISUAL SCENE ANALYSIS
    S. Parekh , S. Essid, A. Ozerov , N. Duong , P. Pérez , and G. Richard
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, Dec 2019

Contact

Télécom Paris - Room 5C
19, place Marguerite Perey 91120 Palaiseau - FRANCE
Indications on how to get there can be found here.