PUBLICATIONS

2024

  1. perera_neurips-24.png
    ANNEALED MULTIPLE CHOICE LEARNING: OVERCOMING LIMITATIONS OF WINNER-TAKES-ALL WITH ANNEALING
    D. Perera, V. Letzelter, T. Mariotte, A. Cortes, G. Richard, S. Essid, and M. Chen
    In Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024) , Dec 2024
    Accepted
  2. malard_neurips-24.png
    AN EYE FOR AN EAR: ZERO-SHOT AUDIO DESCRIPTION LEVERAGING AN IMAGE CAPTIONER WITH AUDIO-VISUAL TOKEN DISTRIBUTION MATCHING
    H. Malard, M. Olvera, S. Lathuilière, and S. Essid
    In Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024) , Dec 2024
    Accepted
  3. A SOUND DESCRIPTION: EXPLORING PROMPT TEMPLATES AND CLASS DESCRIPTIONS TO ENHANCE ZERO-SHOT AUDIO CLASSIFICATION
    M. Olvera, P. Stamatiadias, and S. Essid
    In International workshop on Detection and Classiffication of Acoustic Scenes and Events (DCASE) , Nov 2024
    Accepted
  4. SALT: STANDARDIZED AUDIO EVENT LABEL TAXONOMY
    P. Stamatiadias, M. Olvera, and S. Essid
    In International workshop on Detection and Classiffication of Acoustic Scenes and Events (DCASE) , Nov 2024
    Accepted
  5. larger_probes_speech-24.png
    SPEECH SELF-SUPERVISED REPRESENTATIONS BENCHMARKING: A CASE FOR LARGER PROBING HEADS
    S. Zaiem, Y. Kemiche, T. Parcollet, S. Essid, and M. Ravanelli
    Computer Speech & Language, Nov 2024
  6. A CONTRASTIVE SELF-SUPERVISED LEARNING SCHEME FOR BEAT TRACKING AMENABLE TO FEW-SHOT LEARNING
    A. Gagneré, S. Essid, and G. Peeters
    In International conference on music information retrieval (ISMIR 2024) , Nov 2024
    Accepted
  7. MUSIC STRUCTURE ANALYSIS WITH EDGE-CONDITIONED GRAPH ATTENTION NETWORKS
    M. Buisson, B. McFee, and S. Essid
    In International conference on music information retrieval (ISMIR 2024) , Nov 2024
    Accepted
  8. INVARIANCE-BASED LAYER REGULARIZATION FOR SOUND EVENT DETECTION
    D. Perera, S. Essid, and G. Richard
    In European signal processing conference (EUSIPCO 2024) , Aug 2024
  9. letzelter_icml-24.png
    WINNER-TAKES-ALL LEARNERS ARE GEOMETRY-AWARE CONDITIONAL DENSITY ESTIMATORS
    V. Letzelter, D. Perera, C. Rommel, M. Fontaine, S. Essid, G. Richard, and P. Pérez
    In International Conference on Machine Learning (ICML 2024) , Jul 2024
  10. ADAPTING PITCH-BASED SELF SUPERVISED LEARNING MODELS FOR TEMPO ESTIMATION
    A. Gagneré, S. Essid, and G. Peeters
    In ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , Jul 2024
  11. benigmim_cvpr-24.png
    COLLABORATING FOUNDATION MODELS FOR DOMAIN GENERALIZED SEMANTIC SEGMENTATION
    Y. Benigmim, S. Roy, S. Essid, V. Kalogeiton, and S. Lathuilière
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024) , Jul 2024
  12. buisson_taslp-24.png
    SELF-SUPERVISED LEARNING OF MULTI-LEVEL AUDIO REPRESENTATIONS FOR MUSIC SEGMENTATION
    M. Buisson, B. Mcfee, S. Essid, and H. Crayencour
    IEEE/ACM Transactions on Audio, Speech and Language Processing, Mar 2024
  13. ON THE CHOICE OF THE OPTIMAL TEMPORAL SUPPORT FOR AUDIO CLASSIFICATION WITH PRE-TRAINED EMBEDDINGS
    A. Quelennec, M. Olvera, G. Peeters, and S. Essid
    In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024) , Apr 2024
  14. ONLINE SPEAKER DIARIZATION OF MEETINGS GUIDED BY SPEECH SEPARATION
    E. Gruttadauria, M. Fontaine, and S. Essid
    In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024) , Apr 2024

2023

  1. A REPETITION-BASED TRIPLET MINING APPROACH FOR MUSIC SEGMENTATION
    M. Buisson, B. McFee, S. Essid, and H. Crayencour
    In International Society for Music Information Retrieval (ISMIR) , Nov 2023
  2. letzelter_neurips-23.png
    RESILIENT MULTIPLE CHOICE LEARNING: A LEARNED SCORING SCHEME WITH APPLICATION TO AUDIO SCENE ANALYSIS
    V. Letzelter, M. Fontaine, P. Perez, G. Richard, S. Essid, and M. Chen
    In Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023) , Dec 2023
  3. SPEECH SELF-SUPERVISED REPRESENTATION BENCHMARKING: ARE WE DOING IT RIGHT?
    S. Zaiem, T. Parcollet, and S. Essid
    In Interspeech , Aug 2023
  4. AUTOMATIC DATA AUGMENTATION FOR DOMAIN ADAPTED FINE-TUNING OF SELF-SUPERVISED SPEECH REPRESENTATIONS
    S. Zaiem, T. Parcollet, and S. Essid
    In Interspeech , Aug 2023
  5. ONE-SHOT UNSUPERVISED DOMAIN ADAPTATION WITH PERSONALIZED DIFFUSION MODELS
    Y. Benigmim, S. Roy, S. Essid, V. Kalogeiton, and S. Lathuiliere
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops , Jun 2023
  6. COSMOPOLITE SOUND MONITORING (COSMO): A STUDY OF URBAN SOUND EVENT DETECTION SYSTEMS GENERALIZING TO MULTIPLE CITIES
    F. Angulo, S. Essid, G. Peeters, and C. Mietlicki
    In ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , Jun 2023
  7. FINE-TUNING STRATEGIES FOR FASTER INFERENCE USING SPEECH SELF-SUPERVISED MODELS: A COMPARATIVE STUDY
    S. Zaiem, R. Algayres, T. Parcollet, S. Essid, and M. Ravanelli
    In ICASSP 2023 - International Conference on Acoustics, Speech, and Signal Processing , Jun 2023

2022

  1. LATENT AND ADVERSARIAL DATA AUGMENTATION FOR SOUND EVENT DETECTION AND CLASSIFICATION
    D. Perera, S. Essid, and G. Richard
    In International workshop on Detection and Classiffication of Acoustic Scenes and Events (DCASE) , Nov 2022
  2. LATENT AND ADVERSARIAL DATA AUGMENTATION FOR SOUND EVENT DETECTION AND CLASSIFICATION
    D. Perera, S. Essid, and G. Richard
    In International workshop on Detection and Classiffication of Acoustic Scenes and Events (DCASE) , Nov 2022
  3. IMPACT DE PERTURBATIONS INTERNES SUR L ENTRAINEMENT DE RESEAUX PROFONDS POUR LA DETECTION D EVENEMENTS SONORES
    D. Perera, S. Essid, and G. Richard
    In Colloque Francophone de Traitement du Signal et des Images (GRETSI) , Sep 2022
  4. zaiem_jstsp-23.png
    PRETEXT TASKS SELECTION FOR MULTITASK SELF-SUPERVISED AUDIO REPRESENTATION LEARNING
    S. Zaiem, T. Parcollet, S. Essid, and A. Heba
    IEEE Journal of Selected Topics in Signal Processing, Sep 2022
  5. AUTOMATIC DATA AUGMENTATION SELECTION AND PARAMETRIZATION IN CONTRASTIVE SELF-SUPERVISED SPEECH REPRESENTATION LEARNING
    S. Zaiem, T. Parcollet, and S. Essid
    In Proc. Interspeech 2022 , Sep 2022
  6. LEARNING MULTI-LEVEL REPRESENTATIONS FOR HIERARCHICAL MUSIC STRUCTURE ANALYSIS
    M. Buisson, B. McFee, S. Essid, and H. Crayencour
    In International Society for Music Information Retrieval (ISMIR) , Dec 2022
  7. OPINIONS IN INTERACTIONS : NEW ANNOTATIONS OF THE SEMAINE DATABASE
    V. Barrière, C. Clavel, and S. Essid
    In LREC , Jun 2022

2021

  1. furnon_taslp-21.png
    DNN-BASED MASK ESTIMATION FOR DISTRIBUTED SPEECH ENHANCEMENT IN SPATIALLY UNCONSTRAINED MICROPHONE ARRAYS
    N. Furnon, R. Serizel, S. Essid, and I. Illina
    IEEE/ACM Transactions on Audio, Speech and Language Processing, Jun 2021
  2. USER-GUIDED ONE-SHOT DEEP MODEL ADAPTATION FOR MUSIC SOURCE SEPARATION
    g. Cantisani, A. Ozerov, S. Essid, and G. Richard
    In 2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) , Oct 2021
  3. ATTENTION-BASED DISTRIBUTED SPEECH ENHANCEMENT FOR UNCONSTRAINED MICROPHONE ARRAYS WITH VARYING NUMBER OF NODES
    N. Furnon, R. Serizel, S. Essid, and I. Illina
    In European Signal Processing Conference (EUSIPCO) , Aug 2021
  4. DISTRIBUTED SPEECH SEPARATION IN SPATIALLY UNCONSTRAINED MICROPHONE ARRAYS
    N. Furnon, R. Serizel, I. Illina, and S. Essid
    In ICASSP 2021 - 46th International Conference on Acoustics, Speech, and Signal Processing , Jun 2021
  5. NEURO-STEERED MUSIC SOURCE SEPARATION WITH EEG-BASED AUDITORY ATTENTION DECODING AND CONTRASTIVE-NMF
    G. Cantisani, S. Essid, and G. Richard
    In ICASSP 2021 - 46th International Conference on Acoustics, Speech, and Signal Processing , Jun 2021
  6. CONDITIONAL INDEPENDENCE FOR PRETEXT TASK SELECTION IN SELF-SUPERVISED SPEECH REPRESENTATION LEARNING
    S. Zaiem, T. Parcollet, and S. Essid
    In Interspeech , Aug 2021

2020

  1. METHOD AND SYSTEM FOR BROADCASTING A MULTICHANNEL AUDIO STREAM TO TERMINALS OF SPECTATORS ATTENDING A SPORTS EVENT
    R. Blouet, and S. Essid
    Patent Application, Sep 2020
  2. DNN-BASED DISTRIBUTED MULTICHANNEL MASK ESTIMATION FOR SPEECH ENHANCEMENT IN MICROPHONE ARRAYS
    N. Furnon, R. Serizel, I. Illina, and S. Essid
    In ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing , May 2020

2019

  1. parekh_taslp-19.png
    WEAKLY SUPERVISED REPRESENTATION LEARNING FOR AUDIO-VISUAL SCENE ANALYSIS
    S. Parekh, S. Essid, A. Ozerov, N. Duong, P. Pérez, and G. Richard
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, May 2019
  2. ON-THE-FLY DETECTION OF USER ENGAGEMENT DECREASE IN SPONTANEOUS HUMAN-ROBOT INTERACTION
    A. Ben Youssef, G. Varni, S. Essid, and C. Clavel
    International Journal of Social Robotics, Jan 2019
  3. A MULTIMODAL MOVIE REVIEW CORPUS FOR FINE-GRAINED OPINION MINING
    A. Garcia, S. Essid, F. DAlche-Buc, and C. Clavel
    Jan 2019
  4. FROM THE TOKEN TO THE REVIEW: A HIERARCHICAL MULTIMODAL APPROACH TO OPINION MINING
    A. Garcia, P. Colombo, F. DAlche-Buc, S. Essid, and C. Clavel
    In 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing , Nov 2019
  5. MAD-EEG: AN EEG DATASET FOR DECODING AUDITORY ATTENTION TO A TARGET INSTRUMENT IN POLYPHONIC MUSIC
    G. Cantisani, G. Tregoat, S. Essid, and G. Richard
    In Speech, Music and Mind (SMM19), Satellite workshop of Interspeech 2019 , Nov 2019
  6. IDENTIFY, LOCATE AND SEPARATE: AUDIO-VISUAL OBJECT EXTRACTION IN LARGE VIDEO COLLECTIONS USING WEAK SUPERVISION
    S. Parekh, A. Ozerov, S. Essid, N. Duong, P. Perez, and G. Richard
    In 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) , Oct 2019
  7. EEG-BASED DECODING OF AUDITORY ATTENTION TO A TARGET INSTRUMENT IN POLYPHONIC MUSIC
    G. Cantisani, S. Essid, and G. Richard
    In 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) , Oct 2019
  8. SAMBASET: A DATASET OF HISTORICAL SAMBA DE ENREDO RECORDINGS FOR COMPUTATIONAL MUSIC ANALYSIS
    L. Maia, M. Fuentes, L. Biscainho, M. Rocamora, and S. Essid
    In The 20th International Society for Music Information Retrieval Conference , Nov 2019
  9. TRACKING BEATS AND MICROTIMING IN AFRO-LATIN AMERICAN MUSIC USING CONDITIONAL RANDOM FIELDS AND DEEP LEARNING
    M. Fuentes, L. Maia, M. Rocamora, L. Biscainho, H. Crayencour, S. Essid, and J. Bello
    In The 20th International Society for Music Information Retrieval Conference , Nov 2019
  10. A MUSIC STRUCTURE INFORMED DOWNBEAT TRACKING SYSTEM USING SKIP-CHAIN CONDITIONAL RANDOM FIELDS AND DEEP LEARNING
    M. Fuentes, B. McFee, H. Crayencour, S. Essid, and J. Bello
    In IEEE International Conference on Acoustics, Speech and Signal processing , May 2019
  11. AUDIOVISUAL ANALYSIS OF MUSIC PERFORMANCES: OVERVIEW OF AN EMERGING FIELD
    Z. Duan, S. Essid, C. Liem, G. Richard, and G. Sharma
    IEEE Signal Processing Magazine, Jan 2019
  12. EARLY DETECTION OF USER ENGAGEMENT BREAKDOWN IN SPONTANEOUS HUMAN-HUMANOID INTERACTION
    A. Ben Youssef, C. Clavel, and S. Essid
    IEEE Transactions on Affective Computing, Jan 2019

2018

  1. PROCEDE ET SYSTEME DE DIFFUSION D UN FLUX AUDIO MULTICANAL A DES TERMINAUX DE SPECTATEURS ASSISTANT A UN EVENEMENT SPORTIF
    R. Blouet, and S. Essid
    Patent Application, Mar 2018
  2. MEDLEY-SOLOS-DB: A CROSS-COLLECTION DATASET FOR MUSICAL INSTRUMENT RECOGNITION
    V. Lostanlen, C. Cella, R. Bittner, and S. Essid
    Sep 2018
  3. EEG-BASED INTER-SUBJECT CORRELATION SCHEMES IN A STIMULI-SHARED FRAMEWORK: INTERPLAY WITH VALENCE AND AROUSAL
    A. Hajlaoui, M. Chetouani, and S. Essid
    Sep 2018
  4. ANALYSIS OF COMMON DESIGN CHOICES IN DEEP LEARNING SYSTEMS FOR DOWNBEAT TRACKING
    M. Fuentes, B. McFee, H. Crayencour, S. Essid, and J. Bello
    In Proceedings of the 19th International Society for Music Information Retrieval Conference , Sep 2018
  5. MAIN MELODY ESTIMATION WITH SOURCE-FILTER NMF AND CRNN
    D. Basaran, S. Essid, and G. Peeters
    In Proceedings of the 19th International Society for Music Information Retrieval Conference , Sep 2018
  6. A ROBUST AUDIO CLASSIFICATION SYSTEM FOR DETECTING PULMONARY EDEMA
    K. Hong, S. Essid, W. Ser, and D. Foo
    Biomedical Signal Processing and Control, Sep 2018
  7. MULTI-TASK FEATURE LEARNING FOR EEG-BASED EMOTION RECOGNITION USING GROUP NONNEGATIVE MATRIX FACTORIZATION
    A. Hajlaoui, M. Chetouani, and S. Essid
    In The European Signal Processing Conference (EUSIPCO) , Sep 2018
  8. STRUCTURED OUTPUT LEARNING WITH ABSTENTION: APPLICATION TO ACCURATE OPINION PREDICTION
    A. Garcia, S. Essid, C. Clavel, and F. DAlche-Buc
    In International Conference on Machine Learning (ICML) , Jul 2018
  9. WEAKLY SUPERVISED REPRESENTATION LEARNING FOR UNSYNCHRONIZED AUDIO-VISUAL EVENTS
    S. Parekh, S. Essid, A. Ozerov, Q. Duong, P. Perez, and G. Richard
    In CVPR Workshop on Sight and Sound (WSS) , Jun 2018
  10. WEAKLY SUPERVISED REPRESENTATION LEARNING FOR UNSYNCHRONIZED AUDIO-VISUAL EVENTS
    S. Parekh, S. Essid, A. Ozerov, Q. Duong, P. Perez, and G. Richard
    Apr 2018
  11. STRUCTURED OUTPUT LEARNING WITH ABSTENTION: APPLICATION TO ACCURATE OPINION PREDICTION
    A. Garcia, S. Essid, C. Clavel, and F. DAlche-Buc
    Mar 2018
  12. AN ENSEMBLE LEARNING APPROACH TO DETECT EPILEPTIC SEIZURES FROM LONG INTRACRANIAL EEG RECORDINGS
    J. Schiratti, J. Le Douget, M. Le Van Quyen, S. Essid, and A. Gramfort
    In International Conference on Acoustics, Speech, and Signal Processing (ICASSP) , Apr 2018
  13. ATTITUDE CLASSIFICATION IN ADJACENCY PAIRS OF A HUMAN-AGENT INTERACTION WITH HIDDEN CONDITIONAL RANDOM FIELDS
    V. Barriere, C. Clavel, and S. Essid
    In International Conference on Acoustics, Speech, and Signal Processing (ICASSP) , Apr 2018
  14. METHOD FOR AUDIO-VISUAL EVENTS CLASSIFICATION AND LOCALIZATION, AND CORRESPONDING APPARATUS, COMPUTER READABLE PROGRAM, PRODUCT AND COMPUTER READABLE STORAGE MEDIUM
    S. Parekh, S. Essid, A. Ozerov, Q. Duong, P. Perez, and G. Richard
    Patent Application, Apr 2018

2017

  1. MATRIX CO-FACTORISATION AND APPLICATIONS TO MUSIC ANALYSIS
    S. Essid
    In Machine Learning for Music Discovery Workshop, International Conference on Machine Learning (ICML) 2017 , Aug 2017
  2. NONNEGATIVE FEATURE LEARNING METHODS FOR ACOUSTIC SCENE CLASSIFICATION
    V. Bisot, R. Serizel, S. Essid, and G. Richard
    In DCASE 2017 - Workshop on Detection and Classification of Acoustic Scenes and Events , Nov 2017
  3. METHOD FOR PROCESSING AN INPUT AUDIO SIGNAL AND CORRESPONDING ELECTRONIC DEVICE
    S. Parekh, S. Essid, A. Ozerov, Q. Duong, P. Perez, and G. Richard
    Patent Application, Nov 2017
  4. COMPUTATIONAL ANALYSIS OF SOUND SCENES AND EVENTS
    R. Serizel, V. Bisot, S. Essid, and G. Richard
    Nov 2017
  5. COMPUTATIONAL ANALYSIS OF SOUND SCENES AND EVENTS
    S. Essid, S. Parekh, Q. Duong, A. Ozerov, and R. Serizel
    Nov 2017
  6. UE-HRI: A NEW DATASET FOR THE STUDY OF USER ENGAGEMENT IN SPONTANEOUS HUMAN-ROBOT INTERACTIONS
    A. Ben Youssef, C. Clavel, S. Essid, M. Bilac, M. Chamoux, and A. Lim
    In ACM International Conference on Multimodal Interaction , Nov 2017
  7. LEVERAGING DEEP NEURAL NETWORKS WITH NONNEGATIVE REPRESENTATIONS FOR IMPROVED ENVIRONMENTAL SOUND CLASSIFICATION
    V. Bisot, R. Serizel, S. Essid, and G. Richard
    In IEEE International Workshop on Machine Learning for Signal Processing (MLSP) , Sep 2017
  8. GUIDING AUDIO SOURCE SEPARATION BY VIDEO OBJECT INFORMATION
    S. Parekh, S. Essid, A. Ozerov, Q. Duong, P. Perez, and G. Richard
    In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) , Oct 2017
  9. EMOEEG: A NEW MULTIMODAL DATASET FOR DYNAMIC EEG-BASED EMOTION RECOGNITION WITH AUDIOVISUAL ELICITATION
    A. Conneau, A. Hajlaoui, M. Chetouani, and S. Essid
    In The European Signal Processing Conference (EUSIPCO) , Oct 2017
  10. OPINION DYNAMICS MODELING FOR MOVIE REVIEW TRANSCRIPTS
    V. Barriere, C. Clavel, and S. Essid
    In Interspeech , Oct 2017
  11. OVERLAPPING SOUND EVENT DETECTION WITH SUPERVISED NONNEGATIVE MATRIX FACTORIZATION
    V. Bisot, S. Essid, and G. Richard
    In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , Oct 2017
  12. FEATURE LEARNING WITH MATRIX FACTORIZATION APPLIED TO ACOUSTIC SCENE CLASSIFICATION
    V. Bisot, R. Serizel, S. Essid, and G. Richard
    IEEE Transactions on Audio, Speech, and Language Processing (TASLP), Oct 2017
  13. SUPERVISED GROUP NONNEGATIVE MATRIX FACTORISATION WITH SIMILARITY CONSTRAINTS AND APPLICATIONS TO SPEAKER IDENTIFICATION
    R. Serizel, V. Bisot, S. Essid, and G. Richard
    In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , Oct 2017
  14. MOTION INFORMED AUDIO SOURCE SEPARATION
    S. Parekh, S. Essid, A. Ozerov, Q. Duong, P. Perez, and G. Richard
    In IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP) , Oct 2017

2016

  1. DISPOSITIF A CASQUE AUDIO PERFECTIONNE
    S. Essid, and R. Blouet
    Patent Application, Nov 2016
  2. SUPERVISED NONNEGATIVE MATRIX FACTORIZATION FOR ACOUSTIC SCENE CLASSIFICATION
    V. Bisot, R. Serizel, S. Essid, and G. Richard
    In IEEE international evaluation campaign on detection and classification of acousitc scenes and events (DCASE 2016) , Sep 2016
  3. DOWNBEAT DETECTION WITH CONDITIONAL RANDOM FIELDS AND DEEP LEARNED FEATURES
    S. Durand, and S. Essid
    In The 17th International Society for Music Information Retrieval Conference (ISMIR) , Aug 2016
  4. MINI-BATCH STOCHASTIC APPROACHES FOR ACCELERATED MULTIPLICATIVE UPDATES IN NONNEGATIVE MATRIX FACTORISATION WITH BETA-DIVERGENCE
    R. Serizel, S. Essid, and G. Richard
    In IEEE International Workshop on Machine Learning for Signal Processing (MLSP) , Sep 2016
  5. MACHINE LISTENING TECHNIQUES AS A COMPLEMENT TO VIDEO IMAGE ANALYSIS IN FORENSICS
    R. Serizel, V. Bisot, S. Essid, and G. Richard
    In The International Conference on Image Processing (ICIP) , Oct 2016
  6. ACOUSTIC SCENE CLASSIFICATION WITH MATRIX FACTORIZATION FOR UNSUPERVISED FEATURE LEARNING
    V. Bisot, R. Serizel, S. Essid, and G. Richard
    In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , Mar 2016
  7. GROUP NONNEGATIVE MATRIX FACTORISATION WITH SPEAKER AND SESSION VARIABILITY COMPENSATION FOR SPEAKER IDENTIFICATION
    R. Serizel, S. Essid, and G. Richard
    In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , Mar 2016

2015

  1. CONTRIBUTIONS IN MACHINE LEARNING FOR MULTIMODAL DATA ANALYSIS: METHODS, ALGORITHMS AND SYSTEMS FOR TEMPORALLY STRUCTURED DATA
    S. Essid
    Université Pierre et Marie Curie , Sep 2015
    Habilitation Thesis
  2. TPT-DANCE&ACTIONS : UN CORPUS MULTIMODAL D’ACTIVITES HUMAINES
    A. Masurelle, A. Sekkat, S. Essid, and G. Richard
    Revue Traitement du Signal, Sep 2015
  3. MELODY EXTRACTION BY CONTOUR CLASSIFICATION
    R. Bittner, J. Salmon, S. Essid, and J. Bello
    In International Conference on Music Information Retrieval (ISMIR) , Sep 2015
  4. HOG AND SUBBAND POWER DISTRIBUTION IMAGE FEATURES FOR ACOUSTIC SCENE CLASSIFICATION
    V. Bisot, S. Essid, and G. Richard
    In European Signal Processing Conference (EUSIPCO) , Sep 2015
  5. A CONDITIONAL RANDOM FIELD SYSTEM FOR BEAT TRACKING
    T. Fillon, C. Joder, S. Durand, and S. Essid
    In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , Apr 2015

2014

  1. SOFT NONNEGATIVE MATRIX CO-FACTORIZATION
    N. Seichepine, S. Essid, C. Fevotte, and O. Cappe
    IEEE Transactions on Signal Processing, Apr 2014
  2. PIECEWISE CONSTANT NONNEGATIVE MATRIX FACTORIZATION
    N. Seichepine, S. Essid, C. Fevotte, and O. Cappe
    In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) , May 2014
  3. ASSESSMENT OF NEW SPECTRAL FEATURES FOR EEG-BASED EMOTION RECOGNITION
    A. Conneau, and S. Essid
    In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) , May 2014
  4. GESTURE RECOGNITION USING A NMF-BASED REPRESENTATION OF MOTION-TRACES EXTRACTED FROM DEPTH SILHOUETTES
    A. Masurelle, S. Essid, and G. Richard
    In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) , May 2014

2013

  1. CO-FACTORISATION DOUCE EN MATRICES NON-NEGATIVES. APPLICATION AU REGROUPEMENT MULTIMODAL DE LOCUTEURS
    N. Seichepine, S. Essid, C. Fevotte, and O. Cappe
    In GRETSI , Sep 2013
  2. NONNEGATIVE TENSOR FACTORIZATION FOR SINGLE-CHANNEL EEG ARTIFACT REJECTION
    C. Damon, A. Liutkus, A. Gramfort, and S. Essid
    In IEEE International Workshop on Machine Learning for Signal Processing , Sep 2013
  3. EXPLORING NEW FEATURES FOR MUSIC CLASSIFICATION
    R. Foucard, S. Essid, G. Richard, and M. Lagrange
    In International Workshop on Image and Audio Analysis for Multimedia Interactive Services (WIAMIS) , Jul 2013
  4. MULTIMODAL CLASSIFICATION OF DANCE MOVEMENTS USING BODY JOINT TRAJECTORIES AND STEP SOUNDS
    A. Masurelle, S. Essid, and G. Richard
    In International Workshop on Image and Audio Analysis for Multimedia Interactive Services (WIAMIS) , Jul 2013
  5. PROBABILISTIC DANCE PERFORMANCE ALIGNMENT BY FUSION OF MULTIMODAL FEATURES
    A. Dremeau, and S. Essid
    In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) , May 2013
  6. SOFT NONNEGATIVE MATRIX CO-FACTORIZATION WITH APPLICATION TO MULTIMODAL SPEAKER DIARIZATION
    N. Seichepine, S. Essid, C. Fevotte, and O. Cappe
    In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) , May 2013
  7. NONNEGATIVE MATRIX FACTORIZATION FOR SINGLE-CHANNEL EEG ARTIFACT REJECTION
    C. Damon, A. Liutkus, A. Gramfort, and S. Essid
    In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) , May 2013
  8. A MULTIMODAL APPROACH TO SPEAKER DIARIZATION ON TV TALK-SHOWS
    F. Vallet, S. Essid, and J. Carrive
    IEEE Transactions on Multimedia, May 2013
  9. LEARNING OPTIMAL FEATURES FOR POLYPHONIC AUDIO-TO-SCORE ALIGNMENT
    C. Joder, S. Essid, and G. Richard
    IEEE Transactions on Audio, Speech, and Language Processing, May 2013
  10. SMOOTH NONNEGATIVE MATRIX FACTORIZATION FOR UNSUPERVISED AUDIOVISUAL DOCUMENT STRUCTURING
    S. Essid, and C. Fevotte
    IEEE Transactions on Multimedia, May 2013

2012

  1. ANALYSIS OF DANCE MOVEMENTS USING GAUSSIAN PROCESSES
    A. Liutkus, A. Dremeau, D. Alexiadis, S. Essid, and P. Daras
    In ACM Multimedia , Nov 2012
  2. DECOMPOSING THE VIDEO EDITING STRUCTURE OF A TALK-SHOW USING NONNEGATIVE MATRIX FACTORIZATION
    S. Essid, and C. Fevotte
    In International Conference on Image Processing (ICIP) , Oct 2012
  3. MULTIMODAL MUSIC PROCESSING
    S. Essid, and G. Richard
    Oct 2012
  4. A MULTI-MODAL DANCE CORPUS FOR RESEARCH INTO INTERACTION BETWEEN HUMANS IN VIRTUAL ENVIRONMENTS
    S. Essid, X. Lin, M. Gowing, G. Kordelas, A. Aksay, P. Kelly, T. Fillon, Q. Zhang, A. Dielmann, V. Kitanovski, R. Tournemenne, A. Masurelle, E. Izquierdo, N. OConnor, P. Daras, and G. Richard
    Journal on Multimodal User Interfaces: Special issue on multimodal corpora, Oct 2012
  5. AN ADVANCED VIRTUAL DANCE PERFORMANCE EVALUATOR
    S. Essid, D. Alexiadis, R. Tournemenne, M. Gowing, P. Kelly, D. Monhagan, P. Daras, A. Dremeau, and N. OConnor
    In IEEE International Conference on Acoustics, Speech and Signal Processing , Mar 2012
  6. A SINGLE-CLASS SVM BASED ALGORITHM FOR COMPUTING AN IDENTIFIABLE NMF
    S. Essid
    In IEEE International Conference on Acoustics, Speech and Signal Processing , Mar 2012
  7. A REGRESSIVE BOOSTING APPROACH TO AUTOMATIC AUDIO TAGGING BASED ON SOFT ANNOTATOR FUSION
    R. Foucard, S. Essid, M. Lagrange, and G. Richard
    In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , Mar 2012

2011

  1. A MULTIMODAL DANCE CORPUS FOR RESEARCH INTO REAL-TIME INTERACTION BETWEEN HUMANS IN ONLINE VIRTUAL ENVIRONMENTS
    S. Essid, X. Lin, M. Gowing, G. Kordelas, A. Aksay, P. Kelly, T. Fillon, Q. Zhang, A. Dielmann, V. Kitanovski, R. Tournemenne, N. OConnor, P. Daras, and G. Richard
    In ICMI Workshop On Multimodal Corpora For Machine Learning , Nov 2011
  2. AN AUDIO-DRIVEN VIRTUAL DANCE-TEACHING ASSISTANT
    S. Essid, Y. Grenier, M. Maazaoui, G. Richard, and R. Tournemenne
    In ACM Multimedia , Nov 2011
  3. ENHANCED VISUALISATION OF DANCE PERFORMANCE FROM AUTOMATICALLY SYNCHRONISED MULTIMODAL RECORDINGS
    M. Gowing, P. Kelly, N. OConnor, E. Izquierdo, V. Kitanovski, X. Lin, Q. Zhang, C. Concolato, S. Essid, J. Feuvre, and R. Tournemenne
    In ACM Multimedia , Nov 2011
  4. AN INTERACTIVE SYSTEM FOR ELECTRO-ACOUSTIC MUSIC ANALYSIS
    S. Gulluni, S. Essid, O. Buisson, and G. Richard
    In International Conference on Music Information Retrieval (ISMIR) , Oct 2011
  5. MULTI-SCALE TEMPORAL FUSION BY BOOSTING FOR MUSIC CLASSIFICATION
    R. Foucard, S. Essid, M. Lagrange, and G. Richard
    In International Conference on Music Information Retrieval (ISMIR) , Oct 2011
  6. OPTIMIZING THE MAPPING FROM A SYMBOLIC TO AN AUDIO REPRESENTATION FOR MUSIC-TO-SCORE ALIGNMENT
    C. Joder, S. Essid, and G. Richard
    In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) , Oct 2011
  7. NONNEGATIVE MATRIX FACTORIZATION FOR UNSUPERVISED AUDIOVISUAL DOCUMENT STRUCTURING
    S. Essid, and C. Fevotte
    Oct 2011
  8. SEMANTIQUE ET MULTIMODALITE EN ANALYSE DE L INFORMATION
    G. Adda, G. Chollet, S. Essid, T. Fillon, M. Garnier-Rizet, C. Hory, and L. Beltaifa-Zouari
    Oct 2011
  9. TV CONTENT ANALYSIS: TECHNIQUES AND APPLICATIONS
    F. Vallet, S. Essid, J. Carrive, and G. Richard
    Oct 2011
  10. INTERACTIVE CLASSIFICATION OF SOUND OBJECTS FOR POLYPHONIC ELECTRO-ACOUSTIC MUSIC ANNOTATION
    S. Gulluni, S. Essid, O. Buisson, and G. Richard
    In AES 42nd International Conference , Jul 2011
  11. MULTIMEDIA SEMANTICS: METADATA, ANALYSIS AND INTERACTION
    R. Benmokhtar, B. Huet, G. Richard, T. Declerck, and S. Essid
    Jul 2011
  12. MULTIMEDIA SEMANTICS: METADATA, ANALYSIS AND INTERACTION
    S. Essid, M. Campedel, G. Richard, T. Piatrik, R. Benmokhtar, and B. Huet
    Jul 2011
  13. A CONDITIONAL RANDOM FIELD FRAMEWORK FOR ROBUST AND SCALABLE AUDIO-TO-SCORE MATCHING
    C. Joder, S. Essid, and G. Richard
    IEEE Transactions on Audio, Speech and Language Processing, Nov 2011
  14. HIDDEN DISCRETE TEMPO MODEL: A TEMPO-AWARE TIMING MODEL FOR AUDIO-TO-SCORE ALIGNMENT
    C. Joder, S. Essid, and G. Richard
    In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , May 2011

2010

  1. A MULTIMODAL APPROACH TO INITIALISATION FOR TOP-DOWN SPEAKER DIARIZATION OF TELEVISION SHOWS
    S. Bozonnet, F. Vallet, N. Evans, S. Essid, J. Carrive, and G. Richard
    In European Signal Processing Conference (EUSIPCO) , Aug 2010
  2. A CONDITIONAL RANDOM FIELD VIEWPOINT OF SYMBOLIC AUDIO-TO-SCORE MATCHING
    C. Joder, S. Essid, and G. Richard
    In ACM Multimedia 2010 , Oct 2010
  3. APPROCHE HIÉRARCHIQUE POUR UN ALIGNEMENT MUSIQUE-SUR-PARTITION EFFICACE
    C. Joder, S. Essid, and G. Richard
    In Compression et Représentation des Signaux Audiovisuels (CORESA) , Oct 2010
    Received Young Researcher Award!
  4. A COMPARATIVE STUDY OF TONAL ACOUSTIC FEATURES FOR A SYMBOLIC LEVEL MUSIC-TO-SCORE ALIGNMENT
    C. Joder, S. Essid, and G. Richard
    In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , Mar 2010
  5. AN IMPROVED HIERARCHICAL APPROACH FOR MUSIC-TO-SYMBOLIC SCORE ALIGNMENT
    C. Joder, S. Essid, and G. Richard
    In International Conference on Music Information Retrieval (ISMIR) , Aug 2010
  6. YAAFE, AN EASY TO USE AND EFFICIENT AUDIO FEATURE EXTRACTION SOFTWARE
    B. Mathieu, S. Essid, T. Fillon, J. Prado, and G. Richard
    In International Conference on Music Information Retrieval (ISMIR) , Aug 2010
  7. DESCRIPTEURS VISUELS ROBUSTES POUR L IDENTIFICATION DE LOCUTEURS DANS DES EMISSIONS TELEVISEES DE TALK-SHOWS
    F. Vallet, S. Essid, J. Carrive, and G. Richard
    In Compression et Représentation des Signaux Audiovisuels (CORESA) , Oct 2010
  8. ROBUST VISUAL FEATURES FOR THE MULTIMODAL IDENTIFICATION OF UNREGISTERED SPEAKERS IN TV TALK-SHOWS
    F. Vallet, S. Essid, J. Carrive, and G. Richard
    In IEEE International Conference on Image Processing (ICIP) , Oct 2010

2009

  1. INTERACTIVE SEGMENTATION OF ELECTRO-ACOUSTIC MUSIC
    S. Gulluni, S. Essid, O. Buisson, E. Favreau, and G. Richard
    In 2nd International Workshop on Machine Learning and Music (MML - ECML - PKDD) , Sep 2009
  2. ETUDE DES DESCRIPTEURS ACOUSTIQUES POUR L ALIGNEMENT TEMPOREL AUDIO-SUR-PARTITION MUSICALE
    C. Joder, S. Essid, and G. Richard
    In GRETSI , Sep 2009
  3. TEMPORAL INTEGRATION FOR AUDIO CLASSIFICATION WITH APPLICATION TO MUSICAL INSTRUMENT CLASSIFICATION
    C. Joder, S. Essid, and G. Richard
    IEEE Transactions on Audio, Speech and Language Processing, Jan 2009
  4. INCORPORATING PRIOR KNOWLEDGE ON THE DIGITAL MEDIA CREATION PROCESS INTO AUDIO CLASSIFIERS
    M. Lardeur, S. Essid, G. Richard, M. Haller, and T. Sikora
    In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , Apr 2009
  5. RECONNAISSANCE DES INSTRUMENTS DANS LA MUSIQUE POLYPHONIQUE PAR DÉCOMPOSITION NMF ET CLASSIFICATION SVM
    A. Ozerov, S. Essid, and M. Charbit
    Apr 2009

2008

  1. A COLLABORATIVE APPROACH TO AUTOMATIC RUSHES VIDEO SUMMARIZATION
    W. Bailer, E. Dumont, S. Essid, and B. Mérialdo
    In IEEE ICIP Workshop on Multimedia Information Retrieval: New Trends and Challenges , Oct 2008
  2. A COLLABORATIVE APPROACH TO VIDEO SUMMARIZATION
    E. Dumont, B. Merialdo, S. Essid, W. Bailer, D. Byrne, H. Bredin, N. OConnor, G. Jones, M. Haller, A. Krutz, T. Sikora, and T. Piatrik
    In 3rd International Conference on Semantic and Digital Media Technologies (SAMT) , Dec 2008
  3. RUSHES VIDEO SUMMARIZATION USING A COLLABORATIVE APPROACH
    E. Dumont, B. Merialdo, S. Essid, W. Bailer, H. Rehatschek, D. Byrne, H. Bredin, N. OConnor, G. Jones, A. Smeaton, . M. Haller, A. Krutz, T. Sikora, and T. Piatrik
    In TRECVID 2008, ACM International Conference on Multimedia Information Retrieval 2008 , Nov 2008
  4. ALIGNMENT KERNELS FOR AUDIO CLASSIFICATION WITH APPLICATION TO MUSIC INSTRUMENT RECOGNITION
    C. Joder, S. Essid, and G. Richard
    In European Signal Processing Conference (EUSIPCO) , Aug 2008
  5. ON THE ROBUSTNESS OF AUDIO FEATURES FOR MUSICAL INSTRUMENT CLASSIFICATION
    S. Wegener, M. Haller, J. Burred, T. Sikora, S. Essid, and G. Richard
    In European Signal Processing Conference (EUSIPCO) , Sep 2008

2007

  1. ON THE CORRELATION OF AUTOMATIC AUDIO AND VISUAL SEGMENTATIONS OF MUSIC VIDEOS
    O. Gillet, S. Essid, and G. Richard
    IEEE Transactions on Circuits and Systems for Video Technology, Mar 2007
  2. TOWARDS POLYPHONIC MUSICAL INSTRUMENT RECOGNITION
    G. Richard, P. Leveau, L. Daudet, S. Essid, and B. David
    In International Congress on Acoustics (ICA) , Sep 2007
  3. COMBINED SUPERVISED AND UNSUPERVISED APPROACHES FOR AUTOMATIC SEGMENTATION OF RADIOPHONIC AUDIO STREAMS
    G. Richard, M. Ramona, and S. Essid
    In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , Apr 2007
  4. K-SPACE AT TRECVID 2007
    P. Wilkins, T. Adamek, D. Byrne, G. Jones, H. Lee, G. Keenan, K. Guinness, N. OConnor, A. Smeaton, A. Amin, Z. Obrenovic, R. Benmokhtar, E. Galmar, B. Huet, S. Essid, R. Landais, F. Vallet, G. Papadopoulos, S. Vrochidis, V. Mezaris, I. Kompatsiaris, E. Spyrou, Y. Avrithis, R. Morzinger, P. Schallauer, W. Bailer, T. Piatrik, K. Chandramouli, E. Izquierdo, M. Haller, L. Goldmann, A. Samour, A. Cobet, T. Sikora, and P. Praks
    In TRECVID 2007 , Nov 2007

2006

  1. INSTRUMENT RECOGNITION IN POLYPHONIC MUSIC BASED ON AUTOMATIC TAXONOMIES
    S. Essid, G. Richard, and B. David
    IEEE Transactions on Audio, Speech, and Language Processing, Jan 2006
  2. MUSICAL INSTRUMENT RECOGNITION BY PAIRWISE CLASSIFICATION STRATEGIES
    S. Essid, G. Richard, and B. David
    IEEE Transactions on Audio, Speech, and Language Processing, Jul 2006
  3. HIERARCHICAL CLASSIFICATION OF MUSICAL INSTRUMENTS ON SOLO RECORDINGS
    S. Essid, G. Richard, and B. David
    In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , May 2006

2005

  1. CLASSIFICATION AUTOMATIQUE DES SIGNAUX AUDIO-FRÉQUENCES: RECONNAISSANCE DES INSTRUMENTS DE MUSIQUE
    S. Essid
    Université Pierre et Marie Curie , Dec 2005
  2. INSTRUMENT RECOGNITION IN POLYPHONIC MUSIC
    S. Essid, G. Richard, and B. David
    In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , Mar 2005
  3. ON THE USEFULNESS OF DIFFERENTIATED TRANSIENT/STEADY-STATE PROCESSING IN MACHINE RECOGNITION OF MUSICAL INSTRUMENTS
    P. Leveau, S. Essid, G. Richard, L. Daudet, and B. David
    In AES convention , May 2005

2004

  1. EFFICIENT MUSICAL INSTRUMENT RECOGNITION ON SOLO PERFORMANCES USING BASIC FEATURES
    S. Essid, G. Richard, and B. David
    In AES 25th conference , Jun 2004
  2. MUSICAL INSTRUMENT RECOGNITION ON SOLO PERFORMANCES
    S. Essid, G. Richard, and B. David
    In European Signal Processing Conference (EUSIPCO) , Sep 2004
  3. MUSICAL INSTRUMENT RECOGNITION BASED ON CLASS PAIRWISE FEATURE SELECTION
    S. Essid, G. Richard, and B. David
    In International Conference on Music Information Retrieval (ISMIR) , Oct 2004

2003

  1. MODÈLES SINUSOÏDAUX ÉTENDUS POUR LE CODAGE AUDIO
    R. Boyer, S. Essid, K. Abed-Meraim, and N. Moreau
    In Dix-neuvième colloque sur le Traitement du Signal et des Images , Sep 2003

2002

  1. TRANSIENT MODELING WITH A FREQUENCY-TRANSFORM SUBSPACE ALGORITHM AND TRANSIENT + SINUSOIDAL SCHEME
    R. Boyer, and S. Essid
    In 14th IEEE Int. Conf. on Digital Signal Proc. , Jul 2002
  2. DYNAMIC TEMPORAL SEGMENTATION IN PARAMETRIC NON-STATIONARY MODELING FOR PERCUSSIVE MUSICAL SIGNALS
    R. Boyer, S. Essid, and N. Moreau
    In IEEE Int. Conf. on Multimedia and Expo (ICME) , Aug 2002
  3. NON-STATIONARY SIGNAL PARAMETRIC MODELING TECHNIQUES WITH AN APPLICATION TO LOW BITRATE AUDIO CODING
    R. Boyer, S. Essid, and N. Moreau
    In 6th IEEE Int. Conf. Signal Processing , Aug 2002
  4. CODEUR AUDIO PARAMÉTRIQUE BAS DÉBIT BASÉ SUR UN MODÈLE "SINUSOÏDES AMORTIES EXPONENTIELLEMENT + TRANSITOIRES + BRUIT"
    S. Essid
    Ecole Nationale Supérieure des Télécommunications (ENST) , Oct 2002

2001

  1. EXPLORATION DE TECHNIQUES MODERNES DE MODÉLISATION ADAPTÉES À DU CODAGE AUDIO BAS-DÉBIT
    R. Boyer, S. Essid, and N. Moreau
    In 7èmes Journées d Etudes et d Echanges : Compression et Représentation des Signaux Audiovisuels (CORESA) , Oct 2001