Publication

Estimating Dominance in Multi-Party Meetings Using Speaker Diarization

Publications associées (6)

Multisensory haptic system and method

Olaf Blanke, Simon Gallo, Giulio Rognini

A computer-implemented method for operating a haptic device, the haptic device comprising a plurality of tactile displays configured to provide haptic stimuli to a user, the method including the steps of (a) processing an audio signal derived from an audio ...

2020

System fusion and speaker linking for longitudinal diarization of TV shows

Hervé Bourlard, Petr Motlicek

Performing speaker diarization while uniquely identifying the speakers in a collection of audio recordings is a challenging task. Based on our previous work on speaker diarization and linking, we developed a system for diarizing longitudinal TV show data s ...

IEEE2016

On dynamic stream weighting for Audio-Visual Speech Recognition

Jean-Philippe Thiran, Mihai Gurban, Virginia Estellers Casas

The integration of audio and visual information improves speech recognition performance, specially in the presence of noise. In these circumstances it is necessary to introduce audio and visual weights to control the contribution of each modality to the re ...

2012

Blocking artifacts in speech/audio: Dynamic auditory model-based characterization and optimal time-frequency smoothing

Chandra Sekhar Seelamantula

We revisit the problem of blocking artifacts and their suppression in generic frame-based speech/audio applications. We provide a perceptual characterization of the artifacts by using dynamic auditory models. We propose some short-time-Fourier-transform-ba ...

2009

Low-Dimensional Motion Features for Audio-Visual Speech Recognition

Jean-Philippe Thiran, Mihai Gurban, Andrés Vallés

Audio-visual speech recognition promises to improve the performance of speech recognizers, especially when the audio is corrupted, by adding information from the visual modality, more specifically, from the video of the speaker. However, the number of visu ...

2007

Improved Time Delay Analysis/Synthesis for Parametric Stereo Audio Coding

Christophe Tournery, Christof Faller

For parametric stereo and multi-channel audio coding, it has been proposed to use level difference, time difference, and coherence cues between audio channels to represent the perceptual spatial features of stereo and multi-channel audio signals. In practi ...

2006