Publication

Automatic Speech Recognition Benchmark for Air-Traffic Communications

Publications associées (26)

Utterance Verification-Based Dysarthric Speech Intelligibility Assessment Using Phonetic Posterior Features

Mathew Magimai Doss, Julian David Fritsch

In the literature, the task of dysarthric speech intelligibility assessment has been approached through development of different low-level feature representations, subspace modeling, phone confidence estimation or measurement of automatic speech recognitio ...

IEEE Institute of Electrical and Electronics Engineers2021

Novel Methods for Incorporating Prior Knowledge for Automatic Speech Assessment

Subrahmanya Pavankumar Dubagunta

Speech signal conveys several kinds of information such as a message, speaker identity, emotional state of the speaker and social state of the speaker. Automatic speech assessment is a broad area that refers to using automatic methods to predict human judg ...

EPFL2021

Fair Voice Biometrics: Impact of Demographic Imbalance on Group Fairness in Speaker Recognition

Mirko Marras

Speaker recognition systems are playing a key role in modern online applications. Though the susceptibility of these systems to discrimination according to group fairness metrics has been recently studied, their assessment has been mainly focused on the di ...

ISCA-INT SPEECH COMMUNICATION ASSOC2021

Estimating The Degree of Sleepiness by Integrating Articulatory Feature Knowledge In Raw Waveform Based CNNs

Subrahmanya Pavankumar Dubagunta, Julian David Fritsch

Speech-based degree of sleepiness estimation is an emerging research problem. This paper investigates an end-to-end approach, where given raw waveform as input, a convolutional neural network (CNN) estimates at its output the degree of sleepiness. Within t ...

IEEE2020

Multilingual Training and Adaptation in Speech Recognition

Sibo Tong

State-of-the-art acoustic models for Automatic Speech Recognition (ASR) are based on Hidden Markov Models (HMM) and Deep Neural Networks (DNN) and often require thousands of hours of transcribed speech data during training. Therefore, building multilingual ...

EPFL2020

Self-Supervised Prototypical Transfer Learning for Few-Shot Classification

Matthias Grossglauser, Arnout Jan J Devos, Carlos Roberto Medina Temme

Recent advances in transfer learning and few-shot learning largely rely on annotated data related to the goal task during (pre-)training. However, collecting sufficiently similar and annotated data is often infeasible. Building on advances in self-supervis ...

2020