Vocal Tract Length Normalization for Statistical Parametric Speech Synthesis

À propos
Confidentialité
Mentions légales

Graph Chatbot

Publications associées (27)

Fair Voice Biometrics: Impact of Demographic Imbalance on Group Fairness in Speaker Recognition

Mirko Marras

Speaker recognition systems are playing a key role in modern online applications. Though the susceptibility of these systems to discrimination according to group fairness metrics has been recently studied, their assessment has been mainly focused on the di ...

ISCA-INT SPEECH COMMUNICATION ASSOC2021

On The Relationship Between Speech-Based Breathing Signal Prediction Evaluation Measures And Breathing Parameters Estimation

Mathew Magimai Doss, Zohreh Mostaani, Venkata Srikanth Nallanthighal

The respiratory system is one of the major components of the speech production system. Any alteration in breathing can result in changes in speech. Specific breathing characteristics, such as breathing rate and tidal volume, can indicate a person's patholo ...

IEEE2021

Multilingual Training and Adaptation in Speech Recognition

Sibo Tong

State-of-the-art acoustic models for Automatic Speech Recognition (ASR) are based on Hidden Markov Models (HMM) and Deep Neural Networks (DNN) and often require thousands of hours of transcribed speech data during training. Therefore, building multilingual ...

EPFL2020

Multilingual and Unsupervised Subword Modeling for Zero-Resource Languages

Enno Hermann

Subword modeling for zero-resource languages aims to learn low-level representations of speech audio without using transcriptions or other resources from the target language (such as text corpora or pronunciation dictionaries). A good representation should ...

2021

A Bin Encoding Training Of A Spiking Neural Network Based Voice Activity Detection

Milos Cernak, Giorgia Dellaferrera

Advances of deep learning for Artificial Neural Networks (ANNs) have led to significant improvements in the performance of digital signal processing systems implemented on digital chips. Although recent progress in low-power chips is remarkable, neuromorph ...

IEEE2020

Chemiscope: interactive structure-property explorer for materials and molecules

Michele Ceriotti, Guillaume André Jean Fraux

The number of materials or molecules that can be created by combining different chemical elements in various proportions and spatial arrangements is enormous. Computational chemistry can be used to generate databases containing billions of potential struct ...

2020

Automatic Pathological Speech Intelligibility Assessment Exploiting Subspace-Based Analyses

Hervé Bourlard, Parvaneh Janbakhshi, Ina Kodrasi

Competitive state-of-the-art automatic pathological speech intelligibility measures typically rely on regression training on a large number of features, require a large amount of healthy speech training data, or are applicable only to phonetically balanced ...

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC2020

Page 2 sur 2