#speech representation

2022-02-22

#Voice AI #Speech Representation #Deep Learning

How deep learning models can isolate independent factors of variation in data through VAEs and Beta-TCVAE, enabling controlled synthesis and better downstream representations.


Learning Speaker Representation with Semi-supervised Learning Approach for Speaker Profiling

2021-10-24. Shangeth Rajaa, Pham Van Tung, Chng Eng Siong. arXiv preprint arXiv:2110.13653.

#Voice AI #Speaker Profiling #Speech Representation #Semi-supervised Learning

A semi-supervised framework for speaker profiling that leverages external unlabelled corpora by combining supervised, unsupervised, and consistency training, achieving an RMSE of 6.8 years on age estimation.
