In order to meet the demand for higher scene rendering quality from some...
We introduce a language modeling approach for text to speech synthesis (...
The massive growth of self-supervised learning (SSL) has been witnessed ...
Recently, self-supervised learning (SSL) has demonstrated strong perform...
Recently, pioneering work has found that speech pre-trained models can solve fu...
Self-supervised learning (SSL) achieves great success in speech recognit...
The speech representations learned from large-scale unlabeled data have ...
Self-supervised learning (SSL) is a long-standing goal for speech proces...
In this paper, we propose a unified pre-training approach called UniSpee...
Continuous speech separation plays a vital role in complicated speech re...
Recently, there has been a strong push to transition from hybrid models ...
End-to-end speech translation poses a heavy burden on the encoder, becau...
The attention-based encoder-decoder model has achieved impressive results for...
End-to-end speech translation, a hot topic in recent years, aims to tran...
Owing to its highly parallelizable architecture, the Transformer is faster to ...
Recently, the Transformer has achieved state-of-the-art performance on m...