Audio-visual representation learning aims to develop systems with human-...
Spoken Language Understanding (SLU) is a task that aims to extract seman...
We propose EAR, a query Expansion And Reranking approach for improving
p...
In this paper, we show that representations capturing syllabic units eme...
Speech processing Universal PERformance Benchmark (SUPERB) is a leaderbo...
The recent breakthroughs in natural language processing for model pretra...
Prompt tuning is a technology that tunes a small set of parameters to st...
We present Masked Audio-Video Learners (MAViL) to train audio-visual
rep...
Recent studies find existing self-supervised speech encoders contain
pri...
We present the SUPERB challenge at SLT 2022, which aims at learning
self...
In this study, we aim to explore efficient tuning methods for speech
sel...
Although supervised deep learning has revolutionized speech and audio
pr...
Deep learning has been the mainstream technique in natural language
proc...
Speech representations learned from Self-supervised learning (SSL) model...
Although deep learning-based end-to-end Automatic Speech Recognition (AS...
Transfer learning has proven to be crucial in advancing the state of spe...
Spoken Question Answering (SQA) is to find the answer from a spoken docu...
Pretrained language models (PTLMs) are typically learned over a large, s...
Many recent successes in sentence representation learning have been achi...
Automatic detection of toxic language plays an essential role in protect...
Neural network pretraining is gaining attention due to its outstanding
p...
Self-supervised learning (SSL) has proven vital for advancing research i...
Pretrained language models have significantly improved the performance o...
In this work, we propose a novel goal-oriented dialog task, automatic sy...
Dialog State Tracking (DST), an integral part of modern dialog systems, ...
Since its introduction in 2011, there have been over 4000 MOOCs on vario...
Recently deep learning has dominated many machine learning areas, includ...
Much recent work on Spoken Language Understanding (SLU) falls short in a...
Much recent work on Spoken Language Understanding (SLU) is limited in at...
Neural models have yielded state-of-the-art results in deciphering spoke...
We introduce a self-supervised speech pre-training method called TERA, w...
Spoken dialog systems have seen applications in many domains, including
...
For self-supervised speech processing, it is crucial to use pretrained m...
Modern virtual personal assistants provide a convenient interface for
co...