Audio-visual representation learning aims to develop systems with human-...
Text language models have shown remarkable zero-shot capability in
gener...
Large language models (LLMs) have gained considerable attention for
Arti...
Multi-talker overlapped speech poses a significant challenge for speech
...
Automatic speaker verification (ASV) plays a critical role in
security-s...
Despite multiple efforts made towards adopting complex-valued deep neura...
Audio-visual synchronization aims to determine whether the mouth movemen...
We present the SUPERB challenge at SLT 2022, which aims at learning
self...
Audio-visual active speaker detection (AVASD) is well-developed, and now...
Human-AI shared control allows human to interact and collaborate with AI...
Recently, many novel techniques have been introduced to deal with spoofi...
The past few years have witnessed the significant advances of speech
syn...
This paper describes our speaker diarization system submitted to the
Mul...
A leaderboard named Speech processing Universal PERformance Benchmark
(S...
Automatic speaker verification (ASV) is a well developed technology for
...
Previous works have shown that automatic speaker verification (ASV) is
s...
The state-of-the-art driving automation system demands extreme computati...
Automatic speaker verification (ASV) is one of the core technologies in
...
In recent years, Multi-Agent Reinforcement Learning (MARL) has revolutio...
In Cooperative Multi-Agent Reinforcement Learning (MARL) and under the
s...
High-performance anti-spoofing models for automatic speaker verification...
Various forefront countermeasure methods for automatic speaker verificat...
High-performance spoofing countermeasure systems for automatic speaker
v...