The goal of DCASE 2023 Challenge Task 7 is to generate various sound cli...
Previous pitch-controllable text-to-speech (TTS) models rely on directly...
Offline reinforcement learning has developed rapidly over the recent yea...
Previous generative adversarial network (GAN)-based neural vocoders are ...
In this paper, we present SANE-TTS, a stable and natural end-to-end mult...
We focus on the problem of adversarial attacks against models on discret...
Conventionally, audio super-resolution models fixed the initial and the ...
In this work, we propose a joint system combining a talking face generat...
We propose a singing decomposition system that encodes time-aligned ling...
In this work, we introduce NU-Wave, the first neural audio upsampling mo...
Computer vision has achieved impressive progress in recent years. Meanwh...
The Low-Power Image Recognition Challenge (LPIRC, https://rebootingcompu...