In recent years, datasets of paired audio and captions have enabled
rema...
In this work, we investigate the personalization of text-to-music diffus...
The study of speech disorders can benefit greatly from time-aligned data...
Modern speech recognition systems exhibits rapid performance degradation...