This paper proposes an approach for anomalous sound detection that
incor...
Self-supervised pre-trained models such as Wav2vec2, Hubert, and WavLM h...
Labeled audio data is insufficient to build satisfying speech recognitio...
Speaker embedding has been a fundamental feature for speaker-related tas...
Clustering-based speaker diarization has stood firm as one of the major
...
Unsupervised clustering on speakers is becoming increasingly important f...
We propose BeamTransformer, an efficient architecture to leverage
beamfo...
In this paper we describe a speaker diarization system that enables
loca...