Audio deepfake detection is an emerging, active topic. A growing number o...
Current fake audio detection algorithms have achieved promising performa...
End-to-end model, especially Recurrent Neural Network Transducer (RNN-T)...
Self-supervised speech models are a rapidly developing research topic in...
The rapid advancement of spoofing algorithms necessitates the developmen...
Audio deepfake detection is an emerging topic in the artificial intellig...
Expanding the language coverage of speech technology has the potential t...
ESPnet-ST-v2 is a revamp of the open-source ESPnet-ST toolkit necessitat...
Neural transducers have gained popularity in production ASR systems, ach...
Automatic breast lesion detection and classification is an important tas...
For anti-quantum computing, we will discuss various number-based string...
Blockchain technology enables stakeholders to conduct trusted data shari...
The coming quantum computation is forcing us to reexamine the cryptosyst...
Deep learning (DL) techniques have been extensively utilized for medical...
It is well known that many machine learning systems demonstrate bias tow...
From wearables to powerful smart devices, modern automatic speech recogn...
This paper improves the streaming transformer transducer for speech reco...
We study numerical conformal mappings of planar Jordan domains with boun...
Hybrid automatic speech recognition (ASR) models are typically sequentia...
Deep learning-based image super-resolution (DL-SR) has shown great promi...
In this work, to measure the accuracy and efficiency for a latency-contr...
In this work, we first show that on the widely used LibriSpeech benchmar...
Deep acoustic models typically receive features in the first layer of th...
We propose and evaluate transformer-based acoustic models (AMs) for hybr...
There is an implicit assumption that traditional hybrid approaches for a...
Towards developing high-performing ASR for low-resource languages, appro...
A mathematical topology with matrix is a natural representation of a cod...
Graphical passwords (GPWs) are used in many areas of the current world. Topol...
Topological graphic passwords (Topsnut-gpws) are one of graph-type passw...
Graphical passwords (GPWs) are convenient for mobile equipment with tou...
Automated vehicles are deemed to be the key element for the intelligent ...
Speech recognition systems for irregularly-spelled languages like Englis...
We describe the neural-network training framework used in the Kaldi spee...