Junichi Yamagishi

research

∙ 09/18/2023

Spoofing attack augmentation: can differently-trained attack models improve generalisation?

A reliable deepfake detector or spoofing countermeasure (CM) should be r...

1 Wanying Ge, et al. ∙

research

∙ 09/14/2023

DDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input

We explore the use of neural synthesis for acoustic guitar from string-w...

1 Nicolas Jonason, et al. ∙

research

∙ 09/12/2023

SynVox2: Towards a privacy-friendly VoxCeleb2 dataset

The success of deep learning in speaker recognition relies heavily on th...

2 Xiaoxiao Miao, et al. ∙

research

∙ 09/12/2023

Can large-scale vocoded spoofed data improve speech spoofing countermeasure with a self-supervised front end?

A speech spoofing countermeasure (CM) that discriminates between unseen ...

1 Xin Wang, et al. ∙

research

∙ 06/15/2023

Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music

With the growing amount of musical data available, automatic instrument ...

0 Lifan Zhong, et al. ∙

research

∙ 05/30/2023

Towards single integrated spoofing-aware speaker verification embeddings

This study aims to develop a single integrated spoofing-aware speaker ve...

3 Sung Hwan Mun, et al. ∙

research

∙ 05/30/2023

Language-independent speaker anonymization using orthogonal Householder neural network

Speaker anonymization aims to conceal a speaker's identity while preserv...

0 Xiaoxiao Miao, et al. ∙

research

∙ 05/28/2023

Range-Based Equal Error Rate for Spoof Localization

Spoof localization, also called segment-level detection, is a crucial ta...

1 Lin Zhang, et al. ∙

research

∙ 03/05/2023

Cyber Vaccine for Deepfake Immunity

Deepfakes pose an evolving threat to cybersecurity, which calls for the ...

0 Ching-Chun Chang, et al. ∙

research

∙ 11/29/2022

Hiding speaker's sex in speech using zero-evidence speaker representation in an analysis/synthesis pipeline

The use of modern vocoders in an analysis/synthesis pipeline allows us t...

0 Paul-Gauthier Noé, et al. ∙

research

∙ 11/25/2022

Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?

With the similarity between music and speech synthesis from symbolic inp...

0 Xuan Shi, et al. ∙

research

∙ 10/27/2022

Outlier-Aware Training for Improving Group Accuracy Disparities

Methods addressing spurious correlations such as Just Train Twice (JTT, ...

2 Li-Kuang Chen, et al. ∙

research

∙ 10/19/2022

Spoofed training data for speech spoofing countermeasure can be efficiently created using neural vocoders

A good training set for speech spoofing countermeasures requires diverse...

0 Xin Wang, et al. ∙

research

∙ 10/18/2022

Analysis of Master Vein Attacks on Finger Vein Recognition Systems

Finger vein recognition (FVR) systems have been commercially used, espec...

4 Huy H. Nguyen, et al. ∙

research

∙ 10/05/2022

ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild

Benchmarking initiatives support the meaningful comparison of competing ...

0 Xuechen Liu, et al. ∙

research

∙ 09/01/2022

Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances

Conventional automatic speaker verification systems can usually be decom...

0 Chang Zeng, et al. ∙

research

∙ 05/14/2022

The VoicePrivacy 2020 Challenge Evaluation Plan

The VoicePrivacy Challenge aims to promote the development of privacy pr...

0 Natalia Tomashenko, et al. ∙

research

∙ 04/11/2022

The PartialSpoof Database and Countermeasures for the Detection of Short Generated Audio Segments Embedded in a Speech Utterance

Automatic speaker verification is susceptible to various manipulations a...

0 Lin Zhang, et al. ∙

research

∙ 03/28/2022

Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions

In our previous work, we proposed a language-independent speaker anonymi...

0 Xiaoxiao Miao, et al. ∙

research

∙ 03/23/2022

The VoicePrivacy 2022 Challenge Evaluation Plan

For new participants - Executive summary: (1) The task is to develop a v...

2 Natalia Tomashenko, et al. ∙

research

∙ 03/22/2022

Joint Noise Reduction and Listening Enhancement for Full-End Speech Enhancement

Speech enhancement (SE) methods mainly focus on recovering clean speech ...

0 Haoyu Li, et al. ∙

research

∙ 03/21/2022

The VoiceMOS Challenge 2022

We present the first edition of the VoiceMOS Challenge, a scientific eve...

0 Wen-Chin Huang, et al. ∙

research

∙ 02/26/2022

Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models

Speaker anonymization aims to protect the privacy of speakers while pres...

0 Xiaoxiao Miao, et al. ∙

research

∙ 02/24/2022

Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation

The performance of spoofing countermeasure systems depends fundamentally...

0 Hemlata Tak, et al. ∙

research

∙ 02/13/2022

Robust Deepfake On Unrestricted Media: Generation And Detection

Recent advances in deep learning have led to substantial improvements in...

6 Trung-Nghia Le, et al. ∙

research

∙ 01/24/2022

Optimizing Tandem Speaker Verification and Anti-Spoofing Systems

As automatic speaker verification (ASV) systems are vulnerable to spoofi...

1 Anssi Kanervisto, et al. ∙

research

∙ 01/10/2022

A Practical Guide to Logical Access Voice Presentation Attack Detection

Voice-based human-machine interfaces with an automatic speaker verificat...

0 Xin Wang, et al. ∙

research

∙ 11/25/2021

Effectiveness of Detection-based and Regression-based Approaches for Estimating Mask-Wearing Ratio

Estimating the mask-wearing ratio in public places is important as it en...

0 Khanh-Duy Nguyen, et al. ∙

research

∙ 11/15/2021

Investigating self-supervised front ends for speech spoofing countermeasures

Self-supervised speech model is a rapid progressing research topic, and ...

0 Xin Wang, et al. ∙

research

∙ 10/18/2021

LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech

An effective approach to automatically predict the subjective rating for...

4 Wen-Chin Huang, et al. ∙

research

∙ 10/11/2021

LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example

Emotional and controllable speech synthesis is a topic that has received...

1 Hieu-Thi Luong, et al. ∙

research

∙ 10/10/2021

Estimating the confidence of speech spoofing countermeasure

Conventional speech spoofing countermeasures (CMs) are designed to make ...

0 Xin Wang, et al. ∙

research

∙ 10/04/2021

On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis

Are end-to-end text-to-speech (TTS) models over-parametrized? To what ex...

1 Cheng-I Jeff Lai, et al. ∙

research

∙ 09/16/2021

DDS: A new device-degraded speech dataset for speech enhancement

A large and growing amount of speech content in real-life scenarios is b...

0 Haoyu Li, et al. ∙

research

∙ 09/08/2021

Master Face Attacks on Face Recognition Systems

Face authentication is now widely used, especially on mobile devices, ra...

10 Huy H. Nguyen, et al. ∙

research

∙ 09/01/2021

The VoicePrivacy 2020 Challenge: Results and findings

This paper presents the results and analyses stemming from the first Voi...

2 Natalia Tomashenko, et al. ∙

research

∙ 09/01/2021

ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection

ASVspoof 2021 is the forth edition in the series of bi-annual challenges...

1 Junichi Yamagishi, et al. ∙

research

∙ 09/01/2021

ASVspoof 2021: Automatic Speaker Verification Spoofing and Countermeasures Challenge Evaluation Plan

The automatic speaker verification spoofing and countermeasures (ASVspoo...

2 Hector Delgado, et al. ∙

research

∙ 09/01/2021

Benchmarking and challenges in security and privacy for voice biometrics

For many decades, research in speech technologies has focused upon impro...

0 Jean-Francois Bonastre, et al. ∙

research

∙ 07/30/2021

OpenForensics: Large-Scale Challenging Dataset For Multi-Face Forgery Detection And Segmentation In-The-Wild

The proliferation of deepfake media is raising concerns among the public...

1 Trung-Nghia Le, et al. ∙

research

∙ 07/29/2021

Multi-Task Learning in Utterance-Level and Segmental-Level Spoof Detection

In this paper, we provide a series of multi-tasking benchmarks for simul...

0 Lin Zhang, et al. ∙

research

∙ 07/24/2021

Use of speaker recognition approaches for learning timbre representations of musical instrument sounds from raw waveforms

Timbre representations of musical instruments, essential for diverse app...

0 Xuan Shi, et al. ∙

research

∙ 07/20/2021

SVSNet: An End-to-end Speaker Voice Similarity Assessment Model

Neural evaluation metrics derived for numerous speech generation tasks h...

1 Cheng-Hung Hu, et al. ∙

research

∙ 06/25/2021

Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance

Generally speaking, the main objective when training a neural speech syn...

3 Hieu-Thi Luong, et al. ∙

research

∙ 06/11/2021

Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing

Whether it be for results summarization, or the analysis of classifier f...

7 Tomi Kinnunen, et al. ∙

research

∙ 06/02/2021

A Multi-Level Attention Model for Evidence-Based Fact Checking

Evidence-based fact checking aims to verify the truthfulness of a claim ...

2 Canasai Kruengkrai, et al. ∙

research

∙ 05/05/2021

How do Voices from Past Speech Synthesis Challenges Compare Today?

Shared challenges provide a venue for comparing systems trained on commo...

0 Erica Cooper, et al. ∙

research

∙ 05/04/2021

Exploring Disentanglement with Multilingual and Monolingual VQ-VAE

This work examines the content and usefulness of disentangled phone and ...

0 Jennifer Williams, et al. ∙

research

∙ 04/25/2021

Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis

Speech synthesis and music audio generation from symbolic input differ i...

0 Erica Cooper, et al. ∙

research

∙ 04/17/2021

Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement

The intelligibility of speech severely degrades in the presence of envir...

0 Haoyu Li, et al. ∙

Junichi Yamagishi

Featured Co-authors

Sign in with Google

Consider DeepAI Pro