b'Jordi Luque'

research

∙ 01/31/2023

Scheduling Inference Workloads on Distributed Edge Clusters with Reinforcement Learning

Many real-time applications (e.g., Augmented/Virtual Reality, cognitive ...

1 Gabriele Castellano, et al. ∙

research

∙ 10/27/2022

Iterative pseudo-forced alignment by acoustic CTC loss for self-supervised ASR domain adaptation

High-quality data labeling from specific domains is costly and human tim...

0 Fernando Lopez, et al. ∙

research

∙ 07/14/2022

Data Augmentation for Low-Resource Quechua ASR Improvement

Automatic Speech Recognition (ASR) is a key element in new services that...

0 Rodolfo Zevallos, et al. ∙

research

∙ 12/21/2021

Voice Quality and Pitch Features in Transformer-Based Speech Recognition

Jitter and shimmer measurements have shown to be carriers of voice quali...

0 Guillermo Cámbara, et al. ∙

research

∙ 05/09/2021

English Accent Accuracy Analysis in a State-of-the-Art Automatic Speech Recognition System

Nowadays, research in speech technologies has gotten a lot out thanks to...

0 Guillermo Cámbara, et al. ∙

research

∙ 01/29/2021

Speech Enhancement for Wake-Up-Word detection in Voice Assistants

Keyword spotting and in particular Wake-Up-Word (WUW) detection is a ver...

2 David Bonet, et al. ∙

research

∙ 01/29/2021

BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge

This paper describes joint effort of BUT and Telefónica Research on deve...

0 Martin Kocour, et al. ∙

research

∙ 09/02/2020

Convolutional Speech Recognition with Pitch and Voice Quality Features

The effects of adding pitch and voice quality features such as jitter an...

0 Guillermo Cámbara, et al. ∙

research

∙ 06/01/2020

Transcription-Enriched Joint Embeddings for Spoken Descriptions of Images and Videos

In this work, we propose an effective approach for training unique embed...

9 Benet Oriol, et al. ∙

research

∙ 11/12/2019

Detection of speech events and speaker characteristics through photo-plethysmographic signal neural processing

The use of photoplethysmogram signal (PPG) for heart and sleep monitorin...

0 Guillermo Cámbara, et al. ∙

research

∙ 10/15/2019

Seeing and Hearing Egocentric Actions: How Much Can We Learn?

Our interaction with the world is an inherently multimodal experience. H...

8 Alejandro Cartas, et al. ∙

research

∙ 09/25/2019

Input complexity and out-of-distribution detection with likelihood-based generative models

Likelihood-based generative models are a promising resource to detect ou...

0 Joan Serrà, et al. ∙

research

∙ 06/03/2019

How Much Does Audio Matter to Recognize Egocentric Object Interactions?

Sounds are an important source of information on our daily interactions ...

5 Alejandro Cartas, et al. ∙

research

∙ 10/09/2016

Emergence of linguistic laws in human voice

Linguistic laws constitute one of the quantitative cornerstones of moder...

0 Ivan Gonzalez Torre, et al. ∙

Jordi Luque

Featured Co-authors

Sign in with Google

Consider DeepAI Pro