Seq-2-Seq based Refinement of ASR Output for Spoken Name Capture

03/29/2022
by   Karan Singla, et al.
0

Person name capture from human speech is a difficult task in human-machine conversations. In this paper, we propose a novel approach to capture the person names from the caller utterances in response to the prompt "say and spell your first/last name". Inspired from work on spell correction, disfluency removal and text normalization, we propose a lightweight Seq-2-Seq system which generates a name spell from a varying user input. Our proposed method outperforms the strong baseline which is based on LM-driven rule-based approach.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset