Recognizing Film Entities in Podcasts

09/24/2018
by   Ahmet Salih Gundogdu, et al.
0

In this paper, we propose a Named Entity Recognition (NER) system to identify film titles in podcast audio. Taking inspiration from NER systems for noisy text in social media, we implement a two-stage approach that is robust to computer transcription errors and does not require significant computational expense to accommodate new film titles/releases. Evaluating on a diverse set of podcasts, we demonstrate more than a 20 baseline approaches when combining fuzzy-matching with a linear model aware of film-specific metadata.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset