Hierarchical Character-Word Models for Language Identification

08/10/2016
by Aaron Jaech et al.

The brevity and unconventional spelling of social media messages pose a challenge to language identification. We introduce a hierarchical model that learns character and contextualized word-level representations for language identification. Our method performs well against strong baselines and can also reveal code-switching.
