Subword tokenization is the de facto standard for tokenization in neural...
We study the role of an essential hyper-parameter that governs the train...
State-of-the-art multilingual systems rely on shared vocabularies that
s...
We describe the design, the evaluation setup, and the results of the 201...
This review paper discusses how context has been used in neural machine
...
This paper demonstrates that word sense disambiguation (WSD) can improve...
The goal of this work is to design a machine translation system for a
lo...
Neural sequence-to-sequence models achieve remarkable performance not on...
Hierarchical attention networks have recently achieved remarkable perfor...