The neural attention mechanism has been incorporated into deep neural
ne...
Attention-based neural networks have achieved state-of-the-art results o...
Single domain generalization aims to learn a model that performs well on...
Dropout has been demonstrated as a simple and effective module to not on...
Attention modules, as simple and effective tools, have not only enabled ...
Models based on the Transformer architecture have achieved better accura...
Sequence generation models are commonly refined with reinforcement learn...
Selecting hyperparameters for unsupervised learning problems is difficul...