In this paper, we present GEM as a General Evaluation benchmark for
Mult...
This paper presents a Multitask Multilingual Multimodal Pre-trained mode...
In this paper, we introduce XGLUE, a new benchmark dataset to train
larg...
While many BERT-based cross-modal pre-trained models produce excellent
r...
In this paper, we introduce a new vision-language pre-trained model –
Im...