Starting from the resurgence of deep learning, vision-language models (V...
While the emergence of powerful language models along with Chain-of-thou...
In the constant updates of the product dialogue systems, we need to retr...
Current image captioning works usually focus on generating descriptions ...
Distantly-Supervised Named Entity Recognition effectively alleviates the...