Video-language pre-training (VLP) has become increasingly important due ...
Recent research on Large Language Models (LLMs) has led to remarkable ad...
This paper examines the problems of severe image-text misalignment and h...
Various stuff and things in visual data possess specific traits, which c...
Humans excel at learning from expert demonstrations and solving their ow...
Recently, to improve unsupervised image retrieval performance, plent...