Large deep learning models have achieved remarkable success in many
scen...
Text-video retrieval is an important multi-modal learning task, where th...
Due to the need to store the intermediate activations for back-propagati...
Assessing action quality from videos has attracted growing attention in
...