Most existing video self-supervised methods leverage tempo...
Existing video self-supervised learning methods mainly rely on trimmed v...
In self-supervised spatio-temporal representation learning, the temporal...
We propose a novel self-supervised method, referred to as Video Cloze Pr...