Although deep learning methods have achieved advanced video object
recog...
Can Transformer perform 2D object-level recognition from a pure
sequence...
Can our video understanding systems perceive objects when a heavy occlus...
Abstract visual reasoning connects mental abilities to the physical worl...