Detecting actions in untrimmed videos should not be limited to a small,
...
While there have been significant gains in the field of automated video
...
A major challenge in text-video and text-audio retrieval is the lack of
...
We present a method to infer the 3D pose of mice, including the limbs an...
Variational autoencoders learn unsupervised data representations, but th...
We propose TAL-Net, an improved approach to temporal action localization...
We propose a method for unsupervised video object segmentation by
transf...
Convolutional Neural Networks (CNNs) have proven very effective in image...