The basic principle of the patch-matching based style transfer is to sub...
In this paper, we present VideoGen, a text-to-video generation approach,...
Vision Transformer (ViT) is now dominating many vision tasks. The drawbac...
Transformer-based models achieve favorable performance in artistic style...
Text-driven image manipulation remains challenging in training or infere...
Reference-based image super-resolution (RefSR) is a promising SR branch ...
This paper reviews the Challenge on Super-Resolution of Compressed Image...
Video-text retrieval (VTR) is an attractive yet challenging task for mul...
We propose a novel image retouching method by modeling the retouching pr...
We present a novel one-shot method for object detection and 6 DoF pose e...
Pose estimation of 3D objects in monocular images is a fundamental and l...
A common challenge posed to robust semantic segmentation is the expensiv...
To achieve disentangled image manipulation, previous works depend heavil...
We study the problem of extracting correspondences between a pair of poi...
We consider a facility location game in which n agents reside at known l...
Neural painting refers to the procedure of producing a series of strokes...
Fast arbitrary neural style transfer has attracted widespread attention ...
Image retrieval is a fundamental task of obtaining images similar to the...
It is often beneficial for agents to pool their resources in order to be...
Inpainting arbitrary missing regions is challenging because learning val...
This paper reviews the first NTIRE challenge on quality enhancement of c...
Human pose transfer has received great attention due to its wide applica...
Artistic style transfer aims at migrating the style from an example imag...
Recently, research efforts have been concentrated on revealing how pre-t...
In recent work, Gourvès, Lesca, and Wilczynski propose a variant of the ...
This paper reviews the NTIRE 2020 challenge on video quality mapping (VQ...
Images or videos always contain multiple objects or actions. Multi-label...
In this work, we introduce a new problem, named story-preserving lon...
Existing action localization approaches adopt shallow temporal convoluti...
The task of video grounding, which temporally localizes a natural langua...
Despite the success of deep learning for static image understanding, it ...
In this report, our approach to tackling the task of ActivityNet 2018 Ki...
This paper describes our solution for the video recognition task of Acti...
This paper describes our solution for the video recognition task of the ...
In this paper, we study the stochastic combinatorial multi-armed bandit ...