Federated learning (FL) emerges as a decentralized learning framework wh...
Federated Learning (FL) offers a collaborative training framework, allow...
Audio-visual learning has been a major pillar of multi-modal machine
lea...
To understand how deep neural networks perform classification prediction...
We tackle the problem of target-free text-guided image manipulation, whi...
Novel object captioning (NOC) aims to describe images containing objects...
While self-supervised learning has been shown to benefit a number of vis...
Diffusion models (DMs) have shown great potential for high-quality image...
Face anti-spoofing (FAS) aims at distinguishing face spoof attacks from ...
In this paper, we address the task of semantics-guided image outpainting...
We present Neural Mixtures of Planar Experts (NeurMiPs), a novel planar-...
Anomaly detection aims to identify abnormal data that deviates from the
...
Few-shot classification aims to carry out classification given only few
...
How to handle domain shifts when recognizing or segmenting visual data a...
Few-shot semantic segmentation addresses the learning task in which only...
Human perceives rich auditory experience with distinct sound heard by ea...
Generating videos with content and motion variations is a challenging ta...
Representation disentanglement aims at learning interpretable features, ...
Learning interpretable and interpolatable latent representations has bee...
Aiming at recognizing images of the same person across distinct camera v...
Person re-identification (re-ID) requires one to match images of the sam...
To address semi-supervised learning from both labeled and unlabeled data...
Single image deraining is a crucial problem because rain severely degene...
Video summarization is among challenging tasks in computer vision, which...
Video summarization is among challenging tasks in computer vision, which...
Person re-identification (re-ID) aims at matching images of the same per...
Person re-identification (re-ID) aims at recognizing the same person fro...
Person re-identification (re-ID) aims at matching images of the same ide...
Video-based person re-identification (Re-ID) aims at matching video sequ...
Person re-identification (re-ID) solves the task of matching images acro...
Few-shot classification aims to learn a classifier to recognize unseen
c...
Audio-visual event localization requires one to identify theevent which ...
Aiming at inferring 3D shapes from 2D images, 3D shape reconstruction ha...
We present a novel and unified deep learning framework which is capable ...
Deep reinforcement learning has shown its success in game playing. Howev...
Person re-identification (Re-ID) aims at recognizing the same person fro...
With the increasing amount of video data, it is desirable to highlight o...
In this paper, we propose a novel deep learning architecture for multi-l...
In this paper, we propose the joint learning attention and recurrent neu...
While representation learning aims to derive interpretable features for
...
Despite the recent success of deep-learning based semantic segmentation,...