Deep learning has made significant strides in video understanding tasks,...
One of the main challenges in LiDAR-based 3D object detection is that th...
Federated learning (FL) is a promising approach for enhancing data priva...
Deep neural networks are widely known to be susceptible to adversarial
e...
We introduce Sketch-based Video Object Localization (SVOL), a new task a...
Previous anti-spoofing methods have used either pseudo maps or user-defi...
This paper presents a solution to the Weather4cast 2022 Challenge Stage ...
Standard multi-modal models assume the use of the same modalities in tra...
Real-world data often have a long-tailed distribution, where the number ...
Creating novel views from a single image has achieved tremendous strides...
Camera traps, unmanned observation devices, and deep learning-based imag...
In multi-modal action recognition, it is important to consider not only ...
The transferability of adversarial examples allows the deception on blac...
With the rising interest in deep learning-based methods in remote sensin...
Label distributions in camera-trap images are highly imbalanced and
long...
We present a new paradigm named explore-and-match for video grounding, w...
CNN-based face recognition models have brought remarkable performance
im...
Photometric consistency loss is one of the representative objective func...
Online action detection, which aims to identify an ongoing action from a...
Detecting and localizing image manipulation are necessary to counter
mal...
We present a novel approach for estimating depth from a monocular camera...
Few-shot learning often involves metric learning-based classifiers, whic...
Domain adaptation (DA) is a representation learning methodology that
tra...
In this paper, we attack a few-shot open-set recognition (FSOSR) problem...
While deep neural networks show unprecedented performance in various tas...
Federated learning is a paradigm that enables local devices to jointly t...
Although supervised person re-identification (Re-ID) methods have shown
...
Self-supervised monocular depth estimation has emerged as a promising me...
Style transfer is the image synthesis task, which applies a style of one...
Although unsupervised domain adaptation methods have been widely adopted...
Partial domain adaptation (PDA), in which we assume the target label spa...
Background and Objective: In pulmonary nodule detection, the first stage...
From a streaming video, online action detection aims to identify actions...
Visible-infrared person re-identification (VI-ReID) is an important task...
Recent temporal action proposal generation approaches have suggested
int...
Weakly supervised object localization has recently attracted attention s...
Bit-depth is the number of bits for each color channel of a pixel in an
...
Thesedays, Convolutional Neural Networks are widely used in semantic
seg...
Most video person re-identification (re-ID) methods are mainly based on
...
In this paper, we introduce a self-supervised approach for video object
...
Deep learning-based object detectors have shown remarkable improvements....
Deep learning-based semantic segmentation methods have an intrinsic
limi...
Detecting fashion landmarks is a fundamental technique for visual clothi...
To learn target discriminative representations, using pseudo-labels is a...
We introduce a novel unsupervised domain adaptation approach for object
...
In this paper, we propose a novel algorithm to rectify illumination of t...
We introduce a large-scale 3D shape understanding benchmark using data a...