Nong Sang

research

∙ 09/14/2023

Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning

Recently, large-scale pre-trained language-image models like CLIP have s...

0 Zhiwu Qing, et al. ∙

research

∙ 08/24/2023

HR-Pro: Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation

Point-supervised Temporal Action Localization (PSTAL) is an emerging res...

0 Huaxin Zhang, et al. ∙

research

∙ 07/31/2023

Towards General Low-Light Raw Noise Synthesis and Modeling

Modeling and synthesizing low-light raw noise is a fundamental problem f...

0 Feng Zhang, et al. ∙

research

∙ 05/15/2023

PLIP: Language-Image Pre-training for Person Representation Learning

Pre-training has emerged as an effective technique for learning powerful...

0 Jialong Zuo, et al. ∙

research

∙ 04/03/2023

MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot Action Recognition

Current state-of-the-art approaches for few-shot action recognition achi...

0 Xiang Wang, et al. ∙

research

∙ 03/06/2023

CLIP-guided Prototype Modulating for Few-shot Action Recognition

Learning from large-scale contrastive language-image pre-training like C...

0 Xiang Wang, et al. ∙

research

∙ 01/12/2023

Semantic Segmentation via Pixel-to-Center Similarity Calculation

Since the fully convolutional network has achieved great success in sema...

0 Dongyue Wu, et al. ∙

research

∙ 01/09/2023

Parallel Reasoning Network for Human-Object Interaction Detection

Human-Object Interaction (HOI) detection aims to learn how human interac...

0 Huan Peng, et al. ∙

research

∙ 11/02/2022

Learning a Condensed Frame for Memory-Efficient Video Class-Incremental Learning

Recent incremental learning for action recognition usually stores repres...

0 Yixuan Pei, et al. ∙

research

∙ 07/24/2022

MAR: Masked Autoencoders for Efficient Action Recognition

Standard approaches for video recognition usually operate on the full in...

10 Zhiwu Qing, et al. ∙

research

∙ 06/18/2022

Context-aware Proposal Network for Temporal Action Detection

This technical report presents our first place winning solution for temp...

0 Xiang Wang, et al. ∙

research

∙ 03/12/2022

Joint CNN and Transformer Network via weakly supervised Learning for efficient crowd counting

Currently, for crowd counting, the fully supervised methods via density ...

0 Fusen Wang, et al. ∙

research

∙ 12/22/2021

Multi-Centroid Representation Network for Domain Adaptive Person Re-ID

Recently, many approaches tackle the Unsupervised Domain Adaptive person...

0 Yuhang Wu, et al. ∙

research

∙ 12/15/2021

Modality-Aware Triplet Hard Mining for Zero-shot Sketch-Based Image Retrieval

This paper tackles the Zero-Shot Sketch-Based Image Retrieval (ZS-SBIR) ...

0 Zongheng Huang, et al. ∙

research

∙ 12/05/2021

Implicit Neural Deformation for Multi-View Face Reconstruction

In this work, we present a new method for 3D face reconstruction from mu...

0 Moran Li, et al. ∙

research

∙ 12/03/2021

Unsupervised Low-Light Image Enhancement via Histogram Equalization Prior

Deep learning-based methods for low-light image enhancement typically re...

10 Feng Zhang, et al. ∙

research

∙ 09/21/2021

CondNet: Conditional Classifier for Scene Segmentation

The fully convolutional network (FCN) has achieved tremendous success in...

0 Changqian Yu, et al. ∙

research

∙ 09/13/2021

Weakly Supervised Person Search with Region Siamese Networks

Supervised learning is dominant in person search, but it requires elabor...

0 Chuchu Han, et al. ∙

research

∙ 08/24/2021

ParamCrop: Parametric Cubic Cropping for Video Contrastive Learning

The central idea of contrastive learning is to discriminate between diff...

7 Zhiwu Qing, et al. ∙

research

∙ 06/24/2021

Exploring Stronger Feature for Temporal Action Localization

Temporal action localization aims to localize starting and ending time w...

0 Zhiwu Qing, et al. ∙

research

∙ 06/21/2021

OadTR: Online Action Detection with Transformers

Most recent approaches for online action detection tend to apply Recurre...

0 Xiang Wang, et al. ∙

research

∙ 06/20/2021

Weakly-Supervised Temporal Action Localization Through Local-Global Background Modeling

Weakly-Supervised Temporal Action Localization (WS-TAL) task aims to rec...

0 Xiang Wang, et al. ∙

research

∙ 06/20/2021

Proposal Relation Network for Temporal Action Detection

This technical report presents our solution for temporal action detectio...

0 Xiang Wang, et al. ∙

research

∙ 06/13/2021

A Stronger Baseline for Ego-Centric Action Detection

This technical report analyzes an egocentric video action detection meth...

0 Zhiwu Qing, et al. ∙

research

∙ 06/09/2021

Towards Training Stronger Video Vision Transformers for EPIC-KITCHENS-100 Action Recognition

With the recent surge in the research of vision transformers, they have ...

0 Ziyuan Huang, et al. ∙

research

∙ 06/04/2021

Hybrid attention network based on progressive embedding scale-context for crowd counting

The existing crowd counting methods usually adopted attention mechanism ...

3 Fusen Wang, et al. ∙

research

∙ 04/13/2021

Lite-HRNet: A Lightweight High-Resolution Network

We present an efficient high-resolution network, Lite-HRNet, for human p...

0 Changqian Yu, et al. ∙

research

∙ 04/07/2021

Self-Supervised Learning for Semi-Supervised Temporal Action Proposal

Self-supervised learning presents a remarkable performance to utilize un...

0 Xiang Wang, et al. ∙

research

∙ 03/24/2021

Temporal Context Aggregation Network for Temporal Action Proposal Refinement

Temporal action proposal generation aims to estimate temporal intervals ...

0 Zhiwu Qing, et al. ∙

research

∙ 02/22/2021

Decoupled and Memory-Reinforced Networks: Towards Effective Feature Learning for One-Step Person Search

The goal of person search is to localize and match query persons from sc...

13 Chuchu Han, et al. ∙

research

∙ 12/17/2020

Exploiting Learnable Joint Groups for Hand Pose Estimation

In this paper, we propose to estimate 3D hand pose by recovering the 3D ...

1 Moran Li, et al. ∙

research

∙ 08/16/2020

Do Not Disturb Me: Person Re-identification Under the Interference of Other Pedestrians

In the conventional person Re-ID setting, it is widely assumed that crop...

0 Shizhen Zhao, et al. ∙

research

∙ 08/12/2020

Representative Graph Neural Network

Non-local operation is widely explored to model the long-range dependenc...

0 Changqian Yu, et al. ∙

research

∙ 08/07/2020

Multi-Level Temporal Pyramid Network for Action Detection

Currently, one-stage frameworks have been widely applied for temporal ac...

0 Xiang Wang, et al. ∙

research

∙ 08/03/2020

Adversarial Semantic Data Augmentation for Human Pose Estimation

Human pose estimation is the task of localizing body keypoints from stil...

0 Yanrui Bin, et al. ∙

research

∙ 06/13/2020

CBR-Net: Cascade Boundary Refinement Network for Action Detection: Submission to ActivityNet Challenge 2020 (Task 1)

In this report, we present our solution for the task of temporal action ...

0 Xiang Wang, et al. ∙

research

∙ 06/13/2020

Temporal Fusion Network for Temporal Action Localization:Submission to ActivityNet Challenge 2020 (Task E)

This technical report analyzes a temporal action localization method we ...

0 Zhiwu Qing, et al. ∙

research

∙ 05/20/2020

Relevant Region Prediction for Crowd Counting

Crowd counting is a concerned and challenging task in computer vision. E...

8 Xinya Chen, et al. ∙

research

∙ 05/10/2020

Domain Adaptation for Image Dehazing

Image dehazing using learning-based methods has achieved state-of-the-ar...

15 Yuanjie Shao, et al. ∙

research

∙ 04/05/2020

BiSeNet V2: Bilateral Network with Guided Aggregation for Real-time Semantic Segmentation

The low-level details and high-level semantics are both essential to the...

0 Changqian Yu, et al. ∙

research

∙ 04/03/2020

Context Prior for Scene Segmentation

Recent works have widely explored the contextual dependencies to achieve...

0 Changqian Yu, et al. ∙

research

∙ 01/19/2020

GTNet: Generative Transfer Network for Zero-Shot Object Detection

We propose a Generative Transfer Network (GTNet) for zero shot object de...

0 Shizhen Zhao, et al. ∙

research

∙ 09/18/2019

Re-ID Driven Localization Refinement for Person Search

Person search aims at localizing and identifying a query person from a g...

13 Chuchu Han, et al. ∙

research

∙ 08/02/2018

BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation

Semantic segmentation requires both rich spatial information and sizeabl...

0 Changqian Yu, et al. ∙

research

∙ 04/25/2018

Learning a Discriminative Feature Network for Semantic Segmentation

Most existing methods of semantic segmentation still suffer from two asp...

0 Changqian Yu, et al. ∙

research

∙ 03/09/2018

Learning a Discriminative Prior for Blind Image Deblurring

We present an effective blind image deblurring method based on a data-dr...

0 Lerenhan Li, et al. ∙

research

∙ 02/09/2018

Multiple Target Tracking by Learning Feature Representation and Distance Metric Jointly

Designing a robust affinity model is the key issue in multiple target tr...

0 Jun Xiang, et al. ∙

research

∙ 11/12/2016

Online Generative-Discriminative Model for Object Detection in Video: An Unsupervised Learning Framework

Traditional single-view object detection methods often perform worse und...

0 Dapeng Luo, et al. ∙

research

∙ 06/29/2016

Scene Text Detection via Holistic, Multi-Channel Prediction

Recently, scene text detection has become an active research topic in co...

0 Cong Yao, et al. ∙

Nong Sang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro