Dong Chen

research

∙ 09/07/2023

InstructDiffusion: A Generalist Modeling Interface for Vision Tasks

We present InstructDiffusion, a unifying and generic framework for align...

0 Zigang Geng, et al. ∙

research

∙ 08/13/2023

Foundation Models in Smart Agriculture: Basics, Opportunities, and Challenges

The past decade has witnessed the rapid development of ML and DL methodo...

0 Jiajia Li, et al. ∙

research

∙ 06/06/2023

FaaSwap: SLO-Aware, GPU-Efficient Serverless Inference via Model Swapping

The dynamic request patterns of machine learning (ML) inference workload...

0 Minchen Yu, et al. ∙

research

∙ 05/24/2023

Label-Efficient Learning in Agriculture: A Comprehensive Review

The past decade has witnessed many great successes of machine learning (...

0 Jiajia Li, et al. ∙

research

∙ 03/24/2023

Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior

In this work, we investigate the problem of creating high-fidelity 3D co...

1 Junshu Tang, et al. ∙

research

∙ 03/22/2023

CiCo: Domain-Aware Sign Language Retrieval via Cross-Lingual Contrastive Learning

This work focuses on sign language retrieval-a recently proposed task fo...

0 Yiting Cheng, et al. ∙

research

∙ 03/17/2023

IRGen: Generative Modeling for Image Retrieval

While generative modeling has been ubiquitous in natural language proces...

0 Yidan Zhang, et al. ∙

research

∙ 03/16/2023

Efficient Diffusion Training via Min-SNR Weighting Strategy

Denoising diffusion models have been a mainstream approach for image gen...

0 Tiankai Hang, et al. ∙

research

∙ 03/08/2023

O2RNet: Occluder-Occludee Relational Network for Robust Apple Detection in Clustered Orchard Environments

Automated apple harvesting has attracted significant research interest i...

0 Pengyu Chu, et al. ∙

research

∙ 02/18/2023

Hyneter: Hybrid Network Transformer for Object Detection

In this paper, we point out that the essential differences between CNN-b...

0 Dong Chen, et al. ∙

research

∙ 12/19/2022

FreeEnricher: Enriching Face Landmarks without Additional Cost

Recent years have witnessed significant growth of face alignment. Though...

0 Yangyu Huang, et al. ∙

research

∙ 12/12/2022

CLIP Itself is a Strong Fine-tuner: Achieving 85.7 Accuracy with ViT-B and ViT-L on ImageNet

Recent studies have shown that CLIP has achieved remarkable success in p...

0 Xiaoyi Dong, et al. ∙

research

∙ 12/07/2022

X-Paste: Revisit Copy-Paste at Scale with CLIP and StableDiffusion

Copy-Paste is a simple and effective data augmentation strategy for inst...

0 Hanqing Zhao, et al. ∙

research

∙ 11/23/2022

Paint by Example: Exemplar-based Image Editing with Diffusion Models

Language-guided image editing has achieved great success recently. In th...

0 Binxin Yang, et al. ∙

research

∙ 11/22/2022

SinDiffusion: Learning a Diffusion Model from a Single Natural Image

We present SinDiffusion, leveraging denoising diffusion models to captur...

0 Weilun Wang, et al. ∙

research

∙ 11/18/2022

A Structure-Guided Diffusion Model for Large-Hole Diverse Image Completion

Diverse image completion, a problem of generating various ways of fillin...

0 Daichi Horita, et al. ∙

research

∙ 10/18/2022

Deep Data Augmentation for Weed Recognition Enhancement: A Diffusion Probabilistic Model and Transfer Learning Based Approach

Weed management plays an important role in many modern agricultural appl...

8 Dong Chen, et al. ∙

research

∙ 10/05/2022

DigiFace-1M: 1 Million Digital Face Images for Face Recognition

State-of-the-art face recognition models show impressive accuracy, achie...

13 Gwangbin Bae, et al. ∙

research

∙ 09/12/2022

Explicitly Controllable 3D-Aware Portrait Generation

In contrast to the traditional avatar creation pipeline which is a costl...

10 Junshu Tang, et al. ∙

research

∙ 08/25/2022

MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining

This paper presents a simple yet effective framework MaskCLIP, which inc...

18 Xiaoyi Dong, et al. ∙

research

∙ 08/22/2022

Event-Triggered Model Predictive Control with Deep Reinforcement Learning for Autonomous Driving

Event-triggered model predictive control (eMPC) is a popular optimal con...

0 Fengying Dang, et al. ∙

research

∙ 07/14/2022

Bootstrapped Masked Autoencoders for Vision BERT Pretraining

We propose bootstrapped masked autoencoders (BootMAE), a new approach fo...

21 Xiaoyi Dong, et al. ∙

research

∙ 06/30/2022

Semantic Image Synthesis via Diffusion Models

Denoising Diffusion Probabilistic Models (DDPMs) have achieved remarkabl...

6 Weilun Wang, et al. ∙

research

∙ 06/22/2022

I^2R-Net: Intra- and Inter-Human Relation Network for Multi-Person Pose Estimation

In this paper, we present the Intra- and Inter-Human Relation Networks (...

8 Yiwei Ding, et al. ∙

research

∙ 06/04/2022

Robust Meta-learning with Sampling Noise and Label Noise via Eigen-Reptile

Recent years have seen a surge of interest in meta-learning techniques f...

10 Dong Chen, et al. ∙

research

∙ 05/31/2022

Improved Vector Quantized Diffusion Models

Vector quantized diffusion (VQ-Diffusion) is a powerful generative model...

22 Zhicong Tang, et al. ∙

research

∙ 05/27/2022

Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation

Masked image modeling (MIM) learns representations with remarkably good ...

4 Yixuan Wei, et al. ∙

research

∙ 04/18/2022

Distributed Neural Precoding for Hybrid mmWave MIMO Communications with Limited Feedback

Hybrid precoding is a cost-efficient technique for millimeter wave (mmWa...

0 Kai Wei, et al. ∙

research

∙ 04/10/2022

Generative Adversarial Networks for Image Augmentation in Agriculture: A Systematic Review

In agricultural image analysis, optimal model performance is keenly purs...

8 Ebenezer Olaniyi, et al. ∙

research

∙ 03/30/2022

Large-Scale Pre-training for Person Re-identification with Noisy Labels

This paper aims to address the problem of pre-training for person re-ide...

4 Dengpan Fu, et al. ∙

research

∙ 03/29/2022

Semi-Supervised Image-to-Image Translation using Latent Space Mapping

Recent image-to-image translation works have been transferred from super...

7 Pan Zhang, et al. ∙

research

∙ 03/02/2022

Protecting Celebrities with Identity Consistency Transformer

In this work we propose Identity Consistency Transformer, a novel face f...

9 Xiaoyi Dong, et al. ∙

research

∙ 01/13/2022

Observability Analysis and Keyframe-Based Filtering for Visual Inertial Odometry with Full Self-Calibration

Camera-IMU (Inertial Measurement Unit) sensor fusion has been extensivel...

0 Jianzhu Huai, et al. ∙

research

∙ 12/20/2021

StyleSwin: Transformer-based GAN for High-resolution Image Generation

Despite the tantalizing success in a broad of vision tasks, transformers...

15 Bowen Zhang, et al. ∙

research

∙ 12/06/2021

General Facial Representation Learning in a Visual-Linguistic Manner

How to learn a universal facial representation that boosts all face anal...

7 Yinglin Zheng, et al. ∙

research

∙ 11/29/2021

Vector Quantized Diffusion Model for Text-to-Image Synthesis

We present the vector quantized diffusion (VQ-Diffusion) model for text-...

10 Shuyang Gu, et al. ∙

research

∙ 11/24/2021

PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers

This paper explores a better codebook for BERT pre-training of vision tr...

23 Xiaoyi Dong, et al. ∙

research

∙ 11/11/2021

Multi-agent Reinforcement Learning for Cooperative Lane Changing of Connected and Autonomous Vehicles in Mixed Traffic

Autonomous driving has attracted significant research interests in the p...

9 Wei Zhou, et al. ∙

research

∙ 10/13/2021

Offline Reinforcement Learning for Autonomous Driving with Safety and Exploration Enhancement

Reinforcement learning (RL) is a powerful data-driven control method tha...

0 Tianyu Shi, et al. ∙

research

∙ 10/11/2021

Performance Evaluation of Deep Transfer Learning on Multiclass Identification of Common Weed Species in Cotton Production Systems

Precision weed management offers a promising solution for sustainable cr...

9 Dong Chen, et al. ∙

research

∙ 09/17/2021

Proteome-informed machine learning studies of cocaine addiction

Cocaine addiction accounts for a large portion of substance use disorder...

5 Kaifu Gao, et al. ∙

research

∙ 08/15/2021

Exploring Temporal Coherence for More General Video Face Forgery Detection

Although current face manipulation techniques achieve impressive perform...

2 Yinglin Zheng, et al. ∙

research

∙ 08/13/2021

Dual Path Learning for Domain Adaptation of Semantic Segmentation

Domain adaptation for semantic segmentation enables to alleviate the nee...

5 Yiting Cheng, et al. ∙

research

∙ 08/10/2021

Instance-wise Hard Negative Example Generation for Contrastive Learning in Unpaired Image-to-Image Translation

Contrastive learning shows great potential in unpaired image-to-image tr...

5 Weilun Wang, et al. ∙

research

∙ 07/24/2021

BIoTA Control-Aware Attack Analytics for Building Internet of Things

Modern building control systems adopt demand control heating, ventilatio...

0 Nur Imtiazul Haque, et al. ∙

research

∙ 07/08/2021

Reinforcement Learning based Negotiation-aware Motion Planning of Autonomous Vehicles

For autonomous vehicles integrating onto roadways with human traffic par...

0 Zhitao Wang, et al. ∙

research

∙ 07/01/2021

CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows

We present CSWin Transformer, an efficient and effective Transformer-bas...

2 Xiaoyi Dong, et al. ∙

research

∙ 06/01/2021

Robust Mutual Learning for Semi-supervised Semantic Segmentation

Recent semi-supervised learning (SSL) methods are commonly based on pseu...

14 Pan Zhang, et al. ∙

research

∙ 03/29/2021

High-Fidelity and Arbitrary Face Editing

Cycle consistency is widely used for face editing. However, we observe t...

9 Yue Gao, et al. ∙

research

∙ 03/22/2021

Control Distance IoU and Control Distance IoU Loss Function for Better Bounding Box Regression

Numerous improvements for feedback mechanisms have contributed to the gr...

8 Dong Chen, et al. ∙

Dong Chen

Featured Co-authors

Sign in with Google

Consider DeepAI Pro