Ce Liu

research

∙ 06/15/2023

NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations

Recent advances in neural reconstruction enable high-quality 3D object r...

0 Varun Jampani, et al. ∙

research

∙ 04/23/2023

Indiscernible Object Counting in Underwater Scenes

Recently, indiscernible scene understanding has attracted a lot of atten...

0 Guolei Sun, et al. ∙

research

∙ 03/31/2023

Single Image Depth Prediction Made Better: A Multivariate Gaussian Take

Neural-network-based single image depth prediction (SIDP) is a challengi...

0 Ce Liu, et al. ∙

research

∙ 03/20/2023

MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action

We propose MM-REACT, a system paradigm that integrates ChatGPT with a po...

0 Zhengyuan Yang, et al. ∙

research

∙ 02/13/2023

VA-DepthNet: A Variational Approach to Single Image Depth Prediction

We introduce VA-DepthNet, a simple, effective, and accurate deep neural ...

0 Ce Liu, et al. ∙

research

∙ 01/17/2023

Learning Customized Visual Models with Retrieval-Augmented Knowledge

Image-text contrastive learning models such as CLIP have demonstrated st...

10 Haotian Liu, et al. ∙

research

∙ 12/07/2022

X-Paste: Revisit Copy-Paste at Scale with CLIP and StableDiffusion

Copy-Paste is a simple and effective data augmentation strategy for inst...

0 Hanqing Zhao, et al. ∙

research

∙ 09/15/2022

OmniVL:One Foundation Model for Image-Language and Video-Language Tasks

This paper presents OmniVL, a new foundation model to support both image...

27 Junke Wang, et al. ∙

research

∙ 06/15/2022

Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone

Vision-language (VL) pre-training has recently received considerable att...

13 Zi-Yi Dou, et al. ∙

research

∙ 06/14/2022

LAVENDER: Unifying Video-Language Understanding as Masked Language Modeling

Unified vision-language frameworks have greatly advanced in recent years...

9 Linjie Li, et al. ∙

research

∙ 06/03/2022

Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning

People say, "A picture is worth a thousand words". Then how can we get t...

0 Yujia Xie, et al. ∙

research

∙ 05/27/2022

GIT: A Generative Image-to-text Transformer for Vision and Language

In this paper, we design and train a Generative Image-to-text Transforme...

14 Jianfeng Wang, et al. ∙

research

∙ 04/20/2022

K-LITE: Learning Transferable Visual Models with External Knowledge

Recent state-of-the-art computer vision systems are trained from natural...

3 Sheng Shen, et al. ∙

research

∙ 04/07/2022

Unified Contrastive Learning in Image-Text-Label Space

Visual recognition is recently learned via either supervised learning on...

20 Jianwei Yang, et al. ∙

research

∙ 02/08/2022

MaskGIT: Masked Generative Image Transformer

Generative transformers have experienced rapid popularity growth in the ...

10 Huiwen Chang, et al. ∙

research

∙ 11/30/2021

Pyramid Adversarial Training Improves ViT Performance

Aggressive data augmentation is a key component of the strong generaliza...

7 Charles Herrmann, et al. ∙

research

∙ 11/22/2021

Florence: A New Foundation Model for Computer Vision

Automated visual understanding of our diverse and open world demands com...

4 Lu Yuan, et al. ∙

research

∙ 10/27/2021

Neural-PIL: Neural Pre-Integrated Lighting for Reflectance Decomposition

Decomposing a scene into its shape, reflectance and illumination is a fu...

0 Mark Boss, et al. ∙

research

∙ 09/02/2021

SLIDE: Single Image 3D Photography with Soft Layering and Depth-aware Inpainting

Single image 3D photography enables viewers to view a still image from n...

2 Varun Jampani, et al. ∙

research

∙ 07/09/2021

ViTGAN: Training GANs with Vision Transformers

Recently, Vision Transformers (ViTs) have shown competitive performance ...

24 Kwonjoon Lee, et al. ∙

research

∙ 05/06/2021

LASR: Learning Articulated Shape Reconstruction from a Monocular Video

Remarkable progress has been made in 3D reconstruction of rigid structur...

2 Gengshan Yang, et al. ∙

research

∙ 04/29/2021

AutoFlow: Learning a Better Training Set for Optical Flow

Synthetic datasets play a critical role in pre-training CNN models for o...

10 Deqing Sun, et al. ∙

research

∙ 04/27/2021

Deep 3D-to-2D Watermarking: Embedding Messages in 3D Meshes and Extracting Them from 2D Renderings

Digital watermarking is widely used for copyright protection. Traditiona...

0 Innfarn Yoo, et al. ∙

research

∙ 04/26/2021

DVMark: A Deep Multiscale Framework for Video Watermarking

Video watermarking embeds a message into a cover video in an imperceptib...

0 Xiyang Luo, et al. ∙

research

∙ 12/07/2020

NeRD: Neural Reflectance Decomposition from Image Collections

Decomposing a scene into its shape, reflectance, and illumination is a c...

0 Mark Boss, et al. ∙

research

∙ 07/07/2020

Stability in Repeated Matching Markets

This paper develops a framework for repeated matching markets. The model...

0 Ce Liu, et al. ∙

research

∙ 12/24/2019

DepthTransfer: Depth Extraction from Video Using Non-parametric Sampling

We describe a technique that automatically generates plausible depth map...

0 Kevin Karsch, et al. ∙

research

∙ 08/19/2019

Boundless: Generative Adversarial Networks for Image Extension

Image extension models have broad applications in image editing, computa...

1 Piotr Teterwak, et al. ∙

research

∙ 04/25/2019

Learning the Depths of Moving People by Watching Frozen People

We present a method for predicting dense depth in scenarios where both a...

12 Zhengqi Li, et al. ∙

research

∙ 12/21/2017

Smart, Sparse Contours to Represent and Edit Images

We study the problem of reconstructing an image from information stored ...

0 Tali Dekel, et al. ∙

Ce Liu

Featured Co-authors

Sign in with Google

Consider DeepAI Pro