Recent advances in neural reconstruction enable high-quality 3D object
r...
Recently, indiscernible scene understanding has attracted a lot of atten...
Neural-network-based single image depth prediction (SIDP) is a challengi...
We propose MM-REACT, a system paradigm that integrates ChatGPT with a po...
We introduce VA-DepthNet, a simple, effective, and accurate deep neural
...
Image-text contrastive learning models such as CLIP have demonstrated st...
Copy-Paste is a simple and effective data augmentation strategy for inst...
This paper presents OmniVL, a new foundation model to support both
image...
Vision-language (VL) pre-training has recently received considerable
att...
Unified vision-language frameworks have greatly advanced in recent years...
People say, "A picture is worth a thousand words". Then how can we get t...
In this paper, we design and train a Generative Image-to-text Transforme...
Recent state-of-the-art computer vision systems are trained from natural...
Visual recognition is recently learned via either supervised learning on...
Generative transformers have experienced rapid popularity growth in the
...
Aggressive data augmentation is a key component of the strong generaliza...
Automated visual understanding of our diverse and open world demands com...
Decomposing a scene into its shape, reflectance and illumination is a
fu...
Single image 3D photography enables viewers to view a still image from n...
Recently, Vision Transformers (ViTs) have shown competitive performance ...
Remarkable progress has been made in 3D reconstruction of rigid structur...
Synthetic datasets play a critical role in pre-training CNN models for
o...
Digital watermarking is widely used for copyright protection. Traditiona...
Video watermarking embeds a message into a cover video in an imperceptib...
Decomposing a scene into its shape, reflectance, and illumination is a
c...
This paper develops a framework for repeated matching markets. The model...
We describe a technique that automatically generates plausible depth map...
Image extension models have broad applications in image editing,
computa...
We present a method for predicting dense depth in scenarios where both a...
We study the problem of reconstructing an image from information stored ...