Most existing forecasting systems are memory-based methods, which attemp...
Diffusion models have emerged as a powerful method of generative modelin...
The 4D millimeter-wave (mmWave) radar, capable of measuring the range, a...
With the exponential growth of video data, there is an urgent need for a...
Enhancing the robustness of vision algorithms in real-world scenarios is...
In this report, we present our champion solutions to five tracks at Ego4...
Capturing the state changes of interacting objects is a key technology f...
We provide the technical report for Ego4D audio-only diarization challen...
In this paper, we tackle the problem of active robotic 3D reconstruction...
This paper proposes the first real-world rolling shutter (RS) correction...
No-Reference Image Quality Assessment (NR-IQA) aims to assess the percep...
Human grasping synthesis has numerous applications including AR/VR, vide...
Dance challenges are going viral in video communities like TikTok nowada...
This report summarizes the results of Learning to Understand Aerial Imag...
In this paper, we introduce the Multi-Modal Video Reasoning and Analyzin...
Fine-grained action recognition is attracting increasing attention due t...
This paper studies the neural architecture search (NAS) problem for deve...