While large-scale image-text pretrained models such as CLIP have been us...
We propose im2nerf, a learning framework that predicts a continuous neur...
We present a method to infer the 3D pose of mice, including the limbs an...
Semantic segmentation of 3D meshes is an important problem for 3D scene
...
We present a simple and flexible object detection framework optimized fo...
We propose DOPS, a fast single-stage 3D object detection method for LIDA...
Is it possible to guess human action from dialogue alone? In this work w...