Linjun Li

research

∙ 07/25/2023

3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding

3D visual grounding aims to localize the target object in a 3D point clo...

Zehan Wang, et al.

research

∙ 07/18/2023

Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding

3D visual grounding involves finding a target object in a 3D scene that ...

Zehan Wang, et al.

research

∙ 06/10/2023

OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment

Speech Recognition builds a bridge between the multimedia streaming (aud...

Xize Cheng, et al.

research

∙ 05/24/2023

AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation

Direct speech-to-speech translation (S2ST) aims to convert speech from o...

Rongjie Huang, et al.

research

∙ 05/22/2023

Connecting Multi-modal Contrastive Representations

Multi-modal Contrastive Representation (MCR) learning aims to encode dif...

Zehan Wang, et al.

research

∙ 03/09/2023

MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition

Multi-media communications facilitate global interaction among people. H...

Xize Cheng, et al.

research

∙ 06/05/2021

Motion Planning Transformers: One Model to Plan Them All

Transformers have become the powerhouse of natural language processing a...

Jacob J. Johnson, et al.

research

∙ 01/17/2021

MPC-MPNet: Model-Predictive Motion Planning Networks for Fast, Near-Optimal Planning under Kinodynamic Constraints

Kinodynamic Motion Planning (KMP) is to find a robot motion subject to c...

Linjun Li, et al.

research

∙ 08/12/2020

Dynamically Constrained Motion Planning Networks for Non-Holonomic Robots

Reliable real-time planning for robots is essential in today's rapidly e...

Jacob J. Johnson, et al.
