3D visual grounding aims to localize the target object in a 3D point clo...
3D visual grounding involves finding a target object in a 3D scene that ...
Most sign language translation (SLT) methods to date require the use of ...
Multi-modal Contrastive Representation (MCR) learning aims to encode dif...
Multi-media communications facilitate global interaction among people. H...
Sign language translation as a kind of technology with profound social s...