Solid results from Transformers have made them prevailing architectures ...
Natural language spatial video grounding aims to detect the relevant obj...
This paper introduces a post-training quantization (PTQ) method achievin...
In recent years, significant progress has been made on the research of c...
This paper presents an end-to-end instance segmentation framework, terme...
This report summarizes the results of Learning to Understand Aerial Imag...
Multi-person pose estimation is an attractive and challenging task. Exis...
Text recognition is a popular topic for its broad applications. In this ...
Table structure recognition is a challenging task due to the various
str...
Fast and precise object detection for high-resolution aerial images has ...
3D vehicle detection based on multi-modal fusion is an important task of...
In this paper we propose a novel decomposition method based on filter gr...