Vision Transformers (ViTs) emerge to achieve impressive performance on m...
Generalized few-shot object detection aims to achieve precise detection ...
Given a long untrimmed video and natural language queries, video groundi...
Given an image and a reference caption, the image caption editing task a...
We study multimodal few-shot object detection (FSOD) in this paper, usin...
Few-shot object detection (FSOD), with the aim to detect novel objects u...
This work introduces a dataset for large-scale instance-level recognitio...
Few-shot object detection (FSOD) aims to detect never-seen objects using...
Few-shot object detection (FSOD) aims to detect objects using only few
e...
Recent works seek to endow recognition systems with the ability to handl...
To combat COVID-19, both clinicians and scientists need to digest the va...