Existing Unbiased Scene Graph Generation (USGG) methods only focus on
ad...
Generating consecutive descriptions for videos, i.e., Video Captioning,
...
Studies of image captioning are shifting towards a trend of a fully
end-...
The current studies of Scene Graph Generation (SGG) focus on solving the...
The performance of current Scene Graph Generation (SGG) models is severe...
Scene Graph Generation (SGG) represents objects and their interactions w...
Recently, attention-based Visual Question Answering (VQA) has achieved g...
To date, visual question answering (VQA) (i.e., image QA and video QA) i...
Video captioning is a challenging task that necessitates a thorough
comp...