Session-based recommendation techniques aim to capture dynamic user beha...
Multimodal vision-language (VL) learning has noticeably pushed the tende...
Referring Expression Comprehension (REC) is one of the most important ta...
Relying on Transformer for complex visual feature learning, object track...
Recently, online shopping has gradually become a common way of shopping ...
Due to balanced accuracy and speed, joint learning detection and ReID-ba...
The encoding of the target in object tracking moves from the coarse
boun...
Anchor-based Siamese trackers have achieved remarkable advancements in
a...