Visual foundation models like CLIP excel in learning feature representat...
Large-scale embedding-based retrieval (EBR) is the cornerstone of search...
The traditional model upgrading paradigm for retrieval requires recomput...
The task of privacy-preserving model upgrades in image retrieval desires...
Conventional model upgrades for visual search systems require offline re...
The task of hot-refresh model upgrades of image retrieval systems plays ...
Spatio-temporal representation learning is critical for video self-super...
Vision Transformer (ViT) and its variants (e.g., Swin, PVT) have achieve...
As a basic task of computer vision, image similarity retrieval is facing...