Most recent works focus on answering first-order logical queries to expl...
Recent regulations on the Right to be Forgotten have greatly influenced ...
Recent compositional zero-shot learning (CZSL) methods adapt pre-trained...
Existing audio analysis methods generally first transform the audio stre...
Video temporal grounding aims to pinpoint a video segment that matches t...
Foundation models are pre-trained on massive data and transferred to dow...
This work presents a unified knowledge protocol, called UKnow, which fac...
Many recent studies leverage the pre-trained CLIP for text-video cross-m...
Hashing is an efficient method for nearest neighbor search in large-scal...