Pre-trained vision-language models (VLMs) have shown impressive performa...
In recent years, prompt tuning has proven effective in adapting pre-trai...
Interpretability is a crucial factor in building reliable models for var...
Learning implicit templates as neural fields has recently shown impressi...
Video Question Answering (VideoQA) is a challenging task that entails co...
Transformers have shown superior performance on various computer vision ...
Foundation models have shown outstanding performance and generalization
...
The application of modern machine learning to retinal image analyses off...
We present a learning framework for reconstructing neural scene
represen...
Multi-resolution hash encoding has recently been proposed to reduce the
...
Recent studies have proven that DNNs, unlike human vision, tend to explo...
Question Answering (QA) is a task that entails reasoning over natural
la...
Normalizing flows model probability distributions by learning invertible...
Mixup is a commonly adopted data augmentation technique for image
classi...
Data augmentation is key to improving the generalization ability of deep...
Transformer-based models have been widely used and achieved state-of-the...
Graph Neural Networks (GNNs) have been widely applied to various fields ...
Human-Object Interaction detection is a holistic visual recognition task...
Learning generic joint representations for video and text by a supervise...
Graph Neural Networks (GNNs) often suffer from weak-generalization due t...
We aim to detect and identify multiple objects using multiple cameras an...
Graph neural networks (GNNs) have significantly improved the representat...
Despite the extensive usage of point clouds in 3D vision, relatively lim...
Graph Neural Networks (GNNs) have been widely applied to various fields ...
Human-Object Interaction (HOI) detection is a task of identifying "a set...
In recent years, graph neural networks (GNNs) have been widely adopted i...
Deep neural networks have achieved state-of-the-art performance in a var...
Graph neural networks have shown superior performance in a wide range of...
Disentangling content and style information of an image has played an
im...
Deep convolutional neural networks have achieved remarkable success in
c...
Recent results in coupled or temporal graphical models offer schemes for...