Vision-Language Pre-training (VLP) methods based on object detection enj...
Vision Transformer (ViT) based Vision-Language Pre-training (VLP) models...
Large language models (LLMs) are gaining increasing popularity in both a...
With the extensive accumulation of conversational data on the Internet, ...
Instruction tuning large language models (LLMs) remains a challenging ta...
The increasing reliance on Large Language Models (LLMs) across academia ...
Cross-modal contrastive learning in vision language pretraining (VLP) fa...
The capability of Large Language Models (LLMs) like ChatGPT to comprehen...
Vision-Language models (VLMs) that use contrastive language-image pre-tr...
End-to-end generation-based approaches have been investigated and applie...
ChatGPT is a recent chatbot service released by OpenAI and is receiving ...
The typical approach to relation extraction is fine-tuning large pre-trained...
Sequence generation demonstrates promising performance in recent informa...
Event detection (ED) identifies and classifies event triggers from unstr...
Direct time-of-flight (dToF) sensors are promising for next-generation o...
Code summarization generates brief natural language descriptions of sour...
While parameter efficient tuning (PET) methods have shown great potentia...
Semi-supervised learning (SSL) improves model generalization by leveragi...
Graph kernels are conventional methods for computing graph similarities....
Though widely used in industry, traditional task-oriented dialogue syste...
Graph neural networks (GNNs) often assume strong homophily in graphs, se...
The Graph Convolutional Network (GCN) is a pioneering model for graph-based...
AI and humans bring complementary skills to group deliberations. Modelin...
Temporally consistent depth estimation is crucial for real-time applicat...
Vision transformers (ViTs) have attracted much attention for their super...
Recently, deep clustering methods have gained momentum because of the hi...
Legal judgment prediction (LJP) is an essential task for legal AI. While ...
Capturing interactions among event arguments is an essential step toward...
Knowledge graphs (KGs) are widely used to facilitate relation extraction...
Code summaries are brief natural language descriptions of source code pi...
Automatic song writing aims to compose a song (lyric and/or melody) by m...
Graph-structured data arise in many scenarios. A fundamental problem is ...
Graph clustering has been studied extensively on both plain graphs and a...
Code summarization generates a brief natural language description given a ...
Graph-structured data arise ubiquitously in many application domains. A ...
In practical scenarios, relation extraction needs to first identify entit...