Question answering on tabular data (a.k.a. TableQA), which aims at genera...
As we embark on a new era of LLMs, it becomes increasingly crucial to
un...
Large Language Models (LLMs) have gained prominence in the field of Lega...
The multimedia community has shown a significant interest in perceiving ...
The integration of retrieved passages and large language models (LLMs), ...
Studies on semi-supervised medical image segmentation (SSMIS) have seen ...
Recent advanced methods in Natural Language Understanding for Task-orien...
In this work, we study dialogue scenarios that start from chit-chat but
...
Despite advancements in conversational AI, language models encounter
cha...
Choice Modeling is at the core of many economics, operations, and market...
Spoken dialogue systems (SDSs) have been separately developed under two
...
The dominant paradigm of textual question answering systems is based on
...
Over the past few decades, multimodal emotion recognition has made remar...
Location recommendation plays a vital role in improving users' travel
ex...
Predicting the next location is a highly valuable and common need in man...
Just noticeable difference (JND) refers to the maximum visual change tha...
Dense retrievers have made significant strides in obtaining state-of-the...
Multimodal headline utilizes both video frames and transcripts to genera...
Parsing natural language questions into executable logical forms is a us...
In the scenario of unsupervised extractive summarization, learning
high-...
The pre-trained conversational models still fail to capture the implicit...
Scene segmentation and classification (SSC) serve as a critical step tow...
The extraction of text information in videos serves as a critical step
t...
Finding relevant moments and highlights in videos according to natural
l...
Defense models against adversarial attacks have grown significantly, but...
Massive open online courses (MOOCs), which provide a large-scale interac...
Few-shot table-to-text generation is a task of composing fluent and fait...
This work combines information about the dialogue history encoded by
pre...
Annotating microscopy images for nuclei segmentation is laborious and
ti...
The task of instance segmentation in remote sensing images, aiming at
pe...
With the proliferation of knowledge graphs, modeling data with complex
m...
Dense neural text retrieval has achieved promising results on open-domai...
Due to the vulnerability of deep neural networks (DNNs) to adversarial
e...
To capture the semantic graph structure from raw text, most existing
sum...
One challenge for dialogue agents is to recognize feelings of the
conver...
This paper presents an automatic method to evaluate the naturalness of
n...
Single image dehazing is a challenging task, for which the domain shift
...
Pretrained Transformer-based models were reported to be robust in intent...
The non-autoregressive models have boosted the efficiency of neural mach...
Open attribute value extraction for emerging entities is an important bu...
Generative commonsense reasoning, which aims to empower machines to gener...
We consider the problem of Human-Object Interaction (HOI) Detection, whi...
Healthcare question answering assistance aims to provide customer health...
The popularity of concurrent transmissions (CT) has soared after recent
...
Humans tackle reading comprehension not only based on the given context i...
The ubiquitous deployment of monitoring devices in urban flow monitoring...
Bas-relief generation based on 3D models is a hot topic in computer grap...
In this paper, we study the nonnegative tensor data and propose an ortho...
Ridge-valley features are important elements of point clouds, as they co...
In real-world question-answering (QA) systems, ill-formed questions, suc...