A challenge in the Dialogue State Tracking (DST) field is adapting model...
The standard Gaussian Process (GP) only considers a single output sample...
Stance detection determines whether the author of a piece of text is in ...
This study investigates machine translation between related languages i....
Adapting a large language model for multiple-attribute text style transf...
Conversational Question Generation (CQG) is a critical task for machines...
Text-to-speech (TTS) models have achieved remarkable naturalness in rece...
Task-oriented dialogue (TOD) systems have assisted users on many tasks, ...
Sequence-to-sequence deep neural models fine-tuned for abstractive summa...
Neural models are known to be over-parameterized, and recent work has sh...
Conversational question generation (CQG) serves as a vital task for mach...
Designed for tracking user goals in dialogues, a dialogue state tracker ...
There is growing interest in the automated extraction of relevant inform...
Text style transfer is an important task in controllable language genera...
Sequence-to-sequence neural networks have recently achieved great succes...
In this work, we investigate pronunciation differences in English spoken...
Catastrophic forgetting is a thorny challenge when updating keyword spot...
While multi-party conversations are often less structured than monologue...
Text discourse parsing weighs importantly in understanding information f...
In this paper, we propose a controllable neural generation framework tha...
Recently, abstractive spoken language summarization raises emerging resea...
Speech evaluation is an essential component in computer-assisted languag...
Video-grounded dialogue systems aim to integrate video understanding and...
Summarizing conversations via neural approaches has been gaining researc...
Neural module networks (NMN) have achieved success in image-grounded tas...
Compared to traditional visual question answering, video-grounded dialog...
The collection and annotation of task-oriented conversational data is a ...
Acoustic modeling for child speech is challenging due to the high acoust...
Document-level discourse parsing, in accordance with the Rhetorical Stru...
Text discourse parsing plays an important role in understanding informat...
Video-grounded dialogues are very challenging due to (i) the complexity ...
Spell check is a useful application which involves processing noisy huma...
Building an end-to-end conversational agent for multi-domain task-orient...
Audio-Visual Scene-Aware Dialog (AVSD) is an extension from Video Questi...
Logographs (Chinese characters) have recursive structures (i.e. hierarch...
We propose an architecture to jointly learn word and label embeddings fo...
Due to the lack of publicly available resources, conversation summarizat...
Developing Video-Grounded Dialogue Systems (VGDS), where a dialogue is c...
Data for human-human spoken dialogues for research and development are c...
Transliteration converts words in a source language (e.g., English) into...
Graphemes of most languages encode pronunciation, though some are more e...