Wenliang Dai

research

∙ 07/03/2023

Visual Instruction Tuning with Polite Flamingo

Recent research has demonstrated that the multi-task fine-tuning of mult...

0 Delong Chen, et al. ∙

research

∙ 05/11/2023

InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning

General-purpose language models that can solve various language-domain t...

0 Wenliang Dai, et al. ∙

research

∙ 02/08/2023

A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity

This paper proposes a framework for quantitatively evaluating interactiv...

13 Yejin Bang, et al. ∙

research

∙ 12/19/2022

NusaCrowd: Open Source Initiative for Indonesian NLP Resources

We present NusaCrowd, a collaborative initiative to collect and unite ex...

0 Samuel Cahyawijaya, et al. ∙

research

∙ 10/14/2022

Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training

Large-scale vision-language pre-trained (VLP) models are prone to halluc...

0 Wenliang Dai, et al. ∙

research

∙ 07/06/2022

Kaggle Competition: Cantonese Audio-Visual Speech Recognition for In-car Commands

With the rise of deep learning and intelligent vehicles, the smart assis...

0 Wenliang Dai, et al. ∙

research

∙ 12/12/2021

ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation

Code-switching is a speech phenomenon when a speaker switches language d...

7 Holy Lovenia, et al. ∙

research

∙ 09/14/2021

Greenformer: Factorization Toolkit for Efficient Deep Neural Networks

While the recent advances in deep neural networks (DNN) bring remarkable...

0 Samuel Cahyawijaya, et al. ∙

research

∙ 09/06/2021

Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization

Multimodal abstractive summarization (MAS) models that summarize videos ...

7 Tiezheng Yu, et al. ∙

research

∙ 04/23/2021

Weakly-supervised Multi-task Learning for Multimodal Affect Recognition

Multimodal affect recognition constitutes an important aspect for enhanc...

13 Wenliang Dai, et al. ∙

research

∙ 03/17/2021

Multimodal End-to-End Sparse Model for Emotion Recognition

Existing works on multimodal affective computing tasks, such as emotion ...

16 Wenliang Dai, et al. ∙

research

∙ 12/08/2020

CrossNER: Evaluating Cross-Domain Named Entity Recognition

Cross-domain named entity recognition (NER) models are able to cope with...

7 Zihan Liu, et al. ∙

research

∙ 10/19/2020

Dimsum @LaySumm 20: BART-based Approach for Scientific Document Summarization

Lay summarization aims to generate lay summaries of scientific papers au...

0 Tiezheng Yu, et al. ∙

research

∙ 10/19/2020

Multi-hop Question Generation with Graph Convolutional Network

Multi-hop Question Generation (QG) aims to generate answer-related quest...

10 Dan Su, et al. ∙

research

∙ 09/21/2020

Modality-Transferable Emotion Embeddings for Low-Resource Multimodal Emotion Recognition

Despite the recent achievements made in the multi-modal emotion recognit...

0 Wenliang Dai, et al. ∙

research

∙ 04/28/2020

Kungfupanda at SemEval-2020 Task 12: BERT-Based Multi-Task Learning for Offensive Language Detection

Nowadays, offensive content in social media has become a serious problem...

0 Wenliang Dai, et al. ∙

Wenliang Dai

Featured Co-authors

Sign in with Google

Consider DeepAI Pro