Wenqiao Zhang

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Yi Yang
329 publications
Qi Tian
238 publications
Peng Wang
224 publications
Fei Wu
174 publications
Tat-Seng Chua
173 publications
William Yang Wang
166 publications
Zhou Zhao
125 publications
Lei Zhu
121 publications
Ying Shan
106 publications
Shiliang Pu
96 publications
Hanwang Zhang
93 publications

research

∙ 08/08/2023

Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions

Multimodal Large Language Models (MLLMs) have recently sparked significa...

0 Juncheng Li, et al. ∙

research

∙ 04/20/2023

Learning in Imperfect Environment: Multi-Label Classification with Long-Tailed Distribution and Partial Labels

Conventional multi-label classification (MLC) methods assume that all sa...

0 Wenqiao Zhang, et al. ∙

research

∙ 04/10/2023

Toward Cohort Intelligence: A Universal Cohort Representation Learning Framework for Electronic Health Record Analysis

Electronic Health Records (EHR) are generated from clinical routine care...

0 Changshuo Liu, et al. ∙

research

∙ 03/30/2023

CAusal and collaborative proxy-tasKs lEarning for Semi-Supervised Domain Adaptation

Semi-supervised domain adaptation (SSDA) adapts a learner to a new domai...

0 Wenqiao Zhang, et al. ∙

research

∙ 03/12/2023

Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models

Prompt tuning, a recently emerging paradigm, enables the powerful vision...

0 Juncheng Li, et al. ∙

research

∙ 02/14/2023

IDEAL: Toward High-efficiency Device-Cloud Collaborative and Dynamic Recommendation System

Recommendation systems have shown great potential to solve the informati...

0 Zheqi Lv, et al. ∙

research

∙ 01/22/2023

Variational Cross-Graph Reasoning and Adaptive Structured Semantics Learning for Compositional Temporal Grounding

Temporal grounding is the task of locating a specific segment from an un...

0 Juncheng Li, et al. ∙

research

∙ 09/12/2022

DUET: A Tuning-Free Device-Cloud Collaborative Parameters Generation Framework for Efficient Device Model Generalization

Device Model Generalization (DMG) is a practical yet under-investigated ...

12 Zheqi Lv, et al. ∙

research

∙ 08/11/2022

HERO: HiErarchical spatio-tempoRal reasOning with Contrastive Action Correspondence for End-to-End Video Object Grounding

Video Object Grounding (VOG) is the problem of associating spatial objec...

0 Mengze Li, et al. ∙

research

∙ 08/03/2022

Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos

Understanding human emotions is a crucial ability for intelligent robots...

2 Juncheng Li, et al. ∙

research

∙ 07/09/2022

BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval

Content-Based Image Retrieval (CIR) aims to search for a target image by...

1 Wenqiao Zhang, et al. ∙

research

∙ 06/07/2022

Collaborative Intelligence Orchestration: Inconsistency-Based Fusion of Semi-Supervised Learning and Active Learning

While annotating decent amounts of data to satisfy sophisticated learnin...

0 Jiannan Guo, et al. ∙

research

∙ 05/31/2022

DeVRF: Fast Deformable Voxel Radiance Fields for Dynamic Scenes

Modeling dynamic scenes is important for many applications such as virtu...

0 Jia-Wei Liu, et al. ∙

research

∙ 03/15/2022

End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding

Natural language spatial video grounding aims to detect the relevant obj...

0 Mengze Li, et al. ∙

research

∙ 03/04/2022

BoostMIS: Boosting Medical Image Semi-supervised Learning with Adaptive Pseudo Labeling and Informative Active Annotation

In this paper, we propose a novel semi-supervised learning (SSL) framewo...

0 Wenqiao Zhang, et al. ∙

research

∙ 12/13/2021

MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-based Image Captioning

Text-based image captioning (TextCap) requires simultaneous comprehensio...

5 Wenqiao Zhang, et al. ∙

research

∙ 12/02/2021

Consensus Graph Representation Learning for Better Grounded Image Captioning

The contemporary visual captioning models frequently hallucinate objects...

0 Wenqiao Zhang, et al. ∙

research

∙ 12/02/2021

Relational Graph Learning for Grounded Video Description Generation

Grounded video description (GVD) encourages captioning models to attend ...

0 Wenqiao Zhang, et al. ∙

Success!

An error occurred

Wenqiao Zhang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro