Zineng Tang | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Mohit Bansal
184 publications
Heng Ji
126 publications
Chenguang Zhu
57 publications
Michael Zeng
51 publications
Jie Lei
34 publications
Hao Tan
26 publications
ZiYi Yang
26 publications
Yixin Nie
19 publications
Jaemin Cho
17 publications
Sha Li
17 publications
Zhenhailong Wang
9 publications

research

∙ 05/19/2023

Any-to-Any Generation via Composable Diffusion

We present Composable Diffusion (CoDi), a novel generative model capable...

3 Zineng Tang, et al. ∙

research

∙ 05/18/2023

Paxion: Patching Action Knowledge in Video-Language Foundation Models

Action knowledge involves the understanding of textual, visual, and temp...

3 Zhenhailong Wang, et al. ∙

research

∙ 11/21/2022

Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention

We present Perceiver-VL, a vision-and-language framework that efficientl...

6 Zineng Tang, et al. ∙

research

∙ 09/28/2022

TVLT: Textless Vision-Language Transformer

In this work, we present the Textless Vision-Language Transformer (TVLT)...

4 Zineng Tang, et al. ∙

research

∙ 07/06/2021

VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer

Since visual perception can give rich information beyond text descriptio...

3 Zineng Tang, et al. ∙

research

∙ 05/13/2020

Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA

Videos convey rich information. Dynamic spatio-temporal relationships be...

24 Hyounghun Kim, et al. ∙

Success!

An error occurred