Yaohui Wang

research

∙ 08/28/2023

LAC: Latent Action Composition for Skeleton-based Action Segmentation

Skeleton-based action segmentation requires recognizing composable actio...

Di Yang, et al.

research

∙ 07/13/2023

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation

This paper introduces InternVid, a large-scale video-centric multimodal ...

Yi Wang, et al.

research

∙ 07/10/2023

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

With the advance of text-to-image models (e.g., Stable Diffusion) and co...

Yuwei Guo, et al.

research

∙ 05/10/2023

Self-Supervised Video Representation Learning via Latent Time Navigation

Self-supervised video representation learning aimed at maximizing simila...

Di Yang, et al.

research

∙ 05/06/2023

LEO: Generative Latent Image Animator for Human Video Synthesis

Spatio-temporal coherency is a major challenge in synthesizing high qual...

Yaohui Wang, et al.

research

∙ 05/02/2023

Long-Term Rhythmic Video Soundtracker

We consider the problem of generating musical soundtracks in sync with r...

Jiashuo Yu, et al.

research

∙ 04/24/2023

Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation

Diffusion models have attained impressive visual quality for image synth...

Zeyu Lu, et al.

research

∙ 01/02/2023

Learning Invariance from Generated Variance for Unsupervised Person Re-identification

This work focuses on unsupervised representation learning in person re-i...

Hao Chen, et al.

research

∙ 08/31/2022

ViA: View-invariant Skeleton Action Representation Learning via Motion Retargeting

Current self-supervised approaches for skeleton action representation le...

Di Yang, et al.

research

∙ 03/17/2022

Latent Image Animator: Learning to Animate Images via Latent Space Navigation

Due to the remarkable progress of deep generative models, animating imag...

Yaohui Wang, et al.

research

∙ 07/19/2021

UNIK: A Unified Framework for Real-world Skeleton-based Action Recognition

Action recognition based on skeleton data has recently witnessed increas...

Di Yang, et al.

research

∙ 01/08/2021

InMoDeGAN: Interpretable Motion Decomposition Generative Adversarial Network for Video Generation

In this work, we introduce an unconditional video generative model, InMo...

Yaohui Wang, et al.

research

∙ 12/16/2020

Joint Generative and Contrastive Learning for Unsupervised Person Re-identification

Annotating identity labels in large-scale datasets is a labour-intensive...

Hao Chen, et al.

research

∙ 11/10/2020

Selective Spatio-Temporal Aggregation Based Pose Refinement System: Towards Understanding Human Activities in Real-World Videos

Taking advantage of human pose data for understanding human activities h...

Di Yang, et al.

research

∙ 12/11/2019

G^3AN: This video does not exist. Disentangling motion and appearance for video generation

Creating realistic human videos introduces the challenge of being able t...

Yaohui Wang, et al.

