We introduce a zero-shot video captioning method that employs two frozen...
We study the problem of syncing the lip movement in a video with the aud...
Recent text-to-image matching models apply contrastive learning to large...
We present a novel approach for image-animation of a source image by a
d...