Bryan Seybold | DeepAI

DeepAI

AI Chat AI Image Generator AI Video AI Music Voice Chat AI Photo Editor Math AI

Featured Co-authors

YI LIU
156 publications
Cordelia Schmid
156 publications
C. -C. Jay Kuo
151 publications
Chen Sun
74 publications
Jia Deng
61 publications
Arsha Nagrani
43 publications
Rahul Sukthankar
35 publications
Bo Hu
32 publications
Shan Yang
30 publications
Ron J. Weiss
30 publications
Rif A. Saurous
26 publications

research

∙ 12/20/2022

Open-Vocabulary Temporal Action Detection with Off-the-Shelf Image-Text Features

Detecting actions in untrimmed videos should not be limited to a small, ...

0 Vivek Rathod, et al. ∙

research

∙ 05/12/2022

What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics

While there have been significant gains in the field of automated video ...

6 David M. Chan, et al. ∙

research

∙ 04/01/2022

Learning Audio-Video Modalities from Image Captions

A major challenge in text-video and text-audio retrieval is the lack of ...

3 Arsha Nagrani, et al. ∙

research

∙ 06/17/2021

Optical Mouse: 3D Mouse Pose From Single-View Video

We present a method to infer the 3D pose of mice, including the limbs an...

3 Bo Hu, et al. ∙

research

∙ 05/17/2019

Dueling Decoders: Regularizing Variational Autoencoder Latent Spaces

Variational autoencoders learn unsupervised data representations, but th...

0 Bryan Seybold, et al. ∙

research

∙ 04/20/2018

Rethinking the Faster R-CNN Architecture for Temporal Action Localization

We propose TAL-Net, an improved approach to temporal action localization...

0 Yu-Wei Chao, et al. ∙

research

∙ 01/03/2018

Instance Embedding Transfer to Unsupervised Video Object Segmentation

We propose a method for unsupervised video object segmentation by transf...

0 Siyang Li, et al. ∙

research

∙ 09/29/2016

CNN Architectures for Large-Scale Audio Classification

Convolutional Neural Networks (CNNs) have proven very effective in image...

0 Shawn Hershey, et al. ∙