Yuqing Song

research

∙ 03/12/2023

Accommodating Audio Modality in CLIP for Multimodal Processing

Multimodal processing has attracted much attention lately especially wit...

Ludan Ruan, et al.

research

∙ 07/18/2022

Unifying Event Detection and Captioning as Sequence Generation via Pre-Training

Dense video captioning aims to generate corresponding text descriptions ...

Qi Zhang, et al.

research

∙ 06/24/2022

Some theoretical results on discrete contour trees

Contour trees have been developed to visualize or encode scalar data in ...

Yuqing Song, et al.

research

∙ 04/24/2022

Progressive Learning for Image Retrieval with Hybrid-Modality Queries

Image retrieval with hybrid-modality queries, also known as composing te...

Yida Zhao, et al.

research

∙ 08/25/2021

Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training

Translating e-commercial product descriptions, a.k.a product-oriented ma...

Yuqing Song, et al.

research

∙ 06/11/2021

Team RUC_AIM3 Technical Report at ActivityNet 2021: Entities Object Localization

Entities Object Localization (EOL) aims to evaluate how grounded or fait...

Ludan Ruan, et al.

research

∙ 05/30/2021

Towards Diverse Paragraph Captioning for Untrimmed Videos

Video paragraph captioning aims to describe multiple events in untrimmed...

Yuqing Song, et al.

research

∙ 06/14/2020

Team RUC_AIM3 Technical Report at Activitynet 2020 Task 2: Exploring Sequential Events Detection for Dense Video Captioning

Detecting meaningful events in an untrimmed video is essential for dense...

Yuqing Song, et al.

research

∙ 10/15/2019

Integrating Temporal and Spatial Attentions for VATEX Video Captioning Challenge 2019

This notebook paper presents our model in the VATEX video captioning cha...

Shizhe Chen, et al.

research

∙ 08/15/2019

Unpaired Cross-lingual Image Caption Generation with Self-Supervised Rewards

Generating image descriptions in different languages is essential to sat...

Yuqing Song, et al.

research

∙ 07/11/2019

Activitynet 2019 Task 3: Exploring Contexts for Dense Captioning Events in Videos

Contextual reasoning is essential to understand events in long untrimmed...

Shizhe Chen, et al.

research

∙ 06/22/2018

RUC+CMU: System Report for Dense Captioning Events in Videos

This notebook paper presents our system in the ActivityNet Dense Caption...

Shizhe Chen, et al.

