b'Chiori Hori'

research

∙ 06/27/2023

Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos

To realize human-robot collaboration, robots need to execute actions for...

0 Chiori Hori, et al. ∙

research

∙ 02/18/2022

(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering

Spatio-temporal scene-graph approaches to video-based reasoning tasks su...

5 Anoop Cherian, et al. ∙

research

∙ 10/13/2021

Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning

In previous work, we have proposed the Audio-Visual Scene-Aware Dialog (...

0 Ankit P. Shah, et al. ∙

research

∙ 08/04/2021

Optimizing Latency for Online Video CaptioningUsing Audio-Visual Transformers

Video captioning is an essential technology to understand scenes and des...

0 Chiori Hori, et al. ∙

research

∙ 04/19/2021

Advanced Long-context End-to-end Speech Recognition Using Context-expanded Transformers

This paper addresses end-to-end automatic speech recognition (ASR) for l...

0 Takaaki Hori, et al. ∙

research

∙ 09/23/2020

Multi-Pass Transformer for Machine Translation

In contrast with previous approaches where information flows only toward...

0 Peng Gao, et al. ∙

research

∙ 07/08/2020

Spatio-Temporal Scene Graphs for Video Dialog

The Audio-Visual Scene-aware Dialog (AVSD) task requires an agent to ind...

0 Shijie Geng, et al. ∙

research

∙ 01/17/2020

Spatio-Temporal Ranked-Attention Networks for Video Captioning

Generating video descriptions automatically is a challenging task that i...

4 Anoop Cherian, et al. ∙

research

∙ 11/14/2019

The Eighth Dialog System Technology Challenge

This paper introduces the Eighth Dialog System Technology Challenge. In ...

0 Seokhwan Kim, et al. ∙

research

∙ 01/25/2019

Audio-Visual Scene-Aware Dialog

We introduce the task of scene-aware dialog. Given a follow-up question ...

48 Huda Alamri, et al. ∙

research

∙ 01/11/2019

Dialog System Technology Challenge 7

This paper introduces the Seventh Dialog System Technology Challenges (D...

0 Koichiro Yoshino, et al. ∙

research

∙ 06/21/2018

End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features

Dialog systems need to understand dynamic visual scenes in order to have...

0 Chiori Hori, et al. ∙

research

∙ 06/01/2018

Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7

Scene-aware dialog systems will be able to have conversations with users...

0 Huda Alamri, et al. ∙

research

∙ 06/22/2017

End-to-end Conversation Modeling Track in DSTC6

End-to-end training of neural networks is a promising approach to automa...

0 Chiori Hori, et al. ∙

research

∙ 01/11/2017

Attention-Based Multimodal Fusion for Video Description

Currently successful methods for video description are based on encoder-...

0 Chiori Hori, et al. ∙

Chiori Hori

Featured Co-authors

Sign in with Google

Consider DeepAI Pro