Pranava Madhyastha

research

∙ 07/14/2023

Are words equally surprising in audio and audio-visual comprehension?

We report a controlled study investigating the effect of visual informat...

0 Pranava Madhyastha, et al. ∙

research

∙ 06/24/2023

Towards Robust Aspect-based Sentiment Analysis through Non-counterfactual Augmentations

While state-of-the-art NLP models have demonstrated excellent performanc...

0 Xinyu Liu, et al. ∙

research

∙ 04/11/2023

Towards preserving word order importance through Forced Invalidation

Large pre-trained language models such as BERT have been widely used as ...

0 Hadeel Al-Negheimish, et al. ∙

research

∙ 04/07/2023

Theoretical Conditions and Empirical Failure of Bracket Counting on Long Sequences with Linear Recurrent Networks

Previous work has established that RNNs with an unbounded activation fun...

0 Nadine El-Naggar, et al. ∙

research

∙ 01/25/2023

Towards a Unified Model for Generating Answers and Explanations in Visual Question Answering

Providing explanations for visual question answering (VQA) has gained mu...

0 Chenxi Whitehouse, et al. ∙

research

∙ 11/29/2022

Exploring the Long-Term Generalization of Counting Behavior in RNNs

In this study, we investigate the generalization of LSTM, ReLU and GRU m...

0 Nadine El-Naggar, et al. ∙

research

∙ 09/16/2022

Belief Revision based Caption Re-ranker with Visual Semantic Information

In this work, we focus on improving the captions generated by image-capt...

0 Ahmed Sabir, et al. ∙

research

∙ 04/01/2022

Evaluation of Fake News Detection with Knowledge-Enhanced Language Models

Recent advances in fake news detection have exploited the success of lar...

0 Chenxi Whitehouse, et al. ∙

research

∙ 09/16/2021

Numerical reasoning in machine reading comprehension tasks: are we there yet?

Numerical reasoning based machine reading comprehension is a task that i...

0 Hadeel Al-Negheimish, et al. ∙

research

∙ 06/07/2021

BERTGEN: Multi-task Generation through BERT

We present BERTGEN, a novel generative, decoder-only model which extends...

0 Faidon Mitzalis, et al. ∙

research

∙ 06/06/2021

A call for better unit testing for invariant risk minimisation

In this paper we present a controlled study on the linearized IRM framew...

0 Chunyang Xiao, et al. ∙

research

∙ 04/05/2021

Discrete Reasoning Templates for Natural Language Understanding

Reasoning about information from multiple parts of a passage to derive a...

0 Hadeel Al-Negheimish, et al. ∙

research

∙ 03/02/2021

MultiSubs: A Large-scale Multimodal and Multilingual Dataset

This paper introduces a large-scale multimodal and multilingual dataset ...

0 Josiah Wang, et al. ∙

research

∙ 02/22/2021

Exploiting Multimodal Reinforcement Learning for Simultaneous Machine Translation

This paper addresses the problem of simultaneous machine translation (Si...

0 Julia Ive, et al. ∙

research

∙ 01/25/2021

Cross-lingual Visual Pre-training for Multimodal Machine Translation

Pre-trained language models have been shown to improve performance in ma...

8 Ozan Caglayan, et al. ∙

research

∙ 12/13/2020

MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in Turkish

Automatic generation of video descriptions in natural language, also cal...

14 Begum Citamak, et al. ∙

research

∙ 10/26/2020

Curious Case of Language Generation Evaluation Metrics: A Cautionary Tale

Automatic evaluation of language generation systems is a well-studied pr...

0 Ozan Caglayan, et al. ∙

research

∙ 09/15/2020

Simultaneous Machine Translation with Visual Context

Simultaneous machine translation (SiMT) aims to translate a continuous i...

0 Ozan Caglayan, et al. ∙

research

∙ 06/17/2020

A Tweet-based Dataset for Company-Level Stock Return Prediction

Public opinion influences events, especially related to stock market mov...

0 Karolina Sowinska, et al. ∙

research

∙ 10/16/2019

Imperial College London Submission to VATEX Video Captioning Task

This paper describes the Imperial College London team's submission to th...

0 Ozan Caglayan, et al. ∙

research

∙ 09/23/2019

On Model Stability as a Function of Random Seed

In this paper, we focus on quantifying model stability as a function of ...

0 Pranava Madhyastha, et al. ∙

research

∙ 08/29/2019

Probing Representations Learned by Multimodal Recurrent and Transformer Models

Recent literature shows that large-scale language modeling provides exce...

0 Jindřich Libovický, et al. ∙

research

∙ 08/05/2019

Predicting Actions to Help Predict Translations

We address the task of text translation on the How2 dataset using a stat...

0 Zixiu Wu, et al. ∙

research

∙ 07/22/2019

VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions

We address the task of evaluating image description generation systems. ...

3 Pranava Madhyastha, et al. ∙

research

∙ 06/18/2019

Distilling Translations with Visual Awareness

Previous work on multimodal machine translation has shown that visual in...

0 Julia Ive, et al. ∙

research

∙ 06/18/2019

Model Explanations under Calibration

Explaining and interpreting the decisions of recommender systems are bec...

0 Rishabh Jain, et al. ∙

research

∙ 03/20/2019

Probing the Need for Visual Context in Multimodal Machine Translation

Current work on multimodal machine translation (MMT) has suggested that ...

0 Ozan Caglayan, et al. ∙

research

∙ 11/26/2018

Predicting Language Recovery after Stroke with Convolutional Networks on Stitched MRI

One third of stroke survivors have language difficulties. Emerging evide...

0 Yusuf H. Roohani, et al. ∙

research

∙ 11/21/2018

Learning from Multiview Correlations in Open-Domain Videos

An increasing number of datasets contain multiple views, such as video, ...

0 Nils Holzenberger, et al. ∙

research

∙ 09/11/2018

End-to-end Image Captioning Exploits Multimodal Distributional Similarity

We hypothesize that end-to-end neural image captioning systems work seem...

0 Pranava Madhyastha, et al. ∙

research

∙ 05/16/2018

Defoiling Foiled Image Captions

We address the task of detecting foiled image captions, i.e. identifying...

0 Pranava Madhyastha, et al. ∙

research

∙ 04/23/2018

Object Counts! Bringing Explicit Detections Back into Image Captioning

The use of explicit object detectors as an intermediate step to image ca...

0 Josiah Wang, et al. ∙

Pranava Madhyastha

Featured Co-authors

Sign in with Google

Consider DeepAI Pro