Yonatan Bitton

research

∙ 08/12/2023

VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use

We introduce VisIT-Bench (Visual InsTruction Benchmark), a benchmark for...

0 Yonatan Bitton, et al. ∙

research

∙ 08/02/2023

OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models

We introduce OpenFlamingo, a family of autoregressive vision-language mo...

0 Anas Awadalla, et al. ∙

research

∙ 07/06/2023

Read, Look or Listen? What's Needed for Solving a Multimodal Dataset

The prevalence of large-scale multimodal datasets presents unique challe...

0 Netta Madvil, et al. ∙

research

∙ 05/17/2023

What You See is What You Read? Improving Text-Image Alignment Evaluation

Automatically determining whether a text and a corresponding image are s...

0 Michal Yarom, et al. ∙

research

∙ 04/27/2023

q2d: Turning Questions into Dialogs to Teach Models How to Search

One of the exciting capabilities of recent language models for dialog is...

3 Yonatan Bitton, et al. ∙

research

∙ 04/27/2023

DataComp: In search of the next generation of multimodal datasets

Large multimodal datasets have been instrumental in recent breakthroughs...

0 Samir Yitzhak Gadre, et al. ∙

research

∙ 03/27/2023

IRFL: Image Recognition of Figurative Language

Figures of speech such as metaphors, similes, and idioms allow language ...

0 Ron Yosef, et al. ∙

research

∙ 03/13/2023

Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images

Weird, unusual, and uncanny images pique the curiosity of observers beca...

0 Nitzan Bitton Guetta, et al. ∙

research

∙ 12/08/2022

VASR: Visual Analogies of Situation Recognition

A core process in human cognition is analogical mapping: the ability to ...

0 Yonatan Bitton, et al. ∙

research

∙ 07/25/2022

WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models

While vision-and-language models perform well on tasks such as visual qu...

1 Yonatan Bitton, et al. ∙

research

∙ 09/05/2021

Data Efficient Masked Language Modeling for Vision and Language

Masked language modeling (MLM) is one of the key sub-tasks in vision-lan...

0 Yonatan Bitton, et al. ∙

research

∙ 03/17/2021

Automatic Generation of Contrast Sets from Scene Graphs: Probing the Compositional Consistency of GQA

Recent works have shown that supervised models often exploit data artifa...

0 Yonatan Bitton, et al. ∙

Yonatan Bitton

Featured Co-authors

Sign in with Google

Consider DeepAI Pro