Matthias Minderer

research

∙ 07/12/2023

Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution

The ubiquitous and demonstrably suboptimal choice of resizing images to ...

0 Mostafa Dehghani, et al. ∙

research

∙ 06/16/2023

Scaling Open-Vocabulary Object Detection

Open-vocabulary object detection has benefited greatly from pretrained v...

0 Matthias Minderer, et al. ∙

research

∙ 12/15/2022

FlexiViT: One Model for All Patch Sizes

Vision Transformers convert images to sequences by slicing them into pat...

15 Lucas Beyer, et al. ∙

research

∙ 05/23/2022

Decoder Denoising Pretraining for Semantic Segmentation

Semantic segmentation labels are expensive and time consuming to acquire...

4 Emmanuel Brempong Asiedu, et al. ∙

research

∙ 10/18/2021

SCENIC: A JAX Library for Computer Vision Research and Beyond

Scenic is an open-source JAX library with a focus on Transformer-based m...

31 Mostafa Dehghani, et al. ∙

research

∙ 06/15/2021

Revisiting the Calibration of Modern Neural Networks

Accurate estimation of predictive uncertainty (model calibration) is ess...

18 Matthias Minderer, et al. ∙

research

∙ 10/22/2020

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

While the Transformer architecture has become the de-facto standard for ...

6 Alexey Dosovitskiy, et al. ∙

research

∙ 07/16/2020

On Robustness and Transferability of Convolutional Neural Networks

Modern deep convolutional networks (CNNs) are often criticized for not g...

15 Josip Djolonga, et al. ∙

research

∙ 02/20/2020

Automatic Shortcut Removal for Self-Supervised Representation Learning

In self-supervised visual representation learning, a feature extractor i...

0 Matthias Minderer, et al. ∙

research

∙ 06/19/2019

Unsupervised Learning of Object Structure and Dynamics from Videos

Extracting and predicting object structure and dynamics from videos with...

4 Matthias Minderer, et al. ∙

Matthias Minderer

Featured Co-authors

Sign in with Google

Consider DeepAI Pro