Simon Kornblith

research

∙ 09/15/2023

Replacing softmax with ReLU in Vision Transformers

Previous research observed accuracy degradation when replacing the atten...

0 Mitchell Wortsman, et al. ∙

research

∙ 08/02/2023

OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models

We introduce OpenFlamingo, a family of autoregressive vision-language mo...

0 Anas Awadalla, et al. ∙

research

∙ 07/31/2023

Guiding Image Captioning Models Toward More Specific Captions

Image captioning is conventionally formulated as the task of generating ...

0 Simon Kornblith, et al. ∙

research

∙ 06/07/2023

Improving neural network representations using human similarity judgments

Deep neural networks have reached human-level performance on many comput...

0 Lukas Muttenthaler, et al. ∙

research

∙ 01/11/2023

Does progress on ImageNet transfer to real-world datasets?

Does progress on ImageNet transfer to real-world datasets? We investigat...

0 Alex Fang, et al. ∙

research

∙ 12/15/2022

FlexiViT: One Model for All Patch Sizes

Vision Transformers convert images to sequences by slicing them into pat...

15 Lucas Beyer, et al. ∙

research

∙ 12/13/2022

On the Relationship Between Explanation and Prediction: A Causal View

Explainability has become a central requirement for the development, dep...

0 Amir-Hossein Karimi, et al. ∙

research

∙ 11/02/2022

Human alignment of neural network representations

Today's computer vision models achieve human or near-human level perform...

0 Lukas Muttenthaler, et al. ∙

research

∙ 10/19/2022

Gaussian-Bernoulli RBMs Without Tears

We revisit the challenging problem of training Gaussian-Bernoulli restri...

5 Renjie Liao, et al. ∙

research

∙ 10/11/2022

Improving Dense Contrastive Learning with Dense Negative Pairs

Many contrastive representation learning methods learn a single global r...

9 Berk Iskender, et al. ∙

research

∙ 10/07/2022

Scaling Forward Gradient With Local Losses

Forward gradient learning computes a noisy directional gradient and is a...

8 Mengye Ren, et al. ∙

research

∙ 08/10/2022

Patching open-vocabulary models by interpolating weights

Open-vocabulary models like CLIP achieve high accuracy across many image...

10 Gabriel Ilharco, et al. ∙

research

∙ 07/09/2022

A Study on Self-Supervised Object Detection Pretraining

In this work, we study different approaches to self-supervised pretraini...

11 Trung Dang, et al. ∙

research

∙ 05/23/2022

Decoder Denoising Pretraining for Semantic Segmentation

Semantic segmentation labels are expensive and time consuming to acquire...

4 Emmanuel Brempong Asiedu, et al. ∙

research

∙ 05/19/2022

Robust and Efficient Medical Imaging with Self-Supervision

Recent progress in Medical Artificial Intelligence (AI) has delivered sy...

24 Shekoofeh Azizi, et al. ∙

research

∙ 03/10/2022

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time

The conventional recipe for maximizing model accuracy is to (1) train mu...

10 Mitchell Wortsman, et al. ∙

research

∙ 02/15/2022

On the Origins of the Block Structure Phenomenon in Neural Network Representations

Recent work has uncovered a striking phenomenon in large-capacity neural...

3 Thao Nguyen, et al. ∙

research

∙ 11/02/2021

Meta-Learning to Improve Pre-Training

Pre-training (PT) followed by fine-tuning (FT) is an effective method fo...

14 Aniruddh Raghu, et al. ∙

research

∙ 10/27/2021

Generalized Shape Metrics on Neural Representations

Understanding the operation of biological and artificial networks remain...

0 Alex H Williams, et al. ∙

research

∙ 08/19/2021

Do Vision Transformers See Like Convolutional Neural Networks?

Convolutional neural networks (CNNs) have so far been the de-facto model...

33 Jojo Yun, et al. ∙

research

∙ 01/13/2021

Big Self-Supervised Models Advance Medical Image Classification

Self-supervised pretraining followed by supervised fine-tuning has seen ...

22 Shekoofeh Azizi, et al. ∙

research

∙ 11/23/2020

Boosting Contrastive Self-Supervised Learning with False Negative Cancellation

Self-supervised representation learning has witnessed significant leaps ...

0 Tri Huynh, et al. ∙

research

∙ 11/05/2020

Teaching with Commentaries

Effective training of deep neural networks can be challenging, and there...

3 Aniruddh Raghu, et al. ∙

research

∙ 10/30/2020

What's in a Loss Function for Image Classification?

It is common to use the softmax cross-entropy loss to train neural netwo...

31 Simon Kornblith, et al. ∙

research

∙ 10/29/2020

Do Wide and Deep Networks Learn the Same Things? Uncovering How Neural Network Representations Vary with Width and Depth

A key factor in the success of deep neural networks is the ability to sc...

73 Thao Nguyen, et al. ∙

research

∙ 06/17/2020

Big Self-Supervised Models are Strong Semi-Supervised Learners

One paradigm for learning from few labeled examples while making best us...

6 Ting Chen, et al. ∙

research

∙ 02/13/2020

A Simple Framework for Contrastive Learning of Visual Representations

This paper presents SimCLR: a simple framework for contrastive learning ...

13 Ting Chen, et al. ∙

research

∙ 02/11/2020

Generalised Lipschitz Regularisation Equals Distributional Robustness

The problem of adversarial examples has highlighted the need for a theor...

3 Zac Cranko, et al. ∙

research

∙ 02/10/2020

Subclass Distillation

After a large "teacher" neural network has been trained on labeled data,...

9 Rafael Müller, et al. ∙

research

∙ 02/07/2020

Revisiting Spatial Invariance with Low-Rank Local Connectivity

Convolutional neural networks are among the most successful architecture...

3 Gamaleldin F. Elsayed, et al. ∙

research

∙ 11/20/2019

Exploring the Origins and Prevalence of Texture Bias in Convolutional Neural Networks

Recent work has indicated that, unlike humans, ImageNet-trained CNNs ten...

0 Katherine L. Hermann, et al. ∙

research

∙ 08/20/2019

Saccader: Improving Accuracy of Hard Attention Models for Vision

Although deep convolutional neural networks achieve state-of-the-art per...

9 Gamaleldin F. Elsayed, et al. ∙

research

∙ 06/06/2019

When Does Label Smoothing Help?

The generalization and learning speed of a multi-class neural network ca...

7 Rafael Müller, et al. ∙

research

∙ 05/28/2019

Cerberus: A Multi-headed Derenderer

To generalize to novel visual scenes with new viewpoints and new object ...

4 Boyang Deng, et al. ∙

research

∙ 05/01/2019

Similarity of Neural Network Representations Revisited

Recent work has sought to understand the behavior of neural networks by ...

20 Simon Kornblith, et al. ∙

research

∙ 11/16/2018

Domain Adaptive Transfer Learning with Specialist Models

Transfer learning is a widely used method to build high performing compu...

0 Jiquan Ngiam, et al. ∙

research

∙ 09/04/2018

Lipschitz Networks and Distributional Robustness

Robust risk minimisation has several advantages: it has been studied wit...

0 Zac Cranko, et al. ∙

research

∙ 05/23/2018

Do Better ImageNet Models Transfer Better?

Transfer learning has become a cornerstone of computer vision with the a...

0 Simon Kornblith, et al. ∙

Simon Kornblith

Featured Co-authors

Sign in with Google

Consider DeepAI Pro