Asa Cooper Stickland | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Xian Li
56 publications
Iain Murray
32 publications
Marjan Ghazvininejad
26 publications
Alexandre Berard
18 publications
Tomasz Korbak
18 publications
Vassilina Nikoulina
17 publications
Owain Evans
15 publications
Ahmet Üstün
15 publications
Xiang Kong
14 publications
Yuqing Tang
11 publications
Lukas Berglund
2 publications

research

∙ 09/21/2023

The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"

We expose a surprising failure of generalization in auto-regressive larg...

0 Lukas Berglund, et al. ∙

research

∙ 09/01/2023

Taken out of context: On measuring situational awareness in LLMs

We aim to better understand the emergence of `situational awareness' in ...

0 Lukas Berglund, et al. ∙

research

∙ 05/23/2022

When does Parameter-Efficient Transfer Learning Work for Machine Translation?

Parameter-efficient fine-tuning methods (PEFTs) offer the promise of ada...

0 Ahmet Üstün, et al. ∙

research

∙ 10/18/2021

Multilingual Domain Adaptation for NMT: Decoupling Language and Domain Information with Adapters

Adapter layers are lightweight, learnable units inserted between transfo...

0 Asa Cooper Stickland, et al. ∙

research

∙ 09/28/2020

Deep Transformers with Latent Depth

The Transformer model has achieved state-of-the-art performance in many ...

0 Xian Li, et al. ∙

research

∙ 07/08/2020

Diverse Ensembles Improve Calibration

Modern deep neural networks can produce badly calibrated predictions, es...

0 Asa Cooper Stickland, et al. ∙

research

∙ 04/30/2020

Recipes for Adapting Pre-trained Monolingual and Multilingual Models to Machine Translation

There has been recent success in pre-training on monolingual data and fi...

0 Asa Cooper Stickland, et al. ∙

research

∙ 02/07/2019

BERT and PALs: Projected Attention Layers for Efficient Adaptation in Multi-Task Learning

Multi-task learning allows the sharing of useful information between mul...

16 Asa Cooper Stickland, et al. ∙

Success!

An error occurred