Prompted models have demonstrated impressive few-shot learning abilities...
The ability to extrapolate from short problem instances to longer ones i...
Language models have achieved remarkable performance on a wide range of ...
Large language models have been shown to achieve remarkable performance ...
Recent neural network-based language models have benefited greatly from ...
Large pre-trained language models perform remarkably well on tasks that ...
Complex learning rate schedules have become an integral part of deep lea...
We study the role of L_2 regularization in deep learning, and uncover si...
The choice of initial learning rate can have a profound effect on the pe...