In this work we introduce RITA: a suite of autoregressive generative mod...
Access to large pre-trained models of varied architectures, in many diff...
Recent work has identified simple empirical scaling laws for language mo...
We introduce LightOn's Optical Processing Unit (OPU), the first photonic...
Optical Processing Units (OPUs) – low-power photonic chips dedicated to
...
Randomized Numerical Linear Algebra (RandNLA) is a powerful class of met...
The performance of algorithms for neural architecture search strongly de...
We propose a new defense mechanism against adversarial attacks inspired ...
The scaling hypothesis motivates the expansion of models past trillions ...
Despite being the workhorse of deep learning, the backpropagation algori...
Proteins are made of atoms constantly fluctuating, but can occasionally
...
As neural networks grow larger and more complex and data-hungry, trainin...
The backpropagation algorithm has long been the canonical training metho...
We consider the problem of detecting abrupt changes in the distribution ...