We study the training dynamics of shallow neural networks, in a two-time...
Gradient-based learning in multi-layer neural networks displays a number...
Diagonal linear networks (DLNs) are a toy simplification of artificial n...
Gossip algorithms and their accelerated versions have been studied
exclu...
Approximate-message passing (AMP) algorithms have become an important el...
We introduce the continuized Nesterov acceleration, a close variant of
N...
We introduce the "continuized" Nesterov acceleration, a close variant of...
In the context of statistical supervised learning, the noiseless linear ...
Consider a network of agents connected by communication links, where eac...