We develop an approach to efficiently grow neural networks, within which...
We study image segmentation from an information-theoretic perspective,
p...
We develop an approach to training deep networks while dynamically adjus...
A recent line of work studies overparametrized neural networks in the "k...
The Lottery Ticket Hypothesis from Frankle Carbin (2019) conjectures...
From a simplified analysis of adaptive methods, we derive AvaGrad, a new...
With the recent progress in machine learning, boosted by techniques such...
Adaptive gradient methods such as Adam have gained extreme popularity du...
We introduce a parameter sharing scheme, in which different layers of a
...
We consider the question of what functions can be captured by ReLU netwo...