It is important that consumers and regulators can verify the provenance ...
Standard first-order stochastic optimization algorithms base their updat...
The Gumbel-Max trick is the basis of many relaxed gradient estimators. T...
Selecting an optimizer is a central step in the contemporary deep learni...
In the twilight of Moore's law, GPUs and other specialized hardware
acce...