research
          
      
      ∙
      10/28/2022
    Flatter, faster: scaling momentum for optimal speedup of SGD
Commonly used optimization algorithms often show a trade-off between goo...
          
            research
          
      
      ∙
      12/15/2019
     
             
  
  
     
                             share
 share