research
∙
07/11/2023
Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
Despite the dominance and effectiveness of scaling, resulting in large n...
research
∙
08/25/2020