This paper presents a novel nearest neighbor search algorithm achieving ...
The performance of a language model has been shown to be effectively mod...
We present GSPMD, an automatic, compiler-based parallelization system fo...
Self-attention has the promise of improving computer vision systems due ...
Recent results in language understanding using neural networks have requ...
In data-parallel synchronous training of deep neural networks, different...
In this work, we present two parallel algorithms for the large-scale dis...
The recent submission of Google TPU-v3 Pods to the industry wide MLPerf ...
Batch-splitting (data-parallelism) is the dominant distributed Deep Neur...