In the past few years, neural architecture search (NAS) has become an
in...
We present the design of a new large-scale orchestration layer for
accel...
Multi-Chip-Modules (MCMs) reduce the design and fabrication cost of mach...
One of the major optimizations employed in deep learning frameworks is g...
Most compilers for machine learning (ML) frameworks need to solve many
c...
Accurate hardware performance models are critical to efficient code
gene...
Runtime and scalability of large neural networks can be significantly
af...
In this paper, we propose a closed-form approximation to the mean and
va...