Neural network pruning and quantization techniques are almost as old as
...
Recently, the idea of using FP8 as a number format for neural network
tr...
When quantizing neural networks for efficient inference, low-bit integer...
In this paper, we introduce a novel method of neural network weight
comp...
Current methods for pruning neural network weights iteratively apply
mag...
The success of deep neural networks in many real-world applications is
l...
We present a new deep learning-based approach for dense stereo matching....