Dual Precision Deep Neural Network

09/02/2020
by Jae-Hyun Park, et al.

On-line precision scalability of deep neural networks (DNNs) is a critical feature for supporting the trade-off between accuracy and complexity during DNN inference. In this paper, we propose a dual-precision DNN that includes two different precision modes in a single model, thereby supporting an on-line precision switch without re-training. The proposed two-phase training process optimizes both the low- and high-precision modes.
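
The abstract does not say how the two precision modes share parameters within the single model. As an illustration only, the sketch below shows one possible scheme, which is an assumption and not necessarily the authors' method: weights are stored once as 8-bit codes, and the low-precision mode is obtained at inference time by dropping the least significant bits. The function names, the bit widths (8 and 4), and the sample weights are all hypothetical.

```python
# Illustrative sketch (assumed scheme, not taken from the paper):
# one stored set of high-precision codes serves both precision modes.

def quantize_uniform(weights, bits, w_max=1.0):
    """Symmetric uniform quantization to signed integer codes."""
    levels = 2 ** (bits - 1) - 1            # e.g. 127 for 8 bits
    scale = w_max / levels                  # real value of one code step
    codes = [max(-levels, min(levels, round(w / scale))) for w in weights]
    return codes, scale

def to_low_precision(codes, high_bits, low_bits):
    """Derive low-precision codes by discarding the least significant bits."""
    shift = high_bits - low_bits
    return [c >> shift for c in codes]      # arithmetic shift keeps the sign

def dequantize(codes, scale):
    """Map integer codes back to approximate real-valued weights."""
    return [c * scale for c in codes]

# Hypothetical weights, quantized once at high precision.
weights = [0.75, -0.31, 0.02, -0.98]
hi_codes, hi_scale = quantize_uniform(weights, bits=8)

# Switching to the low-precision mode needs no re-training in this sketch:
# the 4-bit codes are computed from the stored 8-bit codes.
lo_codes = to_low_precision(hi_codes, high_bits=8, low_bits=4)
lo_scale = hi_scale * 2 ** (8 - 4)          # each low-precision step spans 16 high steps

hi_weights = dequantize(hi_codes, hi_scale)
lo_weights = dequantize(lo_codes, lo_scale)
```

In this sketch the low-precision mode approximates the weights more coarsely, which is the accuracy side of the trade-off the abstract mentions. A training procedure that optimizes both modes, such as the two-phase process the authors propose, would aim to keep the model accurate under either set of codes.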
