DAC: Data-free Automatic Acceleration of Convolutional Networks

12/20/2018
by   Xin Li, et al.
0

Deploying a deep learning model on mobile/IoT devices is a challenging task. The difficulty lies in the trade-off between computation speed and accuracy. A complex deep learning model with high accuracy runs slowly on resource-limited devices, while a light-weight model that runs much faster loses accuracy. In this paper, we propose a novel decomposition method, namely DAC, that is capable of factorizing an ordinary convolutional layer into two layers with much fewer parameters. DAC computes the corresponding weights for the newly generated layers directly from the weights of the original convolutional layer. Thus, no training (or fine-tuning) or any data is needed. The experimental results show that DAC reduces a large number of floating-point operations (FLOPs) while maintaining high accuracy of a pre-trained model. If 2 drop is acceptable, DAC saves 53 ImageNet dataset, 29 dataset, and 46 COCO dataset. Compared to other existing decomposition methods, DAC achieves better performance.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset