Deep Convolutional Neural Network Inference with Floating-point Weights and Fixed-point Activations

8 Mar 2017 · Liangzhen Lai, Naveen Suda, Vikas Chandra ·

Deep convolutional neural network (CNN) inference requires significant amount of memory and computation, which limits its deployment on embedded devices. To alleviate these problems to some extent, prior research utilize low precision fixed-point numbers to represent the CNN weights and activations. However, the minimum required data precision of fixed-point weights varies across different networks and also across different layers of the same network. In this work, we propose using floating-point numbers for representing the weights and fixed-point numbers for representing the activations. We show that using floating-point representation for weights is more efficient than fixed-point representation for the same bit-width and demonstrate it on popular large-scale CNNs such as AlexNet, SqueezeNet, GoogLeNet and VGG-16. We also show that such a representation scheme enables compact hardware multiply-and-accumulate (MAC) unit design. Experimental results show that the proposed scheme reduces the weight storage by up to 36% and power consumption of the hardware multiplier by up to 50%.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Datasets

Add Datasets introduced or used in this paper

Results from the Paper

Edit

Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.

Methods

Add Remove

1x1 Convolution • AlexNet • Auxiliary Classifier • Average Pooling • Convolution • Dense Connections • Dropout • Fire Module • Global Average Pooling • GoogLeNet • Grouped Convolution • Inception Module • Local Response Normalization • Max Pooling • ReLU • Residual Connection • Softmax • SqueezeNet • VGG-16 • Xavier Initialization

Edit Social Preview

Deep Convolutional Neural Network Inference with Floating-point Weights and Fixed-point Activations

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove