Lightweight compression of neural network feature tensors for collaborative intelligence

Cohen, Robert A.; Choi, Hyomin; Bajić, Ivan V.

doi:10.1109/ICME46284.2020.9102797

Computer Science > Machine Learning

arXiv:2105.06002 (cs)

[Submitted on 12 May 2021]

Title:Lightweight compression of neural network feature tensors for collaborative intelligence

Authors:Robert A. Cohen, Hyomin Choi, Ivan V. Bajić

View PDF

Abstract:In collaborative intelligence applications, part of a deep neural network (DNN) is deployed on a relatively low-complexity device such as a mobile phone or edge device, and the remainder of the DNN is processed where more computing resources are available, such as in the cloud. This paper presents a novel lightweight compression technique designed specifically to code the activations of a split DNN layer, while having a low complexity suitable for edge devices and not requiring any retraining. We also present a modified entropy-constrained quantizer design algorithm optimized for clipped activations. When applied to popular object-detection and classification DNNs, we were able to compress the 32-bit floating point activations down to 0.6 to 0.8 bits, while keeping the loss in accuracy to less than 1%. When compared to HEVC, we found that the lightweight codec consistently provided better inference accuracy, by up to 1.3%. The performance and simplicity of this lightweight compression technique makes it an attractive option for coding a layer's activations in split neural networks for edge/cloud applications.

Comments:	Accepted for publication in IEEE ICME 2020
Subjects:	Machine Learning (cs.LG); Image and Video Processing (eess.IV)
Cite as:	arXiv:2105.06002 [cs.LG]
	(or arXiv:2105.06002v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2105.06002
Journal reference:	2020 IEEE International Conference on Multimedia and Expo (ICME)
Related DOI:	https://doi.org/10.1109/ICME46284.2020.9102797

Submission history

From: Robert Cohen [view email]
[v1] Wed, 12 May 2021 23:41:35 UTC (117 KB)

Computer Science > Machine Learning

Title:Lightweight compression of neural network feature tensors for collaborative intelligence

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Lightweight compression of neural network feature tensors for collaborative intelligence

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators