Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3212725.3212733acmconferencesArticle/Chapter ViewAbstractPublication PagesmobisysConference Proceedingsconference-collections
research-article

Efficiently Combining SVD, Pruning, Clustering and Retraining for Enhanced Neural Network Compression

Published: 15 June 2018 Publication History
First page of PDF

References

[1]
Bojarski, M., Yeres, P., Choromanska, A., Choromanski, K., Firner, B., Jackel, L., and Muller, U. Explaining how a deep neural network trained with end-to-end learning steers a car. arXiv preprint arXiv:1704.07911 (2017).
[2]
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and Fei-Fei, L. ImageNet: A Large-Scale Hierarchical Image Database. In CVPR09 (2009).
[3]
Dieleman, S., Schlüter, J., Raffel, C., Olson, E., Sønderby, S. K., Nouri, D., et al. Lasagne: First release., Aug. 2015.
[4]
Garofolo, J. S., Lamel, L. F., Fisher, W. M., Fiscus, J. G., Pallett, D. S., and Dahlgren, N. L. Darpa timit acoustic phonetic continuous speech corpus cdrom, 1993.
[5]
Han, S., Liu, X., Mao, H., Pu, J., Pedram, A., Horowitz, M. A., and Dally, W. J. Eie: efficient inference engine on compressed deep neural network. In Proceedings of the 43rd International Symposium on Computer Architecture (2016), IEEE Press, pp. 243--254.
[6]
Han, S., Mao, H., and Dally, W. J. Deep compression: Compressing deep neural network with pruning, trained quantization and huffman coding. CoRR, abs/1510.00149 2 (2015).
[7]
Horowitz, M. 1.1 computing's energy problem (and what we can do about it). In 2014 IEEE International Solid-State Circuits Conference Digest of Technical Papers (ISSCC) (Feb 2014), pp. 10--14.
[8]
Jouppi, N. P., Young, C., Patil, N., Patterson, D., Agrawal, G., Bajwa, R., Bates, S., Bhatia, S., Boden, N., Borchers, A., et al. In-datacenter performance analysis of a tensor processing unit. International Symposium on Computer Architecture (ISCA) (2017).
[9]
Krizhevsky, A., and Hinton, G. Learning multiple layers of features from tiny images. 32--35.
[10]
Krizhevsky, A., Sutskever, I., and Hinton, G. E. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems (2012), pp. 1097--1105.
[11]
Shin, D., Lee, J., Lee, J., and Yoo, H. J. 14.2 dnpu: An 8.1tops/w reconfigurable cnn-rnn processor for general-purpose deep neural networks. In 2017 IEEE International Solid-State Circuits Conference (ISSCC) (Feb 2017), pp. 240--241.
[12]
Sundermeyer, M., Schlüter, R., and Ney, H. Lstm neural networks for language modeling. In Interspeech (2012).
[13]
Taigman, Y., Yang, M., Ranzato, M., and Wolf, L. Deepface: Closing the gap to human-level performance in face verification. In Proceedings of the IEEE conference on computer vision and pattern recognition (2014), pp. 1701--1708.
[14]
Taylor, G., and Ding, W. Theano-based large-scale visual recognition with multiple gpus, 2015.
[15]
Theano Development Team. Theano: A Python framework for fast computation of mathematical expressions. arXiv e-prints abs/1605.02688 (May 2016).
[16]
Verhelst, M., and Moons, B. Embedded deep neural network processing: Algorithmic and processor techniques bring deep learning to iot and edge devices. IEEE Solid-State Circuits Magazine 9, 4 (Fall 2017), 55--65.
[17]
Xue, J., Li, J., and Gong, Y. Restructuring of deep neural network acoustic models with singular value decomposition. In Interspeech (2013), pp. 2365--2369.

Cited By

View all
  • (2024)A Neural Network Approach for the Solution of an Electromagnetic Inverse Source Problem with SVD-based Pruning2024 IEEE-APS Topical Conference on Antennas and Propagation in Wireless Communications (APWC)10.1109/APWC61918.2024.10701760(147-151)Online publication date: 2-Sep-2024
  • (2024)Seeking Interpretability and Explainability in Binary Activated Neural NetworksExplainable Artificial Intelligence10.1007/978-3-031-63787-2_1(3-20)Online publication date: 10-Jul-2024
  • (2023)Transfer Learning With Singular Value Decomposition of Multichannel Convolution MatricesNeural Computation10.1162/neco_a_0160835:10(1678-1712)Online publication date: 8-Sep-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
EMDL'18: Proceedings of the 2nd International Workshop on Embedded and Mobile Deep Learning
June 2018
51 pages
ISBN:9781450358446
DOI:10.1145/3212725
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 June 2018

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

MobiSys '18
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)52
  • Downloads (Last 6 weeks)4
Reflects downloads up to 13 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)A Neural Network Approach for the Solution of an Electromagnetic Inverse Source Problem with SVD-based Pruning2024 IEEE-APS Topical Conference on Antennas and Propagation in Wireless Communications (APWC)10.1109/APWC61918.2024.10701760(147-151)Online publication date: 2-Sep-2024
  • (2024)Seeking Interpretability and Explainability in Binary Activated Neural NetworksExplainable Artificial Intelligence10.1007/978-3-031-63787-2_1(3-20)Online publication date: 10-Jul-2024
  • (2023)Transfer Learning With Singular Value Decomposition of Multichannel Convolution MatricesNeural Computation10.1162/neco_a_0160835:10(1678-1712)Online publication date: 8-Sep-2023
  • (2023)Communication Efficient Federated Learning With Heterogeneous Structured Client ModelsIEEE Transactions on Emerging Topics in Computational Intelligence10.1109/TETCI.2022.32093457:3(753-767)Online publication date: Jun-2023
  • (2023)Mixed-TD: Efficient Neural Network Accelerator with Layer-Specific Tensor Decomposition2023 33rd International Conference on Field-Programmable Logic and Applications (FPL)10.1109/FPL60245.2023.00036(204-211)Online publication date: 4-Sep-2023
  • (2022)Methods for Pruning Deep Neural NetworksIEEE Access10.1109/ACCESS.2022.318265910(63280-63300)Online publication date: 2022
  • (2022)Dimensionality reduction for multivariate time-series data miningThe Journal of Supercomputing10.1007/s11227-021-04303-478:7(9862-9878)Online publication date: 19-Jan-2022
  • (2022)Diverse and styled image captioning using singular value decomposition‐based mixture of recurrent expertsConcurrency and Computation: Practice and Experience10.1002/cpe.686634:22Online publication date: Feb-2022
  • (2020)Deep Learning on Mobile and Embedded DevicesACM Computing Surveys10.1145/339820953:4(1-37)Online publication date: 20-Aug-2020
  • (2020)VGG deep neural network compression via SVD and CUR decomposition techniques2020 7th NAFOSTED Conference on Information and Computer Science (NICS)10.1109/NICS51282.2020.9335842(118-123)Online publication date: 26-Nov-2020

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media