Abstract
In this paper, we show that the GPU (graphics processing unit) can be used not only for processing graphics but also for high-speed computing. We compare the times taken on the CPU and the GPU to train and test a back-propagation artificial neural network. We implemented two neural networks for recognizing handwritten digits: one consists of serial code executed on the CPU, while the other is a GPU-based version of the same system that executes in parallel, with the goal of reducing training time. The system is built on CUDA (compute unified device architecture), NVIDIA's programming environment, which allows a programmer to write code that runs on an NVIDIA GPU. Our results on a handwritten digit recognition experiment with the neural network confirm the speed-up gained by tapping the resources of the GPU. The proposed model has the advantage of simplicity while performing on par with state-of-the-art algorithms.
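For illustration, the serial CPU baseline described above can be sketched in plain Python. The independent per-neuron loops inside `train_step` are exactly the parts a CUDA port would map onto GPU threads. The network sizes, the XOR stand-in data set, and all identifiers here are our own illustrative choices, not details taken from the paper.

```python
import math
import random

def sigmoid(x):
    """Logistic activation used by the classic back-propagation network."""
    return 1.0 / (1.0 + math.exp(-x))

class MLP:
    """Tiny fully connected network: n_in -> n_hid -> n_out."""
    def __init__(self, n_in, n_hid, n_out, seed=0):
        rng = random.Random(seed)
        # Each weight row ends with a bias term at index -1.
        self.w1 = [[rng.uniform(-1, 1) for _ in range(n_in + 1)] for _ in range(n_hid)]
        self.w2 = [[rng.uniform(-1, 1) for _ in range(n_hid + 1)] for _ in range(n_out)]

    def forward(self, x):
        self.h = [sigmoid(sum(w[i] * v for i, v in enumerate(x)) + w[-1])
                  for w in self.w1]
        self.o = [sigmoid(sum(w[j] * h for j, h in enumerate(self.h)) + w[-1])
                  for w in self.w2]
        return self.o

    def train_step(self, x, target, lr=0.5):
        out = self.forward(x)
        # Output deltas: d(squared error)/d(net input) per output neuron.
        d_out = [(t - y) * y * (1 - y) for y, t in zip(out, target)]
        # Hidden deltas: errors propagated back through the output weights.
        d_hid = [h * (1 - h) * sum(d * self.w2[k][j] for k, d in enumerate(d_out))
                 for j, h in enumerate(self.h)]
        # Weight updates; each independent per-neuron loop below is the
        # kind of work a CUDA kernel would assign to one thread.
        for k, d in enumerate(d_out):
            for j, h in enumerate(self.h):
                self.w2[k][j] += lr * d * h
            self.w2[k][-1] += lr * d
        for j, d in enumerate(d_hid):
            for i, v in enumerate(x):
                self.w1[j][i] += lr * d * v
            self.w1[j][-1] += lr * d
        return sum((t - y) ** 2 for y, t in zip(out, target))

# XOR stands in for the digit data set; the training loop has the same shape.
data = [([0, 0], [0]), ([0, 1], [1]), ([1, 0], [1]), ([1, 1], [0])]
net = MLP(2, 4, 1)
first_loss = sum(net.train_step(x, t) for x, t in data)
for _ in range(2000):
    last_loss = sum(net.train_step(x, t) for x, t in data)
```

Because every hidden (and output) neuron's activation and weight update depends only on the previous layer's values, the inner loops can be executed concurrently, which is what makes the algorithm a natural fit for the GPU.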
Acknowledgments
The authors are grateful for the financial support from the research grant “Peer-production approaches to e-Learning (PPAeL),” Grant No. FDCT 019/2011/A1, offered by the Macau Fundo para o Desenvolvimento das Ciências e da Tecnologia.
Cite this article
Brito, R., Fong, S., Cho, K. et al. GPU-enabled back-propagation artificial neural network for digit recognition in parallel. J Supercomput 72, 3868–3886 (2016). https://doi.org/10.1007/s11227-016-1633-y