Abstract
This paper investigates an online gradient method with a penalty term for training feedforward neural networks with linear output. The penalty considered is the usual one: a term proportional to the norm of the weights. The main contribution is a theoretical proof that the network weights remain bounded throughout training. This boundedness is then used to prove almost sure convergence of the algorithm to the zero set of the gradient of the error function.
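To make the setting concrete, the following is a minimal sketch of how such an online gradient update with a weight-norm penalty might look for a one-hidden-layer network with linear output. All names and hyperparameters (`eta`, `lam`, `n_hidden`) are illustrative assumptions, not the paper's notation; the per-sample penalized error is taken to be E = ½e² + ½λ(‖w‖² + ‖V‖²).

```python
# Minimal sketch (assumed notation): online gradient descent with an
# L2 weight penalty for a one-hidden-layer network with linear output.
import numpy as np

rng = np.random.default_rng(0)

def train(X, y, n_hidden=8, eta=0.05, lam=1e-3, epochs=50):
    n_in = X.shape[1]
    V = 0.1 * rng.standard_normal((n_hidden, n_in))  # hidden-layer weights
    w = 0.1 * rng.standard_normal(n_hidden)          # linear output weights
    for _ in range(epochs):
        for x, t in zip(X, y):            # "online": update after each sample
            h = np.tanh(V @ x)            # hidden activations
            e = (w @ h) - t               # error of the linear output
            # Gradients of the penalized per-sample error
            # E = 0.5*e**2 + 0.5*lam*(||w||**2 + ||V||**2):
            grad_w = e * h + lam * w
            grad_V = e * np.outer(w * (1.0 - h**2), x) + lam * V
            w -= eta * grad_w
            V -= eta * grad_V
    return V, w
```

Note that the penalty contributes the terms `lam * w` and `lam * V` to each update, pulling the weights toward zero at every step; this shrinkage is the mechanism behind the boundedness result stated above.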
Cite this article
Zhang, H., Wu, W. Boundedness and Convergence of Online Gradient Method with Penalty for Linear Output Feedforward Neural Networks. Neural Process Lett 29, 205–212 (2009). https://doi.org/10.1007/s11063-009-9104-6