research-article

Global Optimization for Neural Network Training

Published: 01 March 1996

Abstract

Many learning algorithms are rooted in function-minimization algorithms, which can be classified as local- or global-minimization algorithms. Algorithms that focus on either extreme (purely local search or purely global search) do not work well. The authors propose a hybrid method called NOVEL (Nonlinear Optimization via External Lead) that combines global and local searches to explore the solution space, locate promising regions, and find local minima. To guide exploration of the solution space, NOVEL uses a continuous, terrain-independent trace that does not get trapped in local minima. It then uses a local gradient to attract the search to a local minimum, and the trace pulls the search out once little improvement is found. Finally, NOVEL selects one initial point for each promising region and starts a descent algorithm from each of these points to find local minima. It thus avoids spending computationally expensive descent runs, started from random points, on unpromising local minima. In an implementation using differential- and difference-equation solvers, NOVEL demonstrated superior performance in five benchmark comparisons against the best global optimization algorithms.
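
The trace-plus-gradient idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the one-dimensional objective `f`, the sinusoidal `trace`, the update rule dx/dt = -mu_g * f'(x) - mu_t * (x - T(t)), the forward-Euler integration, and all parameter values are assumptions chosen for illustration. The sketch only shows the general pattern of a time-driven trace pulling the search across basins, followed by plain gradient descent from the most promising points.

```python
import math

def f(x):
    """Toy multimodal objective (hypothetical, not from the article)."""
    return 0.1 * x * x + math.sin(3.0 * x)

def grad_f(x):
    """Derivative of the toy objective."""
    return 0.2 * x + 3.0 * math.cos(3.0 * x)

def trace(t, lo, hi):
    """Terrain-independent trace: depends on time only, never on f."""
    return 0.5 * (lo + hi) + 0.5 * (hi - lo) * math.sin(t)

def global_phase(lo, hi, mu_g=1.0, mu_t=5.0, dt=0.01):
    """Forward-Euler integration of the assumed search dynamics.

    The gradient term attracts x to nearby minima; the trace term
    pulls x onward so it cannot stay trapped in any one basin.
    """
    t = -math.pi / 2          # the trace starts at lo and sweeps to hi
    x = trace(t, lo, hi)
    path = [x]
    while t < math.pi / 2:
        dx = -mu_g * grad_f(x) - mu_t * (x - trace(t, lo, hi))
        x += dt * dx
        t += dt
        path.append(x)
    return path

def pick_starts(path, k=3):
    """Keep the k lowest dips of f along the trajectory as start points."""
    vals = [f(x) for x in path]
    dips = [path[i] for i in range(1, len(path) - 1)
            if vals[i] <= vals[i - 1] and vals[i] <= vals[i + 1]]
    return sorted(dips, key=f)[:k]

def descend(x, lr=0.05, steps=200):
    """Plain gradient descent, standing in for the local descent method."""
    for _ in range(steps):
        x -= lr * grad_f(x)
    return x

def hybrid_search(lo, hi):
    """Global trace-guided phase, then local descent from each start."""
    finals = [descend(x) for x in pick_starts(global_phase(lo, hi))]
    best = min(finals, key=f)
    return best, f(best)

if __name__ == "__main__":
    print(hybrid_search(-4.0, 4.0))
```

For this toy function the global minimum lies near x = -0.51, and the sketch should finish there when run on the interval [-4, 4]. Descent is started only from the lowest dips seen along the trajectory, which mirrors the abstract's point about avoiding expensive descent runs from random starting points.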

Published In

Computer, Volume 29, Issue 3
Special issue: neural computing: companion issue to Spring 1996 IEEE Computational Science & Engineering
March 1996
106 pages
ISSN: 0018-9162

Publisher

IEEE Computer Society Press

Washington, DC, United States

Cited By

  • (2024) Organ boundary delineation for automated diagnosis from multi-center using ultrasound images. Expert Systems with Applications: An International Journal, 238:PE. DOI: 10.1016/j.eswa.2023.122128. Online publication date: 27-Feb-2024
  • (2024) Understanding evolving user choices: a neural network analysis of TAXI and ride-hailing services in Barcelona. Soft Computing - A Fusion of Foundations, Methodologies and Applications, 28:5 (4649-4665). DOI: 10.1007/s00500-023-09239-w. Online publication date: 1-Mar-2024
  • (2022) An improved atom search optimization for optimization tasks. Multimedia Tools and Applications, 82:5 (6375-6429). DOI: 10.1007/s11042-022-13171-w. Online publication date: 23-Jul-2022
  • (2020) Dynamic search trajectory methods for global optimization. Annals of Mathematics and Artificial Intelligence, 88:1-3 (3-37). DOI: 10.1007/s10472-019-09661-7. Online publication date: 1-Mar-2020
  • (2019) Global optimization issues in deep network regression. Journal of Global Optimization, 73:2 (239-277). DOI: 10.1007/s10898-018-0701-7. Online publication date: 1-Feb-2019
  • (2018) Deep Learning Model Selection of Suboptimal Complexity. Automation and Remote Control, 79:8 (1474-1488). DOI: 10.1134/S000511791808009X. Online publication date: 1-Aug-2018
  • (2017) Metaheuristic design of feedforward neural networks. Engineering Applications of Artificial Intelligence, 60:C (97-116). DOI: 10.1016/j.engappai.2017.01.013. Online publication date: 1-Apr-2017
  • (2013) Fourier-assisted machine learning of hard disk drive access time models. Proceedings of the 8th Parallel Data Storage Workshop (45-51). DOI: 10.1145/2538542.2538561. Online publication date: 17-Nov-2013
  • (2012) Simultaneous optimization of artificial neural networks for financial forecasting. Applied Intelligence, 36:4 (887-898). DOI: 10.1007/s10489-011-0303-2. Online publication date: 1-Jun-2012
  • (2012) Agent-Based approach to RBF network training with floating centroids. Proceedings of the 4th international conference on Computational Collective Intelligence: technologies and applications - Volume Part II (453-462). DOI: 10.1007/978-3-642-34707-8_46. Online publication date: 28-Nov-2012
