I.E. Livieris and P. Pintelas. Performance Evaluation of Descent CG Methods for Neural Network Training. In: Proceedings of 9th Hellinic European Research on Computer and its Application Conference (HERCMA'09), Athens, Greece, September, 2009.

 

Abstract -  Conjugate gradient methods constitute an excellent choice for efficiently training large neural networks since they don't require the evaluation of the Hessian matrix neither the impractical storage of an approximation of it. Despite  the  theoretical  and  practical advantages  of these methods their main drawback is the use of restarting procedures in order to guarantee convergence, abandoning econd order derivative information. In this work, we propose a neural network training algorithm which preserves the advantages of classical conjugate gradient methods and simultaneously avoids the inefficient restarts. Encouraging numerical experiments verify that the presented algorithm provides fast, stable and reliable convergence.