Bartlett PL. (1998). The sample complexity of pattern classification with neural networks: The size of the weights is more important than the size of the network. IEEE Trans Inform Theory. 44
Bishop C. (1995). Neural Networks For Pattern Recognition.
Chen DS, Jain RC. (1994). A robust backpropagation learning algorithm for function approximation. IEEE transactions on neural networks. 5
Clarke FH. (1983). Optimization and nonsmooth analysis.
Flake GW. (1998). Square unit augmented, radially extended, multilayer perceptrons. Neural networks: Tricks of the trade.
Hagan MT, Menhaj MB. (1994). Training feedforward networks with the Marquardt algorithm. IEEE transactions on neural networks. 5
Haykin S. (1994). Neural Networks: A Comprehensive Foundation.
Hecht-Nielsen R, Sarajedini A. (1992). The best of both worlds: Casasent networks integrate multilayer perceptrons and radial basis functions. International Joint Conference On Neural Networks. 3
Hettmansperger TP, McKean JW. (1998). Robust nonparametric statistical methods.
Huber P. (1981). Robust statistics.
Kärkkäinen T, Heikkola E. (2002). Robust MLP. Tech. Rep. No. C1.
Kärkkäinen T, Majava K. (2000). Determination of regularization parameter in monotone active set method for image restoration. Proceedings of the Third European Conference on Numerical Mathematics and Advanced Applications.
Kärkkäinen T, Majava K, Mäkelä MM. (2001). Comparison of formulations and solution methods for image restoration problems. Inverse Problems. 17
Kemperman J. (1987). The median of a finite measure on a Banach space. Statistical data analysis based on the L1-norm and related methods.
Kosko B. (1992). Neural networks and fuzzy systems: A dynamical systems approach to machine intelligence.
Kärkkäinen T. (2002). MLP in layer-wise form with applications to weight decay. Neural computation. 14
Liano K. (1996). Robust error measure for supervised neural network learning with outliers. IEEE transactions on neural networks. 7
Lowe D, Broomhead DS. (1988). Multivariable functional interpolation and adaptive networks. Complex Systems. 2
Mäkelä MM, Neittaanmäki P. (1992). Nonsmooth optimization.
Milasevic P, Ducharme G. (1987). Uniqueness of the spatial median. Ann Stat. 15
Mitchell T. (1997). Machine learning.
Nocedal J, Wright SJ. (1999). Numerical optimization.
Oja H. (1999). Affine invariant multivariate sign and rank tests and corresponding estimates: A review. Scand J Statist. 26
Prechelt L. (1998). Early stopping - but when? Neural networks: Tricks of the trade.
Rao CR. (1988). Methodology based on the L1-norm, in statistical inference. Sankhyā Ser. 50
Raudys S. (1998). Evolution and generalization of a single neurone: I. Single-layer perceptron as seven statistical classifiers. Neural networks: the official journal of the International Neural Network Society. 11
Raudys S. (1998). Evolution and generalization of a single neurone: II. Complexity of statistical classifiers and sample size considerations. Neural networks: the official journal of the International Neural Network Society. 11
Raudys S. (2000). Evolution and generalization of a single neurone. III. Primitive, regularized, standard, robust and minimax regressions. Neural networks: the official journal of the International Neural Network Society. 13
Rognvaldsson TS. (1998). A simple trick for estimating the weight decay parameter. Neural networks: Tricks of the trade.
Rousseeuw PJ, Leroy AM. (1987). Robust regression and outlier detection.
Saito K, Nakano R. (2000). Second-order learning algorithm with squared penalty term. Neural computation. 12
Sarle W. (1997). The comp.ai.neural-nets frequently asked questions list. Available online at: http://www.faqs.org/faqs/ai-faq/neural-nets/part2/section-14.html