Devroye L, Gyorfi L, Lugosi G. (1996). A probabilistic theory of pattern recognition.
Erdogmus D, Principe J. (2000). Comparison of entropy and mean square error criteria in adaptive system training using higher order statistics Proceedings of the Intl. Conf. on ICA and Signal Separation.
Erdogmus D, Principe JC. (2002). An error-entropy minimization algorithm for supervised training of nonlinear adaptive systems IEEE Transactions On Signal Processing. 50
Kapur J. (1993). Maximum-entropy models in science and engineering (rev ed).
Kullback S. (1959). Information theory and statistics.
Linsker R. (1988). Self-organization in a perceptual network Computer. 2
Marques de Sa J. (2001). Pattern recognition: Concepts, methods and applications.
Marques de Sa J, Santos J, Alexandre L. (2004). The error entropy minimization algorithm for neural network classification Int. Conf. on Recent Advances in Soft Computing.
Marques de Sa J, Santos J, Alexandre L, Sereno F. (2004). Optimization of the error entropy minimization algorithm for neural network classification Intelligent Engineering Systems Through Artificial Neural Networks. 14
Newman D, Hettich S, Blake C, Merz C. (1998). UCI repository of machine learning databases Available online at http:--www.ics.uci.edu-mlearn-MLRepository.html.
Parzen E. (1962). On the estimation of a probability density function and mode Ann Math Stat. 33
Press WH, Teukolsky SA, Flannery BP, Vellerling WT. (1992). Numerical Recipes In C: The Art Of Scientific Computing.
Principe JC, Xu D, Fisher J. (2000). Information theoretic learning Unsupervised adaptive filtering: Blind source separation. 1
Shannon CE. (1948). The mathematical theory of communication Bell Syst Tech J. 27
Silva L, Marques de Sa J, Alexandre L. (2005). Neural network classification using Shannons entropy European Symposium on Artificial Neural Networks.
Silverman BW. (1986). Density Estimation for Statistics and Data Analysis, Monographs on Statistics and Applied Probability. 26
Stoller D. (1954). Univariate two-population distribution free discrimination J Am Stat Assoc. 49