Besag J. (1975). Statistical analysis of non-lattice data Statistician. 24
Gidas B. (1988). Consistency of maximum likelihood and pseudo-likelihood estimators for Gibbsian distributions Stochastic differential systems, stochastic control theory and applications.
Hinton G, Carreira-perpinan M. (2005). On contrastive divergence learning. Proceedings of the Society for Artificial Intelligence and Statistics.
Hinton GE. (2002). Training products of experts by minimizing contrastive divergence. Neural computation. 14 [PubMed]
Hyvarinen A. (2005). Estimation of non-normalized statistical models using score matching J Mach Learn Res. 6
Kappen H, Rodriguez F. (1998). Efficient learning in Boltzmann machines using linear response theory Neural Comput. 10
Mase S. (1995). Consistency of the maximum pseudo-likelihood estimator of continuous state space Gibbsian processes Ann Appl Prob. 5
Sejnowski TJ, Ackley DH, Hinton GE. (1985). A learning algorithm for Bolzmann machines. Cognitive Sci. 9
Welling M, Sutton C. (2005). Learning Markov random fields using contrastive free energies Proc Workshop Artif Intell Stat.