Szita I, Lorincz A. (2004). Kalman filter control embedded into the reinforcement learning framework. Neural computation. 16 [PubMed]

See more from authors: Szita I · Lorincz A

References and models cited by this paper

Baird LC. (1995). Residual algorithms: Reinforcement learning with function approximation Proc. 12th International Conference on Machine Learning (ICML-95).

Bousquet O, Balakrishnan K, Honavar V. (1998). Is the hippocampus a Kalman filter? Proceedings of the Pacific Symposium on Biocomputing.

Bradtke SJ. (1993). Reinforcement learning applied to linear quadratic regulation Advances in neural information processing systems. 5

Dayan P, Kakade S. (2000). Acquisition in autoshaping Advances in neural information processing systems. 12

Egorov AV, Hamam BN, Fransén E, Hasselmo ME, Alonso AA. (2002). Graded persistent activity in entorhinal cortex neurons. Nature. 420 [PubMed]

Gordon GJ. (2001). Reinforcement learning with function approximation converges to a region Advances in neural information processing systems. 13

Krose B, ten_Hagen S. (1998). Linear quadratic regulation using reinforcement learning Proceedings of the 8th Belgian-Dutch Conf. on Machine Learning.

Kéri S et al. (2002). Categories, prototypes and memory systems in Alzheimer's disease. Trends in cognitive sciences. 6 [PubMed]

Landelius T, Knutsson H. (1996). Greedy adaptive critics for LQR problems: Convergence proofs Tech. Rep. No. LiTH-ISY-R-1896.

Lorincz A, Szatmary B, Szirtes G. (2004). The mystery of structure and function of sensory processing areas of the neocortex: a resolution. J Comput Neurosci. 13

Lorincz A, Vogels R, Takacs B, Biederman I, Szirtes G. (2004). Relating priming and repetition suppression. Int J Neural Syst. 12

Lörincz A, Buzsáki G. (2000). Two-phase computational model training long-term memories in the entorhinal-hippocampal region. Annals of the New York Academy of Sciences. 911 [PubMed]

Montague PR, Dayan P, Sejnowski TJ. (1996). A framework for mesencephalic dopamine systems based on predictive Hebbian learning. The Journal of neuroscience : the official journal of the Society for Neuroscience. 16 [PubMed]

Murphy KP. (2000). A survey of POMDP solution techniques (Available on-line at: http:--www.ai.mit.edu-urphyk-Papers-pomdp.ps.gz).

Nádasdy Z, Hirase H, Czurkó A, Csicsvari J, Buzsáki G. (1999). Replay and time compression of recurring spike sequences in the hippocampus. The Journal of neuroscience : the official journal of the Society for Neuroscience. 19 [PubMed]

Rao RP, Ballard DH. (1997). Dynamic model of visual recognition predicts neural response properties in the visual cortex. Neural computation. 9 [PubMed]

Rao RP, Ballard DH. (1999). Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. Nature neuroscience. 2 [PubMed]

Schultz W, Dayan P, Montague PR. (1997). A neural substrate of prediction and reward. Science (New York, N.Y.). 275 [PubMed]

Skaggs WE, McNaughton BL. (1996). Replay of neuronal firing sequences in rat hippocampus during sleep following spatial experience. Science (New York, N.Y.). 271 [PubMed]

Todorov E, Jordan M. (2002). Supplementary notes for optimal feedback control as a theory of motor coordination Available on-line at: http:--www.nature.com-neuro-supplements-.

Touretzky DS, Daw ND, Courville AC. (2004). Timing and partial observability in the dopamine system Advances in neural information processing systems. 16

Tsitsiklis JN, Van_Roy B. (1996). An analysis of temporal-difference learning with function approximation Tech. Rep. No. LIDS-P-2322.

References and models that cite this paper

Daw ND, Courville AC, Touretzky DS. (2006). Representation and timing in theories of the dopamine system. Neural computation. 18 [PubMed]

This website requires cookies and limited processing of your personal data in order to function. By continuing to browse or otherwise use this site, you are agreeing to this use. See our Privacy policy and how to cite and terms of use.