Acquas E, Carboni E, Di Chiara G. (1991). Profound depression of mesolimbic dopamine release after morphine withdrawal in dependent rats. European journal of pharmacology. 193 [PubMed]
Barto AG, Sutton RS. (1990). Time-derivative models of Pavlovian reinforcement Learning and computational neuroscience: Foundations of adaptive networks.
Barto AG, Sutton RS, Watkins CJCH. (1989). Technical Report 89-95.
Bechara A, Damasio AR, Damasio H, Anderson SW. (1994). Insensitivity to future consequences following damage to human prefrontal cortex. Cognition. 50 [PubMed]
Bernheimer H, Birkmayer W, Hornykiewicz O, Jellinger K, Seitelberger F. (1973). Brain dopamine and the syndromes of Parkinson and Huntington. Clinical, morphological and neurochemical correlations. Journal of the neurological sciences. 20 [PubMed]
Bush RR, Mosteller F. (1955). Stochastic models for learning.
Crippens D, Robinson TE. (1994). Withdrawal from morphine or amphetamine: different effects on dopamine in the ventral-medial striatum studied with microdialysis. Brain research. 650 [PubMed]
DeLong MR, Crutcher MD, Georgopoulos AP. (1983). Relations between movement and single cell discharge in the substantia nigra of the behaving monkey. The Journal of neuroscience : the official journal of the Society for Neuroscience. 3 [PubMed]
Diana M, Pistis M, Carboni S, Gessa GL, Rossetti ZL. (1993). Profound decrement of mesolimbic dopaminergic neuronal activity during ethanol withdrawal syndrome in rats: electrophysiological and biochemical evidence. Proceedings of the National Academy of Sciences of the United States of America. 90 [PubMed]
Dickinson A. (1980). Contemporary animal learning theory.
Fibiger HC, Phillips AG. (1986). Reward, motivation, cognition: psycholobiologyof mesotelencephalic dopamine systems Handbook of physiology.The nervous system. Intrinsic regulatory systems of the brain. 14
Freeman AS, Bunney BS. (1987). Activity of A9 and A10 dopaminergic neurons in unrestrained rats: further characterization and effects of apomorphine and cholecystokinin. Brain research. 405 [PubMed]
Gallagher M, Holland PC. (1994). The amygdala complex: multiple roles in associative learning and attention. Proceedings of the National Academy of Sciences of the United States of America. 91 [PubMed]
Gallistel CR. (1990). The Organization of Learning.
Grossberg S, Levine DS. (1987). Neural dynamics of attentionally modulated Pavlovian conditioning: blocking, interstimulus interval, and secondary reinforcement. Applied optics. 26 [PubMed]
Grossberg S, Schmajuk NA. (1989). Neural dynamics of adaptive timing and temporal discrimination during associative learning Neural Netw. Networks2
HERRNSTEIN RJ. (1961). Relative and absolute strength of response as a function of frequency of reinforcement. Journal of the experimental analysis of behavior. 4 [PubMed]
Herrnstein RJ. (1991). Experiments on stable suboptimality in individual behavior Am Econ Rev Paoers Proc. 83
Houk JC, Adams JL, Barto AGA. (1995). A model of how the basal ganglia generate and use neural signals that predict reinforcement. Models Of Information Processing In The Basal Ganglia.
Kalman RE. (1960). A new approach to linear filtering and prediction problems Trans ASME J Basic Eng. 82
Klopf AH. (1982). The hedonistic neuron.
Koob GF, Bloom FE. (1988). Cellular and molecular mechanisms of drug dependence. Science (New York, N.Y.). 242 [PubMed]
Kotter R, Wickens J. (1995). Cellular models of reinforcement Models ofinformation processing in the basal ganglia.
Ljungberg T, Apicella P, Schultz W. (1992). Responses of monkey dopamine neurons during learning of behavioral reactions. Journal of neurophysiology. 67 [PubMed]
Luce RD, Raiffa H. (1957). Games and decisions: introduction and critical survey.
Mackintosh NJ. (1983). Conditioning and associative learning.
Montague PR. (1996). Biological substrates of predictive mechanisms in learning and action choice Neural-network approaches to cognition: biobehavioral foundations.
Montague PR, Dayan P, Person C, Sejnowski TJ. (1995). Bee foraging in uncertain environments using predictive hebbian learning. Nature. 377 [PubMed]
Montague PR, Person C, Egelman DM. (1995). A predictive model for diffuse systems matches human choices in a simple decision-making task Soc Neurosci Abstr. 21
Montague PR, Sejnowski TJ. (1994). The predictive brain: temporal coincidence and temporal order in synaptic learning mechanisms. Learning & memory (Cold Spring Harbor, N.Y.). 1 [PubMed]
Oades RD, Halliday GM. (1987). Ventral tegmental (A10) system: neurobiology. 1. Anatomy and connectivity. Brain research. 434 [PubMed]
Parsons LH, Smith AD, Justice JB. (1991). Basal extracellular dopamine is decreased in the rat nucleus accumbens during abstinence from chronic cocaine. Synapse (New York, N.Y.). 9 [PubMed]
Real LA. (1991). Animal choice behavior and the evolution of cognitive architecture. Science (New York, N.Y.). 253 [PubMed]
Real LA, Harder LD. (1987). Why are bumble bees risk averse? Ecology. 68
Romo R, Schultz W. (1990). Dopamine neurons of the monkey midbrain: contingencies of responses to active touch during self-initiated arm movements. Journal of neurophysiology. 63 [PubMed]
Rossetti ZL, Hmaidan Y, Gessa GL. (1992). Marked inhibition of mesolimbic dopamine release: a common feature of ethanol, morphine, cocaine and amphetamine abstinence in rats. European journal of pharmacology. 221 [PubMed]
Sawaguchi T, Goldman-Rakic PS. (1991). D1 dopamine receptors in prefrontal cortex: involvement in working memory. Science (New York, N.Y.). 251 [PubMed]
Schultz W. (1992). Activity of dopamine neurons in the behaving primate Semin Neurosci. 4
Schultz W, Apicella P, Ljungberg T. (1993). Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task. The Journal of neuroscience : the official journal of the Society for Neuroscience. 13 [PubMed]
Schultz W, Romo R. (1990). Dopamine neurons of the monkey midbrain: contingencies of responses to stimuli eliciting immediate behavioral reactions. Journal of neurophysiology. 63 [PubMed]
Sejnowski TJ, Doya K. (1995). A novel reinforcement model of birdsongvocalization learning Advances in neural information processing systems. 17
Sejnowski TJ, Montague PR, Dayan P, Nowlan SJ. (1993). Using aperiodic reinforcement for directed self-organization Advances in neural information processing systems .
Sejnowski TJ, Montague PR, Dayan P, Quartz SR. (1992). Expectation learning in the brain using diffuse ascending projections Soc Neuroscie Abstr. 18
Sutton RS. (1988). Learning to predict by the method of temporal diferences Machine Learning. 3
Sutton RS, Barto A. (1987). A temporal-difference model of classical conditioning Proceedings of the Ninth Annual Conference of the Cognitive Science Society.
Sutton RS, Barto AG. (1981). Toward a modern theory of adaptive networks: expectation and prediction. Psychological review. 88 [PubMed]
Tranel D, Damasio AR. (1985). Knowledge without awareness: an autonomic index of facial recognition by prosopagnosics. Science (New York, N.Y.). 228 [PubMed]
Wagner A, Rescorla R. (1972). A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and non-reinforcement Classical Conditioning II: Current Research and Theory.
Widrow B, Stearns SD. (1985). Adaptive signal processing.
Wise RA. (1982). Neuroleptics and operant behavior: the anhedonia hypothesis Behav Brain Sci. 5
Wise RA, Bozarth MA. (1984). Brain reward circuitry: four circuit elements "wired" in apparent series. Brain research bulletin. 12 [PubMed]
von_Neumann J, Morgenstearn O. (1947). Theory of games and economic behavior.
Bittner KC, Milstein AD, Grienberger C, Romani S, Magee JC. (2017). Behavioral time scale synaptic plasticity underlies CA1 place fields. Science (New York, N.Y.). 357 [PubMed]
Bogacz R, Gurney K. (2007). The basal ganglia and cortex implement optimal decision making between alternative actions. Neural computation. 19 [PubMed]
Daw ND, Courville AC, Touretzky DS. (2006). Representation and timing in theories of the dopamine system. Neural computation. 18 [PubMed]
Geerts JP, Chersi F, Stachenfeld KL, Burgess N. (2020). A general model of hippocampal and dorsal striatal learning and decision making. Proceedings of the National Academy of Sciences of the United States of America. 117 [PubMed]
Gruber AJ, Solla SA, Surmeier DJ, Houk JC. (2003). Modulation of striatal single units by expected reward: a spiny neuron model displaying dopamine-induced bistability. Journal of neurophysiology. 90 [PubMed]
Gurney KN, Humphries MD, Redgrave P. (2015). A new framework for cortico-striatal plasticity: behavioural theory meets in vitro data at the reinforcement-action interface. PLoS biology. 13 [PubMed]
Gutkin BS, Dehaene S, Changeux JP. (2006). A neurocomputational hypothesis for nicotine addiction. Proceedings of the National Academy of Sciences of the United States of America. 103 [PubMed]
Hasselmo ME. (2005). A model of prefrontal cortical mechanisms for goal-directed behavior. Journal of cognitive neuroscience. 17 [PubMed]
Hazy TE, Frank MJ, O'reilly RC. (2007). Towards an executive without a homunculus: computational models of the prefrontal cortex/basal ganglia system. Philosophical transactions of the Royal Society of London. Series B, Biological sciences. 362 [PubMed]
Izhikevich EM. (2007). Solving the distal reward problem through linkage of STDP and dopamine signaling. Cerebral cortex (New York, N.Y. : 1991). 17 [PubMed]
Kato A, Morita K. (2016). Forgetting in Reinforcement Learning Links Sustained Dopamine Signals to Motivation. PLoS computational biology. 12 [PubMed]
Keramati M, Dezfouli A, Piray P. (2011). Speed/accuracy trade-off between the habitual and the goal-directed processes. PLoS computational biology. 7 [PubMed]
Morita K, Kato A. (2014). Striatal dopamine ramping may indicate flexible reinforcement learning with forgetting in the cortico-basal ganglia circuits. Frontiers in neural circuits. 8 [PubMed]
Moustafa AA, Cohen MX, Sherman SJ, Frank MJ. (2008). A role for dopamine in temporal decision making and reward maximization in parkinsonism. The Journal of neuroscience : the official journal of the Society for Neuroscience. 28 [PubMed]
O'Reilly RC, Frank MJ. (2006). Making working memory work: a computational model of learning in the prefrontal cortex and basal ganglia. Neural computation. 18 [PubMed]
O`Reilly RC, Frank MJ. (2005). Making Working Memory Work: A Computational Model of Learning in the Prefrontal Cortex and Basal Ganglia Neural Comput. 18
Prescott TJ, Montes González FM, Gurney K, Humphries MD, Redgrave P. (2006). A robot model of the basal ganglia: behavior and intrinsic processing. Neural networks : the official journal of the International Neural Network Society. 19 [PubMed]
Rivest F, Kalaska JF, Bengio Y. (2010). Alternative time representation in dopamine models. Journal of computational neuroscience. 28 [PubMed]
Sakai Y, Fukai T. (2008). The actor-critic learning is behind the matching law: matching versus optimal behaviors. Neural computation. 20 [PubMed]
Sejnowski TJ, Destexhe A. (2000). Why do we sleep? Brain research. 886 [PubMed]
Sharp PE, Blair HT, Brown M. (1996). Neural network modeling of the hippocampal formation spatial signals and their possible role in navigation: a modular approach. Hippocampus. 6 [PubMed]
Smith AJ, Becker S, Kapur S. (2005). A computational model of the functional role of the ventral-striatal D2 receptor in the expression of previously acquired behaviors. Neural computation. 17 [PubMed]
Szita I, Lorincz A. (2004). Kalman filter control embedded into the reinforcement learning framework. Neural computation. 16 [PubMed]
Zannone S, Brzosko Z, Paulsen O, Clopath C. (2018). Acetylcholine-modulated plasticity in reward-driven navigation: a computational study. Scientific reports. 8 [PubMed]