ModelDB: Paper information

Florian RV. (2007). Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity. Neural computation. 19 [PubMed]

References and models cited by this paper

Abbott LF, Gerstner W. (2005). Homeostasis and learning through spike-timing dependent plasticity. Methods and models in neurophysics: Proceedings of the Les Houches Summer School 2003.

Abbott LF, Nelson SB. (2000). Synaptic plasticity: taming the beast. Nature neuroscience. 3 Suppl [PubMed]

Aizenman CD, Linden DJ. (2000). Rapid, synaptically driven increases in the intrinsic excitability of cerebellar deep nuclear neurons. Nature neuroscience. 3 [PubMed]

Alstrom P, Stassinopoulos D. (1995). Versatility and adaptive performance. Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics. 51 [PubMed]

Bak P, Stassinopoulos D. (1996). Democratic reinforcement: Learning via self organization. Available online at http://arxiv.org/abs/cond-mat/9601113.

Bartlett P, Baxter J. (1999). Hebbian synaptic modifications in spiking neurons that learn. Tech Rep.

Bartlett P, Baxter J. (2000). Stochastic optimization of controlled partially observable Markov decision processes. Proc 39th IEEE Conf Decision and Control.

Bartlett PL, Baxter J. (1999). Direct gradient-based reinforcement learning: I. Gradient estimation algorithms. Tech Rep Australian National University, Research School of Information Sciences and Engineering.

Bartlett PL, Baxter J. (2000). A biologically plausible and locally optimal learning algorithm for spiking neurons. Available online at http://arp.anu.edu.au/ftp/papers/jon/brains.pdf.gz.

Bartlett PL, Baxter J. (2000). Estimation and approximation bounds for gradient based reinforcement learning. Proc 13th Ann Conf Comput Learn Theory.

Bartlett PL, Baxter J. (2001). Infinite-horizon policy-gradient estimation. J Artif Intell Res. 15

Bartlett PL, Baxter J, Weaver L. (1999). Direct gradient-based reinforcement learning: II. Gradient ascent algorithms and experiments. Tech Rep Australian National University, Research School of Information Sciences and Engineering.

Bartlett PL, Baxter J, Weaver L. (2001). Experiments with infinite-horizon, policy-gradient estimation. J Artif Intell Res. 15

Barto AG. (1985). Learning by statistical cooperation of self-interested neuron-like computing elements. Human neurobiology. 4 [PubMed]

Barto AG, Anandan P. (1985). Pattern-recognizing stochastic learning automata. IEEE Trans Syst Man Cybernet. 15

Barto AG, Anderson CW. (1985). Structural learning in connectionist systems. Proc 7th Ann Conf Cogn Sci Soc.

Barto AG, Jordan MI. (1987). Gradient following without back-propagation in layered networks. Proc 1st IEEE Ann Conf Neural Networks. 2

Barto AG, Sutton RS. (1998). Reinforcement learning: an introduction.

A reinforcement learning example (Sutton and Barto 1998) [Model]

Bell AJ, Parra LC. (2005). Maximising sensitivity in a spiking network. Advances in neural information processing systems. 17

Bell CC, Han VZ, Sugawara Y, Grant K. (1997). Synaptic plasticity in a cerebellum-like structure depends on temporal order. Nature. 387 [PubMed]

Bi GQ, Poo MM. (1998). Synaptic modifications in cultured hippocampal neurons: dependence on spike timing, synaptic strength, and postsynaptic cell type. The Journal of neuroscience : the official journal of the Society for Neuroscience. 18 [PubMed]

Bohte SM. (2004). The evidence for neural information processing with precise spike-times: A survey. Natural Comput. 3

Chechik G. (2003). Spike-timing-dependent plasticity and relevant mutual information maximization. Neural computation. 15 [PubMed]

Cudmore RH, Turrigiano GG. (2004). Long-term potentiation of intrinsic excitability in LV visual cortical neurons. Journal of neurophysiology. 92 [PubMed]

Dan Y, Poo MM. (1992). Hebbian depression of isolated neuromuscular synapses in vitro. Science (New York, N.Y.). 256 [PubMed]

Dan Y, Poo MM. (2004). Spike timing-dependent plasticity of neural circuits. Neuron. 44 [PubMed]

Daoudal G, Debanne D. (2003). Long-term plasticity of intrinsic excitability: learning rules and mechanisms. Learning & memory (Cold Spring Harbor, N.Y.). 10 [PubMed]

Dauce E, Soula H, Beslon G. (2005). Learning methods for dynamic neural networks. Proc 2005 Intl Symp Nonlinear Theory and Its Applications.

Egger V, Feldmeyer D, Sakmann B. (1999). Coincidence detection and changes of synaptic efficacy in spiny stellate neurons in rat barrel cortex. Nature neuroscience. 2 [PubMed]

Fairhall AL, Farries MA. (2005). Reinforcement learning with modulated spike timing-dependent plasticity. Poster presented at the Computational and Systems Neuroscience Conference (COSYNE 2005). Available online at http://www.cosyne.org/climages/d/dy/COSYNE05 Abstracts.pdf.

Fairhall AL, Farries MA. (2005). Reinforcement learning with modulated spike timing-dependent plasticity. Program No. 384.3. 2005 Abstract Viewer/Itinerary Planner.

Florian RV. (2005). A reinforcement learning algorithm for spiking neural networks. Proc 7th Intl Symp Symbolic and Numeric Algorithms for Scientific Computing.

Froemke RC, Dan Y. (2002). Spike-timing-dependent synaptic modification induced by natural spike trains. Nature. 416 [PubMed]

Ganguly K, Kiss L, Poo M. (2000). Enhancement of presynaptic neuronal excitability by correlated presynaptic and postsynaptic spiking. Nature neuroscience. 3 [PubMed]

Gerstner W. (2001). A framework for spiking neuron models: The spike response method. The Handbook of Biological Physics.

Gerstner W, Aihara K, Toyoizumi T, Pfister JP. (2005). Spike-timing dependent plasticity and mutual information maximization for a spiking neuron model. Advances in neural information processing systems. 17

Gerstner W, Strosslin T. (2003). Reinforcement learning in continuous state and action space. Available online at http://lenpe7.epfl.ch/stroessl/publications/StrosslinGe03.pdf.

Gütig R, Aharonov R, Rotter S, Sompolinsky H. (2003). Learning input correlations through nonlinear temporally asymmetric Hebbian plasticity. The Journal of neuroscience : the official journal of the Society for Neuroscience. 23 [PubMed]

Han VZ, Grant K, Bell CC. (2000). Reversible associative depression and nonassociative potentiation at a parallel fiber synapse. Neuron. 27 [PubMed]

Hopfield JJ, Brody CD. (2004). Learning rules and network repair in spike-timing-based computation networks. Proceedings of the National Academy of Sciences of the United States of America. 101 [PubMed]

Huang YY, Simpson E, Kellendonk C, Kandel ER. (2004). Genetic evidence for the bidirectional modulation of synaptic plasticity in the prefrontal cortex by D1 receptors. Proceedings of the National Academy of Sciences of the United States of America. 101 [PubMed]

Kempter R, Gerstner W, van Hemmen JL. (2001). Intrinsic stabilization of output rates by spike-based Hebbian learning. Neural computation. 13 [PubMed]

Kistler WM, Gerstner W. (2002). Spiking neuron models.

Legenstein R, Naeger C, Maass W. (2005). What can a neuron learn with spike-timing-dependent plasticity? Neural computation. 17 [PubMed]

Li CY, Lu JT, Wu CP, Duan SM, Poo MM. (2004). Bidirectional modification of presynaptic neuronal excitability accompanying spike timing-dependent synaptic plasticity. Neuron. 41 [PubMed]

Lin YW, Min MY, Chiu TH, Yang HW. (2003). Enhancement of associative long-term potentiation by activation of beta-adrenergic receptors at CA1 synapses in rat hippocampal slices. The Journal of neuroscience : the official journal of the Society for Neuroscience. 23 [PubMed]

Markram H, Lübke J, Frotscher M, Sakmann B. (1997). Regulation of synaptic efficacy by coincidence of postsynaptic APs and EPSPs. Science (New York, N.Y.). 275 [PubMed]

Mazzoni P, Andersen RA, Jordan MI. (1991). A more biologically plausible learning rule for neural networks. Proceedings of the National Academy of Sciences of the United States of America. 88 [PubMed]

Mozer MC, Bohte SM. (2005). Reducing spike train variability: A computational theory of spike-timing dependent plasticity. Advances in neural information processing systems. 17

Muresan RC, Florian RV. (2006). Phase precession and recession with STDP and anti-STDP. Proc 16th Intl Conf Artif Neural Networks.

Neutze R. (1995). Ring interferometer with angular acceleration. Physical review. A, Atomic, molecular, and optical physics. 51 [PubMed]

Nick TA, Ribera AB. (2000). Synaptic activity modulates presynaptic excitability. Nature neuroscience. 3 [PubMed]

Pfister JP, Toyoizumi T, Barber D, Gerstner W. (2006). Optimal spike-timing-dependent plasticity for precise action potential firing in supervised learning. Neural computation. 18 [PubMed]

Press WH, Teukolsky SA, Flannery BP, Vetterling WT. (1992). Numerical Recipes In C: The Art Of Scientific Computing.

Rao RP, Sejnowski TJ. (2001). Spike-timing-dependent Hebbian plasticity as temporal difference learning. Neural computation. 13 [PubMed]

Roberts PD. (1999). Computational consequences of temporally asymmetric learning rules: I. Differential hebbian learning. Journal of computational neuroscience. 7 [PubMed]

Roberts PD, Bell CC. (2002). Spike timing dependent synaptic plasticity in biological systems. Biological cybernetics. 87 [PubMed]

Schultz W. (2002). Getting formal with dopamine and reward. Neuron. 36 [PubMed]

Seamans JK, Yang CR. (2004). The principal features and mechanisms of dopamine modulation in the prefrontal cortex. Progress in neurobiology. 74 [PubMed]

Sejnowski TJ, Pouget A, Deffayet C. (1995). Reinforcement learning predicts the site of plasticity for auditory remapping in the barn owl. Advances in neural information processing systems. 7

Seung HS. (2003). Learning in spiking neural networks by reinforcement of stochastic synaptic transmission. Neuron. 40 [PubMed]

Song S, Miller KD, Abbott LF. (2000). Competitive Hebbian learning through spike-timing-dependent synaptic plasticity. Nature neuroscience. 3 [PubMed]

Soula H, Beslon G, Alwan A. (2004). Obstacle avoidance learning in a spiking neural network. Poster presented at Last Minute Results of Simulation of Adaptive Behavior.

Soula H, Beslon G, Alwan A. (2005). Learning at the edge of chaos: Temporal coupling of spiking neuron controller of autonomous robotic. Proc AAAI Spring Symposia on Developmental Robotics.

Sutton RS. (1988). Learning to predict by the method of temporal differences. Machine Learning. 3

Takita K, Hagiwara M. (2002). A pulse neural network learning algorithm for POMDP environment. Proc 2002 Intl Joint Conf Neural Networks.

Takita K, Hagiwara M. (2005). A pulse neural network reinforcement learning algorithm for partially observable Markov decision processes. Systems And Computers In Japan. 36

Takita K, Hagiwara M, Osana Y. (2001). Reinforcement learning algorithm with network extension for pulse neural network. Trans Institute Electrical Engineers Of Japan. 121

Thiel CM, Friston KJ, Dolan RJ. (2002). Cholinergic modulation of experience-dependent plasticity in human auditory cortex. Neuron. 35 [PubMed]

Tsitsiklis JN, Marbach P. (1999). Simulation-based optimization of Markov reward processes: Implementation issues. Proc 38th Conf Decision and Control.

Tsitsiklis JN, Marbach P. (2000). Approximate gradient methods in policy-space optimization of Markov reward processes. Discrete Event Dynamic Systems: Theory and Applications. 13

Turrigiano GG, Nelson SB. (2004). Homeostatic plasticity in the developing nervous system. Nature reviews. Neuroscience. 5 [PubMed]

Williams RJ. (1992). Simple statistical gradient-following algorithms for connectionist reinforcement learning. Mach Learn. 8

Xie X, Seung HS. (2004). Learning in neural networks by reinforcement of irregular spiking. Physical review. E, Statistical, nonlinear, and soft matter physics. 69 [PubMed]

Xie X, Seung S. (2000). Spike-based learning rules and stabilization of persistent neural activity. Advances in neural information processing systems. 12

Zhang W, Linden DJ. (2003). The other side of the engram: experience-driven changes in neuronal intrinsic excitability. Nature reviews. Neuroscience. 4 [PubMed]

van Hemmen JL, Gerstner W, Kempter R. (1999). Hebbian learning and spiking neurons. Physical Review E. 59

References and models that cite this paper

Brzosko Z, Zannone S, Schultz W, Clopath C, Paulsen O. (2017). Sequential neuromodulation of Hebbian plasticity offers mechanism for effective reward-based navigation. eLife. 6 [PubMed]

Sequential neuromodulation of Hebbian plasticity in reward-based navigation (Brzosko et al 2017) [Model]

Chadderdon GL, Neymotin SA, Kerr CC, Lytton WW. (2012). Reinforcement learning of targeted movement in a spiking neuronal model of motor cortex. PloS one. 7 [PubMed]

Reinforcement learning of targeted movement (Chadderdon et al. 2012) [Model]

Legenstein R, Pecevski D, Maass W. (2008). A learning theory for reward-modulated spike-timing-dependent plasticity with application to biofeedback. PLoS computational biology. 4 [PubMed]

Reward modulated STDP (Legenstein et al. 2008) [Model]

Mozafari M, Kheradpisheh SR, Masquelier T, Nowzari-Dalini A, Ganjtabesh M. (2018). First-Spike-Based Visual Categorization Using Reward-Modulated STDP. IEEE Transactions on Neural Networks and Learning Systems.

First-Spike-Based Visual Categorization Using Reward-Modulated STDP (Mozafari et al. 2018) [Model]

Nakano T, Otsuka M, Yoshimoto J, Doya K. (2015). A spiking neural network model of model-free reinforcement learning with high-dimensional sensory input and perceptual ambiguity. PloS one. 10 [PubMed]

A spiking neural network model of model-free reinforcement learning (Nakano et al 2015) [Model]

Neymotin SA, Chadderdon GL, Kerr CC, Francis JT, Lytton WW. (2013). Reinforcement learning of two-joint virtual arm reaching in a computer model of sensorimotor cortex. Neural computation. 25 [PubMed]

Sensorimotor cortex reinforcement learning of 2-joint virtual arm reaching (Neymotin et al. 2013) [Model]

Richmond P, Buesing L, Giugliano M, Vasilaki E. (2011). Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations. PloS one. 6 [PubMed]

Democratic population decisions result in robust policy-gradient learning (Richmond et al. 2011) [Model]

Rivest F, Kalaska JF, Bengio Y. (2010). Alternative time representation in dopamine models. Journal of computational neuroscience. 28 [PubMed]

Alternative time representation in dopamine models (Rivest et al. 2009) [Model]

Zannone S, Brzosko Z, Paulsen O, Clopath C. (2018). Acetylcholine-modulated plasticity in reward-driven navigation: a computational study. Scientific reports. 8 [PubMed]

Acetylcholine-modulated plasticity in reward-driven navigation (Zannone et al 2018) [Model]