Baras D, Meir R. (2007). Reinforcement learning, spike-time-dependent plasticity, and the BCM rule. Neural computation. 19 [PubMed]
Chadderdon GL, Neymotin SA, Kerr CC, Lytton WW. (2012). Reinforcement learning of targeted movement in a spiking neuronal model of motor cortex. PloS one. 7 [PubMed]
Costa RP, Froemke RC, Sjöström PJ, van Rossum MC. (2015). Unified pre- and postsynaptic long-term plasticity enables reliable and flexible learning. eLife. 4 [PubMed]
Costa RP et al. (2017). Synaptic Transmission Optimization Predicts Expression Loci of Long-Term Plasticity. Neuron. 96 [PubMed]
Fiete IR, Fee MS, Seung HS. (2007). Model of birdsong learning based on gradient estimation by dynamic perturbation of neural conductances. Journal of neurophysiology. 98 [PubMed]
Florian RV. (2007). Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity. Neural computation. 19 [PubMed]
Izhikevich EM. (2007). Solving the distal reward problem through linkage of STDP and dopamine signaling. Cerebral cortex (New York, N.Y. : 1991). 17 [PubMed]
Nemenman I. (2005). Fluctuation-dissipation theorem and models of learning. Neural computation. 17 [PubMed]
Neymotin SA, Chadderdon GL, Kerr CC, Francis JT, Lytton WW. (2013). Reinforcement learning of two-joint virtual arm reaching in a computer model of sensorimotor cortex. Neural computation. 25 [PubMed]
Richmond P, Buesing L, Giugliano M, Vasilaki E. (2011). Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations. PloS one. 6 [PubMed]
Roelfsema PR, van Ooyen A. (2005). Attention-gated reinforcement learning of internal representations for classification. Neural computation. 17 [PubMed]
Sakai Y, Fukai T. (2008). The actor-critic learning is behind the matching law: matching versus optimal behaviors. Neural computation. 20 [PubMed]
Soltani A, Wang XJ. (2006). A biophysically based neural model of matching law behavior: melioration by stochastic synapses. The Journal of neuroscience : the official journal of the Society for Neuroscience. 26 [PubMed]
Swinehart CD, Abbott LF. (2005). Supervised learning through neuronal response modulation. Neural computation. 17 [PubMed]
Toyoizumi T, Pfister JP, Aihara K, Gerstner W. (2007). Optimality model of unsupervised spike-timing-dependent plasticity: synaptic memory and weight distribution. Neural computation. 19 [PubMed]
Whittington JCR, Bogacz R. (2017). An Approximation of the Error Backpropagation Algorithm in a Predictive Coding Network with Local Hebbian Synaptic Plasticity. Neural computation. 29 [PubMed]