Chadderdon GL, Neymotin SA, Kerr CC, Lytton WW. (2012). Reinforcement learning of targeted movement in a spiking neuronal model of motor cortex. PloS one. 7 [PubMed]
Nakano T, Otsuka M, Yoshimoto J, Doya K. (2015). A spiking neural network model of model-free reinforcement learning with high-dimensional sensory input and perceptual ambiguity. PloS one. 10 [PubMed]
Neymotin SA, Chadderdon GL, Kerr CC, Francis JT, Lytton WW. (2013). Reinforcement learning of two-joint virtual arm reaching in a computer model of sensorimotor cortex. Neural computation. 25 [PubMed]
Richmond P, Buesing L, Giugliano M, Vasilaki E. (2011). Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations. PloS one. 6 [PubMed]
Rivest F, Kalaska JF, Bengio Y. (2010). Alternative time representation in dopamine models. Journal of computational neuroscience. 28 [PubMed]