Izhikevich EM. (2007). Solving the distal reward problem through linkage of STDP and dopamine signaling. Cerebral cortex (New York, N.Y. : 1991). 17 [PubMed]
Porr B, Wörgötter F. (2007). Learning with "relevance": using a third factor to stabilize Hebbian learning. Neural computation. 19 [PubMed]
Richmond P, Buesing L, Giugliano M, Vasilaki E. (2011). Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations. PloS one. 6 [PubMed]
Roelfsema PR, van Ooyen A. (2005). Attention-gated reinforcement learning of internal representations for classification. Neural computation. 17 [PubMed]
Wörgötter F, Porr B. (2005). Temporal sequence learning, prediction, and control: a review of different models and their relation to biological mechanisms. Neural computation. 17 [PubMed]