Geist M, Pietquin O, Fricout G. (2009). Kalman temporal differences: the deterministic case Proceedings of the 2009 IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL-09).
Keramati M, Dezfouli A, Piray P. (2011). Speed/accuracy trade-off between the habitual and the goal-directed processes. PLoS computational biology. 7 [PubMed]