Houk JC, Adams JL, Barto AG: A model of how the basal ganglia generate and use neural signals that predictv reinforcement. Models of information processing in the basal ganglia. Edited by: Houk JC, Davis JL, Beiser DG. 1995, Cambridge (MA): The MIT Press, 249-270.
Khamassi M, Humphries MD: Integrating cortico-limbic-basal ganglia architectures for learning model-based and model-free navigation strategies. Front Behav Neurosci. 2012, 6:
Khamassi M, Lachèze L, Girard B, Berthoz A, Guillot A: Actor-Critic models of reinforcement learning in the basal ganglia: from natural to artificial rats. Adapt Behav. 2005, 13 (2): 131-148. 10.1177/105971230501300205.
Roesch MR, Calu DJ, Schoenbaum G: Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards. Nature neuroscience. 10 (12): 1615-1624.
Rueda-Orozco P, Robbe D: Striatal ensembles continuously represent animals kinematics and limb movement dynamics during execution of a locomotor habit. submitted.
Shadmehr R, Smith MA, Krakauer JW: Error correction, sensory prediction, and adaptation in motor control. Ann Rev Neurosci. 2010, 33: 89-108. 10.1146/annurev-neuro-060909-153135.
Sutton RS, Barto AG: Reinforcement learning: An introduction. 1998, Cambridge, MA: MIT press
Yin HH, Knowlton BJ: The role of the basal ganglia in habit formation. Nature Reviews Neuroscience. 2006, 7 (6): 464-476. 10.1038/nrn1919.