From: Which Temporal Difference learning algorithm best reproduces dopamine activity in a multi-choice task?