Buffet, Dutech, Charpillet, 2007. Shaping multi-agent systems with gradient reinforcement learning 15.. https://doi.org/10.1007/s10458-006-9010-5