Peters, J. Dept. Empirical Inference, Max Planck Institute for Intelligent Systems, Max Planck Society;
Link (Any fulltext)
Dann, C., Neumann, G., & Peters, J. (2014). Policy Evaluation with Temporal Differences: A Survey and Comparison. Journal of Machine Learning Research, 15, 809-883. Retrieved from http://www.jmlr.org/papers/volume15/dann14a/dann14a.pdf.