Förster A, Peters, J Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society; Dept. Empirical Inference, Max Planck Institute for Intelligent Systems, Max Planck Society;
Wierstra, D., Förster A, Peters, J., & Schmidhuber, J. (2010). Recurrent Policy Gradients. Logic Journal of the IGPL, 18(5), 620-634. doi:10.1093/jigpal/jzp049.