Publications

Johnson, E., Pike-Burke, C. and Rebeschini, P. (2023) “Optimal convergence rate for exact policy mirror descent in discounted Markov decision processes”, in Advances in Neural Information Processing Systems. NeurIPS, pp. 76496–76524.