Publications

Johnson, E., Pike-Burke, C. and Rebeschini, P. (2024) “Sample-efficiency in multi-batch reinforcement learning: the need for dimension-dependent adaptivity”, in Proceedings of the International Conference on Learning Representations (ICLR 2024). OpenReview.
Miao, N., Teh, Y. and Rainforth, T. (2024) “SelfCheck: using LLMs to zero-shot check their own step-by-step reasoning”, in The Twelfth International Conference on Learning Representations ICLR 2024. International Conference on Learning Representations.