Author Archives: Julien Jacques

ICML 2026

FĂ©licitations Ă  Abdelkrim Zitouni, ainsi qu’Ă  ses encadrants Nadia Kabachi et Juba Agoun pour sa publication acceptĂ©e Ă  ICML 2026 : « PAC-Bayesian Reinforcement Learning Trains Generalizable Policies »

10/12/25 : Offre de #stage Designing Task-Specific Reward and Loss Functions for Large Language Models #llm

Subject. Recent alignment techniques such as Reinforcement Learning from Human Feedbac (RLHF) [Christiano et al., 2017] and Reinforcement Learning from AI Feedback (RLAIF) [Bai et al., 2022] have improved the… Read more »