Xu, E\*., Ye, K\*., Zhou, H\*., Zhu, L., Quinzan, F. and **Shi, C**. (2025). [Doubly Robust Alignment for Large Language Models](https://arxiv.org/pdf/2506.01183), _NeurIPS_. **Python module** [<span style="font-family:courier;">**DRPO4LLM**</span>](https://github.com/DRPO4LLM/DRPO4LLM) <br/> [<font size="3">slides</font>](./slides/DRPO.pdf) [<font size="3">video</font>](https://www.bilibili.com/video/BV1xNuuzVEeD?spm_id_from=333.788.videopod.sections&vd_source=0ff25cf8645aa63231bec2428b94bf6f&p=3) <font size="3">presented Tsinghua Statistics + AI Frontier Summit</font>.
0 commit comments