You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: _pages/about.md
+10-2Lines changed: 10 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -13,23 +13,31 @@ I focus on advancing generative AI (both for transformer and diffusion) through
13
13
14
14
News
15
15
======
16
+
- (Dec. 2025) We release [DDRL](https://research.nvidia.com/labs/dir/ddrl/), a data-regularized RL algorithm for diffusion! It has been successfully used in NVIDIA [Cosmos-Predict2.5](https://research.nvidia.com/labs/dir/cosmos-predict2.5/), where I was responsible for RL. Check our [post](https://x.com/haotian_yeee/status/1997138677529825452?s=20)!
16
17
- (Dec. 2025) [Data Attribution for RL](https://arxiv.org/abs/2505.19281) was accepted by NeurIPS 2025 (<fontcolor="#DC143C">Oral</font>). See you in San Diego!
17
18
- (Nov. 2025) We released NVIDIA [Cosmos-Predict2.5](https://research.nvidia.com/labs/dir/cosmos-predict2.5/), the Cosmos World Foundation Models specialized for video generation. I am responsible for designing the RL algorithm and performing large-scale post-training (1K+ GPUs) for the release. The algorithm paper will be publicized soon.
18
19
- (Oct. 2025) [Diffusion Inference-Time Acceleration](https://arxiv.org/abs/2507.15260) was accepted by ICCV 2025. See you in Hawaii!
19
20
- (Apr. 2025) Google intern work [LLM Constrained Decoding](https://arxiv.org/abs/2504.09135) was accepted by AISTATS 2025, and [ICV-Hallucination](https://arxiv.org/abs/2410.15778) by ICLR 2025 (<fontcolor="#DC143C">Spotlight</font>). See you in Singapore and Phuket!
20
21
<!-- - (Sept. 2024) I started my part-time internship at [NVIDIA Deep Imagination Research](https://research.nvidia.com/labs/dir/). ->
21
22
<!-- - (Sept. 2024) Two papers was accepted by NeurIPS 2024, including [training-free guidance](https://arxiv.org/abs/2409.15761) (<font color="#DC143C">Spotlight</font>) and [geometric trajectory models](https://arxiv.org/abs/2410.13027). See you in Vancouver! ->
22
-
<!-- - (May. 2024) Three papers got accepted by ICML 2024, inclduing [LLM selection](https://arxiv.org/abs/2402.02314), [in-context vectors](https://arxiv.org/abs/2311.06668), and [monitoring AI usage in peer reviews](https://arxiv.org/pdf/2403.07183). See you in Vienna! -->
23
-
<!-- - (Feb. 2024) Our [Forward Laplacian paper](https://www.nature.com/articles/s42256-024-00794-x) is accepted by Nature Machine Intelligence! We release [LapJAX](https://github.com/YWolfeee/lapjax), a JAXbased package designed for accelerating general secondorder operators (e.g., Laplacian) computation. -->
23
+
<!-- - (May 2024) Three papers got accepted by ICML 2024, including [LLM selection](https://arxiv.org/abs/2402.02314), [in-context vectors](https://arxiv.org/abs/2311.06668), and [monitoring AI usage in peer reviews](https://arxiv.org/pdf/2403.07183). See you in Vienna! -->
24
+
<!-- - (Feb. 2024) Our [Forward Laplacian paper](https://www.nature.com/articles/s42256-024-00794-x) is accepted by Nature Machine Intelligence! We release [LapJAX](https://github.com/YWolfeee/lapjax), a JAX-based package designed for accelerating general second-order operators (e.g., Laplacian) computation. -->
24
25
25
26
Selected Publications
26
27
======
27
28
-**(In submission) Data-regularized Reinforcement Learning for Diffusion Models at Scale**
0 commit comments