You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
My Ph.D. research aims to advance generative AI capabilities for open problems through post-training and inference-time algorithm design. Specifically, for both text and vision generative models:
15
+
My Ph.D. research aims to advance model capabilities for open problems through post-training and inference-time algorithm design. Specifically, for both text and vision models:
16
16
- How do we continuously improve them via post-training, even at superhuman levels?
17
17
- How do we scale test-time compute effectively and efficiently?
18
18
@@ -22,13 +22,14 @@ Feel free to reach out if you are interested in my research or simply want to ch
22
22
23
23
<h2id="news">News</h2>
24
24
- (June 2026) I will be giving a few talks on SimpleTES. Please check out our [blog](https://haotianye.com/blog/simpletes/) for our thoughts, and let me know if you are interested in the work!
25
-
- (May 2026) Check out [SimpleTES](https://www.wizardquant.com/will/simpletes), our new work on strategically scaling evaluation-driven discovery loops — achieving state-of-the-art on 21 problems across six domains with open-source models. See [paper](https://arxiv.org/abs/2604.19341), [code](https://github.com/wq-will/SimpleTES), and [tweet](https://x.com/haotian_yeee/status/2056782234192166926).
25
+
- (May 2026) Check out [SimpleTES](https://www.wizardquant.com/will/simpletes), our new work on strategically scaling evaluation-driven discovery loops — achieving state-of-the-art on 21 problems across six domains with open-source models. See [paper](https://arxiv.org/abs/2604.19341), [code](https://github.com/wq-will/SimpleTES), and [X](https://x.com/haotian_yeee/status/2056782234192166926).
26
26
- (Apr. 2026) I passed my Ph.D. qualifying exam at Stanford!
27
27
- (Feb. 2026) [InfoTok](https://arxiv.org/abs/2512.16975) (<fontcolor="#DC143C">Oral</font>) and [Scaling Law Discovery](https://linhaowei1.github.io/scaling_law_discovery/) accepted by ICLR 2026. See you in Rio de Janeiro!
28
-
- (Feb. 2026) I had the pleasure of presenting our work on [DDRL](https://research.nvidia.com/labs/dir/ddrl/) at CalTech (hosted by [Anima Anandkumar](https://www.eas.caltech.edu/people/anima)) and ByteDance Seed (hosted by [Haoqi Fan](https://haoqifan.github.io/)). Thanks for the invitation!
29
-
- (Jan. 2026) We release an interesting [study](https://linhaowei1.github.io/scaling_law_discovery/) on whether AI can discover its own scaling laws better than humans. Check it out!
30
-
- (Jan. 2026) Check out [InfoTok](https://arxiv.org/abs/2512.16975), our adaptive information-theoretic tokenizer that achieves 40% higher compression without information loss and represents videos via coarse-grained to fine-grained sequences.
31
-
- (Dec. 2025) We release [DDRL](https://research.nvidia.com/labs/dir/ddrl/), the RL algorithm powering NVIDIA's [Cosmos-Predict2.5](https://research.nvidia.com/labs/dir/cosmos-predict2.5/) video generative models (2B/14B)! An amazing experience scaling the post-training of models on ~2,000 H100 GPUs. Check our [post](https://x.com/haotian_yeee/status/1997138677529825452?s=20) and [models](https://github.com/nvidia-cosmos/cosmos-predict2.5)!
28
+
<!-- - (Feb. 2026) I had the pleasure of presenting our work on [DDRL](https://research.nvidia.com/labs/dir/ddrl/) at CalTech (hosted by [Anima Anandkumar](https://www.eas.caltech.edu/people/anima)) and ByteDance Seed (hosted by [Haoqi Fan](https://haoqifan.github.io/)). Thanks for the invitation! -->
29
+
<!-- - (Jan. 2026) We release an interesting [study](https://linhaowei1.github.io/scaling_law_discovery/) on whether AI can discover its own scaling laws better than humans. Check it out! -->
30
+
<!-- - (Jan. 2026) Check out [InfoTok](https://arxiv.org/abs/2512.16975), our adaptive information-theoretic tokenizer that achieves 40% higher compression without information loss and represents videos via coarse-grained to fine-grained sequences. -->
31
+
<!-- - (Dec. 2025) We release [DDRL](https://research.nvidia.com/labs/dir/ddrl/), the RL algorithm powering NVIDIA's [Cosmos-Predict2.5](https://research.nvidia.com/labs/dir/cosmos-predict2.5/) video generative models (2B/14B)! An amazing experience scaling the post-training of models on ~2,000 H100 GPUs. Check our [post](https://x.com/haotian_yeee/status/1997138677529825452?s=20) and [models](https://github.com/nvidia-cosmos/cosmos-predict2.5)! -->
32
+
32
33
<!-- - (Dec. 2025) [Data Attribution for RL](https://arxiv.org/abs/2505.19281) was accepted by NeurIPS 2025 (<font color="#DC143C">Oral</font>). See you in San Diego! -->
33
34
<!-- - (Nov. 2025) We released NVIDIA [Cosmos-Predict2.5](https://research.nvidia.com/labs/dir/cosmos-predict2.5/), the Cosmos World Foundation Models specialized for video generation. I am responsible for designing the RL algorithm and performing large-scale post-training (1K+ GPUs) for the release. -->
34
35
<!-- - (Oct. 2025) [Diffusion Inference-Time Acceleration](https://arxiv.org/abs/2507.15260) was accepted by ICCV 2025. See you in Hawaii! -->
@@ -49,7 +50,7 @@ Feel free to reach out if you are interested in my research or simply want to ch
49
50
<divclass="pub-right">
50
51
<divclass="pub-title">SimpleTES: A General Framework for Strategically Scaling Evaluation-Driven Discovery Loops</div>
0 commit comments