<p>Nowadays, I'm mainly working on LLMs <a href="https://jingfengyang.github.io/gpt" style="color:#4133ff;">[Blog: Reproduction and Usage of GPT3/ChatGPT]</a>, including 1) pretraining (data, infrastructure, scaling laws); 2) post-training (instruction tuning, human and AI preference learning); 3) evaluation; 4) language agents (tool use, planning and reasoning, long-context handling); and 5) alignment and AI safety <a href="https://jingfengyang.github.io/safety" style="color:#4133ff;">[Blog: AI Safety: Why, What, and How]</a>.</p>