PhD student in Statistics at UC Irvine. Multi-task RL and multi-objective RL for LLM post-training.
This is a page not in the menu. You can use markdown in this page.