
Yuanpeng Li
PhD student in Statistics at UC Irvine. Multi-task RL and multi-objective RL for LLM post-training.
- Irvine, CA
- University of California, Irvine
- Google Scholar
- Github

PhD student in Statistics at UC Irvine. Multi-task RL and multi-objective RL for LLM post-training.