📝 Publications
PrePrint
- Reasoning and Tool-use Compete in Agentic RL: From Quantifying Interference to Disentangled Tuning. Yu Li, Mingyang Yi, Xiuyu Li, et al. Preprint, 2026.
- ETS: Energy-Guided Test-Time Scaling for Training-Free RL Alignment. Xiuyu Li*, Jinkai Zhang*, Mingyang Yi, et al. Preprint, 2026.