(1)
Ronald D. Peters; Yimingdong Cao; Mahesh Yadav. Meta-Reflective Reinforcement Learning for Adaptive Decision-Making in Tool-Using LLM Systems. CSET 2026, 4.