Ronald D. Peters, Yimingdong Cao, and Mahesh Yadav. “Meta-Reflective Reinforcement Learning for Adaptive Decision-Making in Tool-Using LLM Systems”. Computer Science and Engineering Transactions 4, no. 1 (May 22, 2026). Accessed June 24, 2026. https://csetx.org/index.php/cset/article/view/118.