Ronald D. Peters, Yimingdong Cao, and Mahesh Yadav. 2026. “Meta-Reflective Reinforcement Learning for Adaptive Decision-Making in Tool-Using LLM Systems.” Computer Science and Engineering Transactions 4 (1). https://csetx.org/index.php/cset/article/view/118.