Ronald D. Peters, Yimingdong Cao, & Mahesh Yadav. (2026). Meta-Reflective Reinforcement Learning for Adaptive Decision-Making in Tool-Using LLM Systems. Computer Science and Engineering Transactions, 4(1). Retrieved from https://csetx.org/index.php/cset/article/view/118