1. Ronald D. Peters, Yimingdong Cao, Mahesh Yadav. Meta-Reflective Reinforcement Learning for Adaptive Decision-Making in Tool-Using LLM Systems. CSET [Internet]. 2026 May 22 [cited 2026 Jun. 24];4(1). Available from: https://csetx.org/index.php/cset/article/view/118