RONALD D. PETERS; YIMINGDONG CAO; MAHESH YADAV. Meta-Reflective Reinforcement Learning for Adaptive Decision-Making in Tool-Using LLM Systems. Computer Science and Engineering Transactions, [S. l.], v. 4, n. 1, 2026. Disponível em: https://csetx.org/index.php/cset/article/view/118. Acesso em: 24 jun. 2026.