Multi-Agent Collaboration with Adversarial Filtering for Reliable Medical Diagnosis Support

Authors

  • Nathan Hawkins School of Computing, Clemson University, Clemson, SC, USA.
  • Keith Bell Department of Computer Science and Engineering, University of Nevada, Reno, Reno, NV, USA.

Keywords:

multi-agent systems, adversarial filtering, medical diagnosis support, robust AI, clinical decision support, socio-technical infrastructure, governance, algorithmic fairness

Abstract

The integration of artificial intelligence into clinical decision support systems promises to augment diagnostic accuracy and operational efficiency, yet the adoption of such systems remains constrained by concerns over reliability, robustness, and adversarial vulnerability. This paper proposes a novel architectural framework that combines multi-agent collaboration with adversarial filtering to enhance the trustworthiness of medical diagnosis support. The system comprises a suite of specialized diagnostic agents, each trained on distinct data modalities or clinical subdomains, whose outputs are aggregated through a central arbitrator. Prior to aggregation, an adversarial filtering module screens each agent contribution for potential manipulated or out-of-distribution inputs, thereby mitigating the impact of adversarial perturbations and improving overall system resilience. We examine the structural trade-offs inherent in such a distributed architecture, including the balance between agent specialization and coordination overhead, the computational cost of real-time filtering, and the implications for governance and fairness. Drawing on perspectives from large-scale socio-technical systems, we discuss deployment considerations such as interoperability with existing health information infrastructures, regulatory compliance, and sustainability of model maintenance. Ethical dimensions around algorithmic bias, accountability, and patient safety are analyzed within the context of adversarial filtering mechanisms. The paper further explores policy implications for certification and continuous monitoring of medical AI systems. By situating the proposed framework within the broader landscape of robust and trustworthy AI, we argue that multi-agent architectures enhanced with adversarial defenses offer a viable path toward reliable clinical decision support. The work contributes a systems-level blueprint for future research and practical implementation in high-stakes medical environments.

References

1. Topol, E. J. (2019). High-performance medicine: the convergence of human and artificial intelligence. Nature Medicine, 25(1), 44–56.

2. Rajpurkar, P., Chen, E., Banerjee, O., & Topol, E. J. (2022). AI in health and medicine. Nature Medicine, 28(1), 31–38.

3. Busoniu, L., Babuska, R., & De Schutter, B. (2008). A comprehensive survey of multiagent reinforcement learning. IEEE Transactions on Systems, Man, and Cybernetics, Part C, 38(2), 156–172.

4. Goodfellow, I. J., Shlens, J., & Szegedy, C. (2015). Explaining and harnessing adversarial examples. In International Conference on Learning Representations (ICLR).

5. Hu, S. (2026). Research on Security Enhancement Methods for Adversarial Robust Large Language Model Intelligent Agents for Medical Decision-Making Tasks. arXiv preprint arXiv:2605.08257.

6. Amodei, D., Olah, C., Steinhardt, J., Christiano, P., Schulman, J., & Mané, D. (2016). Concrete problems in AI safety. arXiv preprint arXiv:1606.06565.

7. Char, D. S., Shah, N. H., & Magnus, D. (2018). Implementing machine learning in health care—addressing ethical challenges. New England Journal of Medicine, 378(11), 981–983.

8. Danks, D., & London, A. J. (2017). Algorithmic bias in autonomous systems. In International Joint Conference on Artificial Intelligence (pp. 4691–4697).

9. Esteva, A., Kuprel, B., Novoa, R. A., Ko, J., Swetter, S. M., Blau, H. M., & Thrun, S. (2017). Dermatologist-level classification of skin cancer with deep neural networks. Nature, 542(7639), 115–118.

10. Finlayson, S. G., Bowers, J. D., Ito, J., Zittrain, J. L., Beam, A. L., & Kohane, I. S. (2019). Adversarial attacks on medical machine learning. Science, 363(6433), 1287–1289.

11. Ghassemi, M., Oakden-Rayner, L., & Beam, A. L. (2021). The false hope of current approaches to explainable artificial intelligence in health care. The Lancet Digital Health, 3(11), e745–e750.

12. Horvitz, E. (2022). AI in healthcare: Balancing opportunities and risks. Journal of the American Medical Association, 328(12), 1211–1212.

13. Jiang, F., Jiang, Y., Zhi, H., Dong, Y., Li, H., Ma, S., … & Wang, Y. (2017). Artificial intelligence in healthcare: past, present and future. Stroke and Vascular Neurology, 2(4), 230–243.

14. Kaur, D., Uslu, S., Rittichier, K. J., & Durresi, A. (2022). Trustworthy artificial intelligence: a review. ACM Computing Surveys, 55(2), 1–38.

15. Kelly, C. J., Karthikesalingam, A., Suleyman, M., Corrado, G., & King, D. (2019). Key challenges for delivering clinical impact with artificial intelligence. BMC Medicine, 17(1), 1–9.

16. Liu, Y., Chen, P. H. C., Krause, J., & Peng, L. (2019). How to read articles that use machine learning: users' guides to the medical literature. JAMA, 322(18), 1806–1816.

17. Matheny, M. E., Whicher, D., & Thadaney Israni, S. (2020). Artificial intelligence in health care: a report from the National Academy of Medicine. Journal of the American Medical Association, 323(6), 509–510.

18. Mittelstadt, B. D., Allo, P., Taddeo, M., Wachter, S., & Floridi, L. (2016). The ethics of algorithms: Mapping the debate. Big Data & Society, 3(2), 1–21.

19. Obermeyer, Z., & Emanuel, E. J. (2016). Predicting the future—big data, machine learning, and clinical medicine. New England Journal of Medicine, 375(13), 1216–1219.

20. Panch, T., Szolovits, P., & Atun, R. (2018). Artificial intelligence, machine learning and health systems. Journal of Global Health, 8(2), 020303.

21. Park, Y., & Ho, J. C. (2021). Artificial intelligence in healthcare: A systematic review of applications and challenges. Healthcare Informatics Research, 27(1), 3–16.

22. Raji, I. D., Smart, A., White, R. N., Mitchell, M., Gebru, T., Hutchinson, B., … & Barnes, P. (2020). Closing the AI accountability gap: Defining an end-to-end framework for internal algorithmic auditing. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency (pp. 33–44).

23. Saria, S., Butte, A., & Saria, S. (2021). The case for multi-agent systems in clinical decision support. Journal of Biomedical Informatics, 114, 103674.

24. Stoyanovich, J., Van Bavel, J. J., & West, I. V. (2020). The imperative of computational transparency in AI. Nature Machine Intelligence, 2(10), 590–592.

25. Yu, K. H., Beam, A. L., & Kohane, I. S. (2018). Artificial intelligence in healthcare. Nature Biomedical Engineering, 2(10), 719–731.

Downloads

Published

2026-05-08

How to Cite

Nathan Hawkins, & Keith Bell. (2026). Multi-Agent Collaboration with Adversarial Filtering for Reliable Medical Diagnosis Support. Computer Science and Engineering Transactions, 4(1). Retrieved from https://csetx.org/index.php/cset/article/view/127