Causal AI-AugETM: Integrating Structural Causal Models with Exposure–Toxicity Modeling for Robust Drug Safety Assessment Under Confounding Bias

Larry Diaz; Samuel Parker; George Baker; Hugo Thornton

Authors

Larry Diaz Department of Computer Science, University of Houston, Houston, TX, USA.
Samuel Parker Department of Computer Science, Binghamton University, Binghamton, NY, USA.
George Baker Department of Computer Science, University of New Hampshire, Durham, NH, USA.
Hugo Thornton Department of Electrical Engineering and Computer Science, University of Kansas, Lawrence, KS, USA.

Keywords:

causal inference, structural causal models, exposure-toxicity modeling, drug safety, confounding bias, AI-augmented clinical trials, robust assessment, socio-technical infrastructure, fairness, governance

Abstract

The assessment of drug safety in early-phase clinical trials and post-market surveillance is persistently challenged by confounding bias arising from non-randomized treatment assignment, time-varying exposures, and unmeasured covariates. Traditional exposure-toxicity joint models, while effective in capturing dose-response relationships, lack the structural causal reasoning necessary to distinguish genuine toxicity signals from spurious associations. This paper introduces Causal AI-AugETM, a comprehensive framework that integrates structural causal models with an AI-augmented exposure-toxicity joint modeling paradigm. The framework systematically embeds causal graphs, intervention calculus, and counterfactual reasoning into the exposure-toxicity modeling pipeline, enabling robust estimation of causal effects under complex confounding scenarios. Emphasis is placed on the system-level architecture that governs data ingestion, causal discovery, model inference, and feedback loops for continuous updating. The design explicitly addresses structural trade-offs between model flexibility and interpretability, computational feasibility and statistical precision, as well as fairness across demographic subgroups that may be differentially affected by confounding by indication. Furthermore, the paper discusses governance and policy implications, including regulatory validation standards, transparency requirements, and accountability mechanisms for AI-driven safety assessments. Deployment considerations such as scalability via cloud-native and federated infrastructures, sustainability of model retraining cycles, and integration with existing pharmacovigilance systems are examined. Cross-domain comparisons with causal inference applications in climate science and econometrics provide additional insight into best practices. The Causal AI-AugETM framework represents a principled step toward trustworthy, causally informed drug safety assessment that can adapt to evolving real-world data environments while maintaining scientific rigor and societal trust.

References

1. Hernán, M. A., & Robins, J. M. (2020). Causal inference: What if. Chapman & Hall/CRC.

2. Tsiatis, A. A., & Davidian, M. (2004). Joint modeling of longitudinal and time-to-event data: An overview. Statistica Sinica, 14(3), 809–834.

3. Esteva, A., Robicquet, A., Ramsundar, B., Kuleshov, V., DePristo, M., Chou, K., ... & Dean, J. (2019). A guide to deep learning in healthcare. Nature Medicine, 25(1), 24–29.

4. Topol, E. J. (2019). High-performance medicine: The convergence of human and artificial intelligence. Nature Medicine, 25(1), 44–56.

5. D’Amour, A., Heller, K., Moldovan, D., Adlam, B., Alaa, A., Beutel, A., ... & Zhang, L. (2020). Underspecification presents challenges for credibility in modern machine learning. Journal of Machine Learning Research, 23(1), 1–61.

6. Kusner, M. J., Loftus, J. R., Russell, C., & Silva, R. (2017). Counterfactual fairness. In Advances in Neural Information Processing Systems (pp. 4066–4076).

7. Rizopoulos, D. (2012). Joint models for longitudinal and time-to-event data with applications in R. CRC Press.

8. Daniel, R. M., Cousens, S. N., De Stavola, B. L., Kenward, M. G., & Sterne, J. A. C. (2013). Methods for dealing with time-dependent confounding. Statistics in Medicine, 32(9), 1584–1618.

9. Pearl, J. (2009). Causality (2nd ed.). Cambridge University Press.

10. Spirtes, P., Glymour, C., & Scheines, R. (2000). Causation, prediction, and search (2nd ed.). MIT Press.

11. Robins, J. M., & Hernán, M. A. (2009). Estimation of the causal effects of time-varying exposures. In Advances in longitudinal data analysis (pp. 553–599). CRC Press.

12. Athey, S., & Imbens, G. W. (2016). Recursive partitioning for heterogeneous causal effects. Proceedings of the National Academy of Sciences, 113(27), 7353–7360.

13. Li, Y., Wang, J., Ye, J., & Reddy, C. K. (2017). A multi-task learning formulation for survival analysis. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 1715–1724).

14. Ranganath, R., Perotte, A., Elhadad, N., & Blei, D. M. (2016). Deep survival analysis. In Machine Learning for Healthcare Conference (pp. 101–114).

15. Zhang, Z., Chen, L., & Xu, Y. (2023). Neural joint models for exposure-toxicity analysis with high-dimensional covariates. Journal of Biomedical Informatics, 140, 104321.

16. Wang, Y. (2025, August). AI-AugETM: An AI-augmented exposure–toxicity joint modeling framework for personalized dose optimization in early-phase clinical trials. In 2025 19th International Conference on Complex Medical Engineering (CME) (pp. 182-186). IEEE.

17. Johansson, F. D., Shalit, U., & Sontag, D. (2016). Learning representations for counterfactual inference. In International Conference on Machine Learning (pp. 3020–3029).

18. van der Laan, M. J., & Rose, S. (2011). Targeted learning: Causal inference for observational and experimental data. Springer.

19. Glymour, C., Zhang, K., & Spirtes, P. (2019). Review of causal discovery methods based on graphical models. Frontiers in Genetics, 10, 524.

20. Arjovsky, M., Bottou, L., Gulrajani, I., & Lopez-Paz, D. (2019). Invariant risk minimization. arXiv preprint arXiv:1907.02893.

21. Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A., & Rubin, D. B. (2013). Bayesian data analysis (3rd ed.). CRC Press.

22. Molenberghs, G., & Kenward, M. G. (2007). Missing data in clinical studies. Wiley.

23. Saria, S., & Subbaswamy, A. (2019). Tutorial: Safe and reliable machine learning. arXiv preprint arXiv:1904.07204.

24. Robins, J. M. (1986). A new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect. Mathematical Modelling, 7(9–12), 1393–1512.

25. Bang, H., & Robins, J. M. (2005). Doubly robust estimation in missing data and causal inference models. Biometrics, 61(4), 962–973.

26. U.S. Food and Drug Administration. (2021). Artificial intelligence/machine learning (AI/ML)-based software as a medical device (SaMD) action plan. FDA.

27. Dwork, C., Hardt, M., Pitassi, T., Reingold, O., & Zemel, R. (2012). Fairness through awareness. In Proceedings of the 3rd Innovations in Theoretical Computer Science Conference (pp. 214–226).

28. Li, T., Sahu, A. K., Talwalkar, A., & Smith, V. (2020). Federated learning: Challenges, methods, and future directions. IEEE Signal Processing Magazine, 37(3), 50–60.

29. Strubell, E., Ganesh, A., & McCallum, A. (2019). Energy and policy considerations for deep learning in NLP. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (pp. 3645–3650).

30. Hannart, A., Pearl, J., Otto, F. E. L., Naveau, P., & Ghil, M. (2016). Causal counterfactual theory for the attribution of weather and climate events. Journal of Climate, 29(8), 3001–3024.

Causal AI-AugETM: Integrating Structural Causal Models with Exposure–Toxicity Modeling for Robust Drug Safety Assessment Under Confounding Bias

Authors

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License

Make a Submission

Journal Information

Current Issue

Information

Indexing & Infrastructure