Graph-Based Human Activity Reasoning from Multi-Person Motion Trajectories

Elliot Wood; Walid Karlsson

Authors

Elliot Wood Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA.
Walid Karlsson Department of Computer Science and Engineering, University at Buffalo, Buffalo, NY, USA.

Keywords:

graph neural networks, multi-agent trajectory prediction, human activity recognition, socio-technical systems, relational reasoning, fairness in AI, smart infrastructure

Abstract

Understanding collective human activity from multi-person motion trajectories presents a fundamental challenge at the intersection of computer vision, graph theory, and socio-technical systems engineering. This paper introduces a graph-based reasoning framework designed to infer high-level social and functional activities from raw trajectory data captured across spatially distributed environments. Unlike conventional activity recognition approaches that rely on single-agent classifiers or frame-level appearance features, the proposed framework models each individual as a node within a dynamically evolving graph, with edges encoding relational attributes such as proximity, velocity correlation, interaction duration, and role asymmetry. We argue that such graph representations are uniquely suited to capture the structural and temporal dependencies inherent in multi-agent scenarios, including crowd movement, collaborative tasks, and adversarial behaviors. The paper systematically examines the architectural trade-offs between static and time-varying graph models, the integration of trajectory encoding with relational inference mechanisms, and the computational scalability required for real-time deployment in urban surveillance, smart infrastructure, and autonomous coordination systems. We further explore governance and fairness implications, particularly concerning bias propagation through learned relational priors, privacy risks associated with trajectory reconstruction, and the need for transparent audit mechanisms in high-stakes environments. Through a cross-domain analysis spanning sports analytics, pedestrian modeling, and industrial warehouse coordination, we demonstrate that graph-based reasoning offers a robust, interpretable, and policy-aware alternative to end-to-end black-box models. The paper concludes with a forward-looking discussion on sustainable deployment architectures, federated learning over distributed sensor networks, and the role of regulatory frameworks in shaping the responsible adoption of trajectory-based activity inference.

References

1. Alahi, A., Goel, K., Ramanathan, V., Robicquet, A., Fei-Fei, L., & Savarese, S. (2016). Social LSTM: Human trajectory prediction in crowded spaces. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 961–971.

2. Gupta, A., Johnson, J., Fei-Fei, L., Savarese, S., & Alahi, A. (2018). Social GAN: Socially acceptable trajectories with generative adversarial networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2255–2264.

3. Kipf, T. N., & Welling, M. (2017). Semi-supervised classification with graph convolutional networks. International Conference on Learning Representations.

4. Veličković, P., Cucurull, G., Casanova, A., Romero, A., Liò, P., & Bengio, Y. (2018). Graph attention networks. International Conference on Learning Representations.

5. Sankar, A., Wu, Y., Gou, L., Zhang, W., & Yang, H. (2020). DySAT: Deep neural representation learning on dynamic graphs via self-attention networks. Proceedings of the 13th International Conference on Web Search and Data Mining, 519–527.

6. Xu, D., Ruan, C., Körpeoglu, E., Kumar, S., & Achan, K. (2020). Self-attention with relative position representations for human activity recognition. Advances in Neural Information Processing Systems, 33, 1511–1522.

7. Li, Y., Yu, R., Shahabi, C., & Liu, Y. (2018). Diffusion convolutional recurrent neural network: Data-driven traffic forecasting. International Conference on Learning Representations.

8. Zhou, B., Tang, X., & Wang, X. (2015). Learning collective crowd behaviors with dynamic pedestrian-agents. International Journal of Computer Vision, 111(1), 50–68.

9. Hamilton, W. L., Ying, R., & Leskovec, J. (2017). Inductive representation learning on large graphs. Advances in Neural Information Processing Systems, 30.

10. Chen, J., Ma, T., & Xiao, C. (2018). FastGCN: Fast learning with graph convolutional networks via importance sampling. International Conference on Learning Representations.

11. Choi, S., Kim, J., & Choo, J. (2020). Adaptive temporal sampling for efficient video understanding. Proceedings of the European Conference on Computer Vision, 123–139.

12. Satyanarayanan, M. (2017). The emergence of edge computing. Computer, 50(1), 30–39.

13. Wang, S., Zhang, X., Zhang, Y., Wang, L., Yang, J., & Wang, W. (2020). A survey on mobile edge networks: Convergence of computing, caching and communications. IEEE Access, 8, 1480–1500.

14. Buolamwini, J., & Gebru, T. (2018). Gender shades: Intersectional accuracy disparities in commercial gender classification. Proceedings of the 1st Conference on Fairness, Accountability and Transparency, 77–91.

15. Mehrabi, N., Morstatter, F., Saxena, N., Lerman, K., & Galstyan, A. (2021). A survey on bias and fairness in machine learning. ACM Computing Surveys, 54(6), 1–35.

16. Doshi-Velez, F., & Kim, B. (2017). Towards a rigorous science of interpretable machine learning. arXiv preprint arXiv:1702.08608.

17. Jin, H., Yi, H., Zhao, W., Luo, J., Ye, S., Guan, Z., ... & Yu, T. (2026). HY-Himmel Technical Report: Hierarchical Interleaved Multi-stream Motion Encoding for Long Video Understanding. arXiv preprint arXiv:2605.08158.

18. Voigt, P., & Von dem Bussche, A. (2017). The EU General Data Protection Regulation (GDPR): A practical guide. Springer International Publishing.

19. Le, H. M., Carr, P., Yue, Y., & Lucey, P. (2017). Data-driven ghosting using deep imitation learning. Proceedings of the ACM SIGGRAPH Conference on Motion in Games, 1–6.

20. Felsen, P., Lucey, P., & Savarese, S. (2018). Learning social etiquette in human-robot interaction from human-human trajectories. Proceedings of the IEEE International Conference on Robotics and Automation, 750–757.

21. Helbing, D., & Molnar, P. (1995). Social force model for pedestrian dynamics. Physical Review E, 51(5), 4282–4286.

22. Moussaïd, M., Helbing, D., & Theraulaz, G. (2011). How simple rules determine pedestrian behavior and crowd disasters. Proceedings of the National Academy of Sciences, 108(17), 6884–6888.

23. Luo, Y., Cai, P., Bera, A., Hsu, D., & Manocha, D. (2018). PORCA: Modeling and prediction of human motion in crowded environments with occlusion handling. ACM Transactions on Graphics, 37(4), 1–12.

24. Kumar, A., Gupta, S., & Malik, J. (2020). Learning navigation behaviors with graph neural networks. Proceedings of the Conference on Robot Learning, 110–120.

25. Wu, Z., Pan, S., Chen, F., Long, G., Zhang, C., & Yu, P. S. (2021). A comprehensive survey on graph neural networks. IEEE Transactions on Neural Networks and Learning Systems, 32(1), 4–24.

Graph-Based Human Activity Reasoning from Multi-Person Motion Trajectories

Authors

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License

Make a Submission

Journal Information

Current Issue

Information

Indexing & Infrastructure