Efficient interaction recognition in video for edge devices: a lightweight approach

Tóm tắt

Efficient and accurate recognition of human interactions is crucial for numerous service applications, including security surveillance and public safety. However, achieving real-time interaction recognition on resource-constrained edge devices poses significant computational challenges. In this paper, we propose a lightweight methodology for detecting human activity and interactions in video streams, specifically tailored for edge computing environments. Our approach utilizes distance estimation and interaction detection based on pose estimation techniques, enabling rapid analysis of video data while conserving computational resources. By leveraging a distance grid for proximity analysis and TensorFlow's MoveNet for pose estimation, our method achieves promising results in interaction recognition. We demonstrate the feasibility of our approach through empirical evaluation and discuss its potential implications for real-world deployment on edge devices.

Tài liệu tham khảo

Azimi, S., De Sio, C., & Sterpone, L. (2023). Enhanced Video Surveillance Systems for “Signal for Help” Detection on Edge Devices. 2023 IEEE International Symposium on Technology and Society (ISTAS), 1–4. https://doi.org/10.1109/ISTAS57930.2023.10305989

Deng, Y., Han, T., & Ansari, N. (2020). FedVision: Federated Video Analytics With Edge Computing. IEEE Open Journal of the Computer Society, 1, 62–72. https://doi.org/10.1109/OJCS.2020.2996184

Ezzat, M. A., Abd El Ghany, M. A., Almotairi, S., & Salem, M. A.-M. (2021). Horizontal Review on Video Surveillance for Smart Cities: Edge Devices, Applications, Datasets, and Future Trends. Sensors, 21(9), 3222. https://doi.org/10.3390/s21093222

Guo, Y., Zou, B., Ren, J., Liu, Q., Zhang, D., & Zhang, Y. (2019). Distributed and Efficient Object Detection via Interactions Among Devices, Edge, and Cloud. IEEE Transactions on Multimedia, 21(11), 2903–2915. https://doi.org/10.1109/TMM.2019.2912703

Huang, Y., Zhao, H., Qiao, X., Tang, J., & Liu, L. (2021). Towards Video Streaming Analysis and Sharing for Multi-Device Interaction with Lightweight DNNs. IEEE INFOCOM 2021 - IEEE Conference on Computer Communications, 1–10. https://doi.org/10.1109/INFOCOM42981.2021.9488846

Kim, J.-H., Kim, N., & Won, C. S. (2021). Deep Edge Computing for Videos. IEEE Access, 9, 123348–123357. https://doi.org/10.1109/ACCESS.2021.3109904

Nikouei, S. Y., Chen, Y., Aved, A. J., & Blasch, E. (2021). I-ViSE: Interactive Video Surveillance as an Edge Service Using Unsupervised Feature Queries. IEEE Internet of Things Journal, 8(21), 16181–16190. https://doi.org/10.1109/JIOT.2020.3016825

Patrikar, D. R., & Parate, M. R. (2022). Anomaly detection using edge computing in video surveillance system: Review. International Journal of Multimedia Information Retrieval, 11(2), 85–110. https://doi.org/10.1007/s13735-022-00227-8

Wang, F., Zhang, M., Wang, X., Ma, X., & Liu, J. (2020). Deep Learning for Edge Computing Applications: A State-of-the-Art Survey. IEEE Access, 8, 58322–58336. https://doi.org/10.1109/ACCESS.2020.2982411

Wang, Q., Fang, W., & Xiong, N. N. (2024). TLEE: Temporal-Wise and Layer-Wise Early Exiting Network for Efficient Video Recognition on Edge Devices. IEEE Internet of Things Journal, 11(2), 2842–2854. https://doi.org/10.1109/JIOT.2023.3293506

Wang, Y., Zhu, A., Ma, H., Ai, L., Song, W., & Zhang, S. (2023). 3D-ShuffleViT: An Efficient Video Action Recognition Network with Deep Integration of Self-Attention and Convolution. Mathematics, 11(18), 3848. https://doi.org/10.3390/math11183848.