Multi-strategy Optimization for Cross-modal Pedestrian Re-identification Based on Deep Q-Network Reinforcement Learning
Abstract
Cross-modal pedestrian re-identification (C-ReID) is a crucial task in computer vision, aiming to match pedestrian identities across different modalities of data. This paper proposes a reinforcement learningbased framework, RLCMPRF, to tackle the challenges of modality variability, data diversity, annotation difficulties, and optimal strategy selection. RLCMPRF uses deep Q-network (DQN) reinforcement learning to dynamically select the best feature extraction and matching strategies, ensuring robustness against these challenges. We introduce a dual-stream network to process multimodal images, followed by a feature fusion layer for integration. The DQN-based strategy learning is complemented by a reward function designed to optimize matching accuracy, speed, and robustness. Experimental results demonstrate that RLCMPRF outperforms state-of-the-art methods based on deep learning, attention mechanisms, meta-learning, and generative adversarial networks. RLCMPRF achieves a success rate of 82% and an average cumulative reward of 150, showing improvements in convergence speed and generalization ability across multiple datasets.
Full Text:
PDFReferences
References
. Bouzid A, Sierra-Sosa D, Elmaghraby A. A Robust pedestrian re-identification and out-of-distribution detection framework. Drones. 2023; 7(6).
. Liu SL, Zhang SM, Diao ZJ, Fang ZB, Jiao ZY, Zhong ZY. Pedestrian re-identification based on attention mechanism and Multi- scale feature fusion. Mathematical Biosciences and Engineering. 2023; 20(9):16913-38.
. Zheng Y, Zhou Y, Zhao JQ, Jian M, Yao R, Liu B, Chen Y. A siamese pedestrian alignment network for person re-identification. Multimedia Tools and Applications. 2021; 80(25):33951-70.
. Organisciak D, Sakkos D, Ho ESL, Aslam N, Shum HPH. Unifying person and vehicle re-identification. IEEE Access. 2020; 8:115673-84.
. Wang YY, Li X, Jiang MX, Zhang HY, Tang E. Cross-view pedestrian clustering via graph convolution network for unsupervised person re-identification. Journal of Intelligent & Fuzzy Systems. 2020; 39(3):4453-62.
. Cheng KY, Tao F, Zhan YZ, Li MZ, Li KL. Hierarchical attributes learning for pedestrian re-identification via parallel stochastic gradient descent combined with momentum correction and adaptive learning rate. Neural Computing & Applications. 2020; 32(10):5695-712.
. Zhao YB, Zhu SH. Occluded pedestrian re-identification via Res-ViT double-branch hybrid network. Multimedia Systems. 2024; 30(1).
. Xue C, Deng ZL, Wang S, Hu EW, Zhang Y, Yang WW, Wang YM. GLSFF: Global-local specific feature fusion for cross-modality pedestrian re-identification. Computer Communications. 2024; 215:157-68.
. Ke X, Lin XR, Qin LY. Lightweight convolutional neural network-based pedestrian detection and re-identification in multiple scenarios. Machine Vision and Applications. 2021; 32(2).
. Zhang XQ, Wang XX, Gu CH. Online multi-object tracking with pedestrian re-identification and occlusion processing. Visual Computer. 2021; 37(5):1089 -99.
. Yang XF, Zhou ZH, Wang QS, Wang ZW, Li X, Li HF. Cross-domain unsupervised pedestrian re-identification based on multi-view decomposition. Multimedia Tools and Applications. 2022; 81(27):39387-408.
. Wei D, Hu XQ, Wang ZY, Shen JL, Ren HJ. Pose-Guided multi-scale structural relationship learning for video-based pedestrian re-identification. IEEE Access. 2021; 9:34845-58.
. Chen H, Zhao Y, Wang SG. Person re-identification based on contour information embedding. Sensors. 2023; 23(2).
. Chen MZY, Banitaan S, Maleki M. Pedestrian group re-identification and trajectory prediction through zone-based clustering. IEEE Access. 2024; 12: 101549-62.
. Liu YJ, Shao WB, Sun XR. Learn robust pedestrian representation within minimal modality discrepancy for visible-infrared person re-identification. Journal of Computer Science and Technology. 2022; 37(3):641-51.
. Manzoor S, An YC, In GG, Zhang YY, Kim S, Kuc TY. SPT: Single pedestrian tracking framework with re-identification-based learning using the Siamese model. Sensors. 2023; 23(10).
. Han H, Zhou MC, Shang XW, Cao W, Abusorrah A. KISS plus for rapid and accurate pedestrian re-identification. IEEE Transactions on Intelligent Transportation Systems. 2021; 22(1):394-403.
. Hong F, Lu CH, Tao W, Jiang WW. OMNet: Object-Perception multi-branch network for pedestrian re-identification. Big Data Research. 2022; 27.
. Yun X, Ge M, Sun YJ, Dong KW, Hou XF. Margin CosReid network for pedestrian re-identification. Applied Sciences-Basel. 2021; 11(4).
. Grigorev A, Tian ZH, Rho S, Xiong JX, Liu SH, Jiang F. Deep person re-identification in UAV images. Eurasip Journal on Advances in Signal Processing. 2019; (1).
. Cheng KY, Xu FJ, Tao F, Qi M, Li MZ. Data-driven pedestrian re-identification based on hierarchical semantic representation. Concurrency and Computation-Practice & Experience. 2018; 30(23).
. Yu ZY, Li LS, Xie JL, Wang CS, Li WJ, Ning X. Pedestrian 3D Shape understanding for person re-identification via multi-view learning. IEEE Transactions on Circuits and Systems for Video Technology. 2024; 34(7):5589-602.
. Shan YX, Yu G, Gao YH. Pedestrian re-identification based on gait analysis. IEEE Access. 2023; 11:106013-23.
. Zhao BT, Wang YJ, Su KK, Ren H, Han XY. Semi-supervised pedestrian re-identification via a teacher-student model with similarity-preserving generative adversarial networks. Applied Intelligence. 2023; 53(2):1605-18.
. Lei MW, Song YC, Zhao JD, Wang X, Lyu J, Xu JD, Yan WQ. End-to-End network for pedestrian detection, tracking and re-identification in real-time surveillance system. Sensors. 2022; 22(22).
. Sun ZJ, Wang X, Zhang YL, Song YC, Zhao JD, Xu JD, et al. A comprehensive review of pedestrian re-identification based on deep learning. Complex & Intelligent Systems. 2024; 10(2):1733-68.
. Bouzid A, Sierra-Sosa D, Elmaghraby A. Directional statistics-based deep metric learning for pedestrian tracking and re-identification. Drones. 2022; 6(11).
. Geng SZ, Yu M, Liu Y, Yu Y, Bai J. Re-ranking pedestrian re-identification with multiple Metrics. Multimedia Tools and Applications. 2019; 78(9):11631- 53.
. Deng X, Liao KY, Zheng YL, Lin GF, Lei H. A deep multi-feature distance metric learning method for pedestrian re-identification. Multimedia Tools and Applications. 2021; 80(15):23113-31.
. Yang XF, Wang QS, Li WK, Zhou ZH, Li HF. Unsupervised domain adaptation pedestrian re-identification based on an improved dissimilarity space. Image and Vision Computing. 2022; 118.
DOI: https://doi.org/10.31449/inf.v49i11.7247

This work is licensed under a Creative Commons Attribution 3.0 License.