A Multi-Scale Deformable Convolutional Neural Network with Adaptive Adjustment for Robust Packaging Image Recognition

Wen Sun

Abstract


This paper presents an innovative image recognition model for package inspection, designed to fulfill the demands of real-time and precise categorization in difficult industrial environments. Traditional techniques reliant on human feature extraction frequently underperform when confronted with lighting variability, background interference, and deformation of package items. To mitigate these limitations, the proposed model integrates a multi-scale convolutional architecture that captures both local and global characteristics through the use of parallel convolutional filters of varying sizes. An adaptive adjustment method is incorporated into the network to dynamically alter the placement of convolutional operations according to image content, hence improving flexibility and feature representation. A thorough data augmentation strategy incorporating geometric transformation, brightness modification, and semantic-level blending is implemented to boost the model's robustness and generalization capacity. Experiments performed on a bespoke industrial packaging dataset comprising 10,000 labeled images reveal that the proposed model attains a classification accuracy of 96.8 percent, a recall of 95.3 percent, and an F1-score of 93.8 percent, with an inference time of 11.2 milliseconds and a parameter count of 21.3 million. In comparison to current deep learning architectures like Residual Networks, the model demonstrates considerable enhancements in accuracy and speed. These results support its appropriateness for practical packaging inspection systems.


Full Text:

PDF

References


V. Pagire, M. Chavali, A. Kale, A comprehensive review of object detection with traditional and deep learning methods, Signal Processing 237 (2025) 110075. https://doi.org/10.1016/j.sigpro.2025.110075.

S. Chen, D. Liu, Y. Pu, Y. Zhong, Advances in deep learning-based image recognition of product packaging, Image Vis Comput 128 (2022) 104571. https://doi.org/10.1016/j.imavis.2022.104571.

S. John, A. Danti, Lightweight Model for Occlusion Removal from Face Images, Annals of Emerging Technologies in Computing 8 (2024) 1–14. https://doi.org/10.33166/AETiC.2024.02.001.

S. Aisyah, F.S. Nainggolan, M. Simanjuntak, E.A. Lubis, Food Packaging Search Application From Text Image In Android With Deep Convolutional Neural Network (DCNN) Method, J Phys Conf Ser 1230 (2019) 012078. https://doi.org/10.1088/1742-6596/1230/1/012078.

L.D. Medus, M. Saban, J. V. Francés-Víllora, M. Bataller-Mompeán, A. Rosado-Muñoz, Hyperspectral image classification using CNN: Application to industrial food packaging, Food Control 125 (2021) 107962. https://doi.org/10.1016/j.foodcont.2021.107962.

H. Fırat, M.E. Asker, M.İ. Bayindir, D. Hanbay, Spatial-spectral classification of hyperspectral remote sensing images using 3D CNN based LeNet-5 architecture, Infrared Phys Technol 127 (2022) 104470. https://doi.org/10.1016/j.infrared.2022.104470.

O. Jarkas, J. Hall, S. Smith, R. Mahmud, P. Khojasteh, J. Scarsbrook, R.K.L. Ko, ResNet and Yolov5-enabled non-invasive meat identification for high-accuracy box label verification, Eng Appl Artif Intell 125 (2023) 106679. https://doi.org/10.1016/j.engappai.2023.106679.

S. Zheng, T. Zhou, Z. Li, Image Segmentation with Multi-Scale Feature Fusion of Local Binary Patterns for Flexible Integrated Circuit Packaging Substrates, in: 2023 China Automation Congress (CAC), IEEE, 2023: pp. 5704–5708. https://doi.org/10.1109/CAC59555.2023.10451417.

X. Yang, M. Han, H. Tang, Q. Li, X. Luo, Detecting Defects With Support Vector Machine in Logistics Packaging Boxes for Edge Computing, IEEE Access 8 (2020) 64002–64010. https://doi.org/10.1109/ACCESS.2020.2984539.

H. Zoubir, M. Rguig, M. El Aroussi, A. Chehri, R. Saadane, Concrete Bridge Crack Image Classification Using Histograms of Oriented Gradients, Uniform Local Binary Patterns, and Kernel Principal Component Analysis, Electronics (Basel) 11 (2022) 3357. https://doi.org/10.3390/electronics11203357.

M.A. Chandra, S.S. Bedi, Survey on SVM and their application in image classification, International Journal of Information Technology 13 (2021) 1–11. https://doi.org/10.1007/s41870-017-0080-1.

A. Rahman, L. He, H. Wang, Activation function optimization scheme for image classification, Knowl Based Syst 305 (2024) 112502. https://doi.org/10.1016/j.knosys.2024.112502.

R. Schäfer, T. Nicke, H. Höfener, A. Lange, D. Merhof, F. Feuerhake, V. Schulz, J. Lotz, F. Kiessling, Overcoming data scarcity in biomedical imaging with a foundational multi-task model, Nat Comput Sci 4 (2024) 495–509. https://doi.org/10.1038/s43588-024-00662-z.

A.R. Munappy, J. Bosch, H.H. Olsson, A. Arpteg, B. Brinne, Data management for production quality deep learning models: Challenges and solutions, Journal of Systems and Software 191 (2022) 111359. https://doi.org/10.1016/j.jss.2022.111359.

C. Zhang, M. Han, J. Jia, C. Kim, Packaging Design Image Segmentation Based on Improved Full Convolutional Networks, Applied Sciences 14 (2024) 10742. https://doi.org/10.3390/app142210742.

K. He, X. Zhang, S. Ren, J. Sun, Deep Residual Learning for Image Recognition, in: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016: pp. 770–778. https://doi.org/10.1109/CVPR.2016.90.




DOI: https://doi.org/10.31449/inf.v49i9.9547

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.