Performance Evaluation of the Convolutional Neural Networks for Object Identification Using RGB and Binary Images

Jasim Mohammed Dahr, Alaa Sahl Gaafar

Abstract


Convolutional Neural Network (CNN) is a topmost deep learning technique because of its capability to learn features autonomously with domain-specific images similar to the classical machine learning approaches. One common tactic for training CNN architecture is to transfer learned knowledge of a pre-trained network used to perform one task into a fresh task. The quest to detect objects more accurately remains an open research area. The use of traditional classification methods is widespread, but limited by time-ineffectiveness, and subjectiveness. The advent of CNN attempts to estimate and extract features inside images for the improved precision of image classification. This paper evaluates the two CNN models (CNN-1 and CNN-2) for object identification with two input image formats.  The input image-types including: binary, and RGB colour images collected from the MINST and CIFAR-10 databases respectively. The training and validation of the two CNN models were conducted on the Google Colaboratory Virtual Machine with sampled images. The results showed that, the object identification task for the RGB colour images outperformed binary images-based system in terms of speed (14.20ms to 29.00ms). Conversely, the object identification task with the binary images was superior to the RGB colour images for the loss function (3.20% to 88.30%), and accuracy (99.10% to 71.06%). These results were comparable to existing studies using the speed and accuracy. Thus, the paper has demonstrated the image data types significantly impact on speed, loss, and accuracy, which offer guides for evolving reliable applications like surveillance, medical diagnosis, agriculture, and transportation systems.


Full Text:

PDF

References


B. Mueller, T. Kinoshita, A. Peebles, M. A. Graber, and S. Lee, “Artificial intelligence and machine learning in emergency medicine: a narrative review,” Acute medicine & surgery, vol. 9, no. 1, p. e740, 2022.

M. Kumar et al., “Healthcare Internet of Things (H-IoT): Current trends, future prospects, applications, challenges, and security issues,” Electronics (Basel), vol. 12, no. 9, p. 2050, 2023.

N. Altwaijry and I. Al-Turaiki, “Arabic handwriting recognition system using convolutional neural network,” Neural Comput Appl, vol. 33, no. 7, pp. 2249–2261, 2021.

A. Abbas, M. M. Abdelsamea, and M. M. Gaber, “Classification of COVID-19 in chest X-ray images using DeTraC deep convolutional neural network,” Applied Intelligence, vol. 51, pp. 854–864, 2021.

A. Mohiyuddin, A. R. Javed, C. Chakraborty, M. Rizwan, M. Shabbir, and J. Nebhen, “Secure cloud storage for medical IoT data using adaptive neuro-fuzzy inference system,” International Journal of Fuzzy Systems, vol. 24, no. 2, pp. 1203–1215, 2022.

J. Lu, L. Tan, and H. Jiang, “Review on convolutional neural network (CNN) applied to plant leaf disease classification,” Agriculture, vol. 11, no. 8, p. 707, 2021.

Y. Amethiya, P. Pipariya, S. Patel, and M. Shah, “Comparative analysis of breast cancer detection using machine learning and biosensors,” Intelligent Medicine, vol. 2, no. 2, pp. 69–81, 2022.

Y. Li, M. Li, J. Qi, D. Zhou, Z. Zou, and K. Liu, “Detection of typical obstacles in orchards based on deep convolutional neural network,” Comput Electron Agric, vol. 181, p. 105932, 2021.

A. Díaz-Álvarez, M. Clavijo, F. Jiménez, and F. Serradilla, “Inferring the driver’s lane change intention through lidar-based environment analysis using convolutional neural networks,” Sensors, vol. 21, no. 2, p. 475, 2021.

A. Hazra, P. Choudhary, S. Inunganbi, and M. Adhikari, “Bangla-Meitei Mayek scripts handwritten character recognition using convolutional neural network,” Applied Intelligence, vol. 51, no. 4, pp. 2291–2311, 2021.

M. K. Benkaddour, S. Lahlali, and M. Trabelsi, “Human age and gender classification using convolutional neural network,” in 2020 2nd international workshop on human-centric smart environments for health and well-being (IHSH), IEEE, 2021, pp. 215–220.

S. S. Alqahtany, A. B. Alkhodre, A. Al Abdulwahid, and M. Alohaly, “A Dynamic Multi-Layer Steganography Approach Based on Arabic Letters’ Diacritics and Image Layers,” Applied Sciences, vol. 13, no. 12, p. 7294, 2023.

D. H. Hubel and T. N. Wiesel, “Receptive fields, binocular interaction and functional architecture in the cat’s visual cortex,” J Physiol, vol. 160, no. 1, p. 106, 1962.

K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition,” arXiv preprint arXiv:1409.1556, 2014.

D. Yu, Q. Xu, H. Guo, C. Zhao, Y. Lin, and D. Li, “An efficient and lightweight convolutional neural network for remote sensing image scene classification,” Sensors, vol. 20, no. 7, p. 1999, 2020.

R. Pramanik and S. Bag, “Segmentation‐based recognition system for handwritten Bangla and Devanagari words using conventional classification and transfer learning,” IET Image Process, vol. 14, no. 5, pp. 959–972, 2020.

A. Chaudhary, A. Hazra, and P. Chaudhary, “Diagnosis of chest diseases in x-ray images using deep convolutional neural network,” in 2019 10th International Conference on Computing, Communication and Networking Technologies (ICCCNT), IEEE, 2019, pp. 1–6.

S. Malakar, S. Paul, S. Kundu, S. Bhowmik, R. Sarkar, and M. Nasipuri, “Handwritten word recognition using lottery ticket hypothesis based pruned CNN model: a new benchmark on CMATERdb2. 1.2,” Neural Comput Appl, vol. 32, pp. 15209–15220, 2020.

M. A. H. Akhand, M. Ahmed, M. M. H. Rahman, and M. M. Islam, “Convolutional neural network training incorporating rotation-based generated patterns and handwritten numeral recognition of major Indian scripts,” IETE J Res, vol. 64, no. 2, pp. 176–194, 2018.

R. Ghosh, C. Vamshi, and P. Kumar, “RNN based online handwritten word recognition in Devanagari and Bengali scripts using horizontal zoning,” Pattern Recognit, vol. 92, pp. 203–218, 2019.

M. Tan, R. Pang, and Q. V Le, “Efficientdet: Scalable and efficient object detection,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 10781–10790.

R. Yang and Y. Yu, “Artificial convolutional neural network in object detection and semantic segmentation for medical imaging analysis,” Front Oncol, vol. 11, p. 638182, 2021.

F. G. Venhuizen et al., “Deep learning approach for the detection and quantification of intraretinal cystoid fluid in multivendor optical coherence tomography,” Biomed Opt Express, vol. 9, no. 4, pp. 1545–1569, 2018.

R. Girshick, J. Donahue, T. Darrell, and J. Malik, “Rich feature hierarchies for accurate object detection and semantic segmentation,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2014, pp. 580–587.

J. Zhou, K. Feng, and L. Luo, “Research on fine-grained pattern recognition based on attention pattern-generated model,” in IOP Conference Series: Earth and Environmental Science, IOP Publishing, 2019, p. 022038.

Z. Akata, S. Reed, D. Walter, H. Lee, and B. Schiele, “Evaluation of output embeddings for fine-grained image classification,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 2927–2936.

J. S. A. V. Gokaraju, W. K. Song, M.-H. Ka, and S. Kaitwanidvilai, “Human and bird detection and classification based on Doppler radar spectrograms and vision images using convolutional neural networks,” Int J Adv Robot Syst, vol. 18, no. 3, p. 17298814211010568, 2021.

S. Paisitkriangkrai, J. Sherrah, P. Janney, and V.-D. Hengel, “Effective semantic pixel labelling with convolutional networks and conditional random fields,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2015, pp. 36–43.

D. Dais, I. E. Bal, E. Smyrou, and V. Sarhosis, “Automatic crack classification and segmentation on masonry surfaces using convolutional neural networks and transfer learning,” Autom Constr, vol. 125, p. 103606, 2021.

W. Yang, X. Zhang, and P. Luo, “Transferability of convolutional neural network models for identifying damaged buildings due to earthquake,” Remote Sens (Basel), vol. 13, no. 3, p. 504, 2021.

X.-B. Fu, S.-L. Yue, and D.-Y. Pan, “Camera-based basketball scoring detection using convolutional neural network,” International Journal of Automation and Computing, vol. 18, no. 2, pp. 266–276, 2021.

R. J. S. Raj, S. J. Shobana, I. V. Pustokhina, D. A. Pustokhin, D. Gupta, and K. Shankar, “Optimal feature selection-based medical image classification using deep learning model in internet of medical things,” IEEE Access, vol. 8, pp. 58006–58017, 2020.

R. Prasetya and A. Ridwan, “Data mining application on weather prediction using classification tree, naïve bayes and K-nearest neighbor algorithm with model testing of supervised learning probabilistic brier score, confusion matrix and ROC,” J. Appl. Commun. Inf. Technol, vol. 4, no. 2, pp. 25–33, 2019.




DOI: https://doi.org/10.31449/inf.v48i21.6568

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.