Job Resumes Recommendation using Integration of Fuzzy Discernibility Matrix Feature Selection and Convolutional Neural Network Multi-label Text Classification

Regiolina Hayami, Nooraini Yusoff, Kauthar Mohd Daud, Yulia Fatma

Abstract


The manual analysis of job resumes poses specific challenges, including the time-intensive process and the high likelihood of human error, emphasizing the need for automation in content-based recommendations. Recent advancements in deep learning, particularly Convolutional Neural Networks (CNN) for Multi-label Text Classification (MLTC), offer a promising solution for addressing these challenges through artificial intelligence. While CNN is renowned for its robust feature extraction capabilities, it faces specific challenges such as managing high-dimensional data and poor data interpretability. To address these limitations, this study employs the Fuzzy Discernibility Matrix (FDM) feature selection technique to determine the relevance of skills for each job vacancy. FDM assigns weights to the features of each job category and ranks their relevance based on the highest scores, which are then utilized in the MLTC CNN model. The integration of FDM and MLTC CNN serves as the foundation for generating content-based recommendations derived from key features in job resumes. This study produces a content-based job recommendation system that displays the top three job categories, accompanied by explanations of the skills supporting each selected category. These recommendations also consider cosine similarity values between analyzed items and the integrated results of FDM and MLTC CNN. The application of FDM for feature weighting has been proven to enhance multi-label text classification outcomes by providing better insights during the feature selection process. With a recall of 97.26%, precision of 94.81%, and accuracy of 98.58%, the MLTC model integrating FDM feature selection and CNN demonstrates robust performance characteristics in content-based job recommendation tasks.

Full Text:

PDF

References


M. N. Freire and L. N. de Castro, “e-Recruitment recommender systems: a systematic review,” Knowl Inf Syst, vol. 63, no. 1, pp. 1–20, 2021, doi: 10.1007/s10115-020-01522-8.

Y. Afoudi, M. Lazaar, and M. Al Achhab, “recommendation system,” 2019 International Conference on Wireless Technologies, Embedded and Intelligent Systems (WITS), pp. 1–6, 2019.

C. Gan, Q. Feng, and Z. Zhang, “Scalable multi-channel dilated CNN–BiLSTM model with attention mechanism for Chinese textual sentiment analysis,” Future Generation Computer Systems, vol. 118, pp. 297–309, 2021, doi: 10.1016/j.future.2021.01.024.

K. Florentin and F. Jiechieu, “Skills prediction based on multi-label resume classification using CNN with model predictions explanation,” Neural Comput Appl, vol. 0123456789, 2020, doi: 10.1007/s00521-020-05302-x.

P. Zhang, W. Gao, J. Hu, and Y. Li, “Multi-label feature selection based on the division of label topics,” Inf Sci (N Y), vol. 553, pp. 129–153, 2021, doi: 10.1016/j.ins.2020.12.036.

M. Naumov et al., “Deep learning recommendation model for personalization and recommendation systems,” arXiv preprint arXiv:1906.00091, 2019, doi: https://doi.org/10.48550/arXiv.1906.00091.

Z. Batmaz, A. Yurekli, A. Bilge, and C. Kaleli, “A review on deep learning for recommender systems : challenges and remedies,” Artif Intell Rev, 2018, doi: 10.1007/s10462-018-9654-y.

Y. Kim, “Convolutional Neural Networks for Sentence Classification,” in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014, pp. 1746–1751. doi: 10.3115/v1/D14-1181.

W. Liao, Y. Wang, Y. Yin, X. Zhang, and P. Ma, “Improved sequence generation model for multi-label classification via CNN and initialized fully connection,” Neurocomputing, vol. 382, pp. 188–195, 2020, doi: 10.1016/j.neucom.2019.11.074.

B. Guo, C. Zhang, J. Liu, and X. Ma, “Improving text classification with weighted word embeddings via a multi-channel TextCNN model,” Neurocomputing, vol. 363, no. xxxx, pp. 366–374, 2019, doi: 10.1016/j.neucom.2019.07.052.

Z. Shen and S. Zhang, “A Novel Deep-Learning-Based Model for Medical Text Classification,” in Proceedings of the 2020 9th International Conference on Computing and Pattern Recognition, 2020, pp. 267–273. doi: https://doi.org/10.1145/3436369.3436469.

M. Pota, M. Esposito, G. De Pietro, and H. Fujita, “Best practices of convolutional neural networks for question classification,” Applied Sciences (Switzerland), vol. 10, no. 14, 2020, doi: 10.3390/app10144710.

B. Liu, Y. Zhou, and W. Sun, “Character-level text classification via convolutional neural network and gated recurrent unit,” International Journal of Machine Learning and Cybernetics, vol. 11, no. 8, pp. 1939–1949, 2020, doi: 10.1007/s13042-020-01084-9.

B. Guo, C. Zhang, J. Liu, and X. Ma, “Improving text classification with weighted word embeddings via a multi-channel TextCNN model,” Neurocomputing, vol. 363, pp. 366–374, Oct. 2019, doi: 10.1016/j.neucom.2019.07.052.

J. Xu et al., “Incorporating context-relevant concepts into convolutional neural networks for short text classification,” Neurocomputing, vol. 386, pp. 42–53, Apr. 2020, doi: 10.1016/j.neucom.2019.08.080.

L. Lenc and P. Král, “Deep neural networks for Czech multi-label document classification,” in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Springer Verlag, 2018, pp. 460–471. doi: 10.1007/978-3-319-75487-1_36.

A. Omar, T. M. Mahmoud, T. Abd-El-Hafeez, and A. Mahfouz, “Multi-label Arabic text classification in Online Social Networks,” Inf Syst, vol. 100, Sep. 2021, doi: 10.1016/j.is.2021.101785.

M. Zanker, L. Rook, and D. Jannach, “Measuring the impact of online personalisation: Past, present and future,” International Journal of Human Computer Studies, vol. 131, no. January, pp. 160–168, 2019, doi: 10.1016/j.ijhcs.2019.06.006.

P. Lops, D. Jannach, C. Musto, T. Bogers, and M. Koolen, “Trends in content-based recommendation Preface to the special issue on Recommender systems based on rich item descriptions,” User Model User-adapt Interact, vol. 29, no. 2, pp. 239–249, 2019, doi: 10.1007/s11257-019-09231-w.

P. Lops, D. Jannach, C. Musto, T. Bogers, and M. Koolen, “Trends in content-based recommendation Preface to the special issue on Recommender systems based on rich item descriptions,” User Model User-adapt Interact, vol. 29, no. 2, pp. 239–249, 2019, doi: 10.1007/s11257-019-09231-w.

Z. Weidong, W. Ran, and L. Haitao, “a researcher ’ s background knowledge and research target,” Inf Process Manag, vol. 0, pp. 1–13, 2016, doi: 10.1016/j.ipm.2016.04.004.

D. Wang, Y. Liang, D. Xu, X. Feng, and R. Guan, “Knowledge-Based Systems A content-based recommender system for computer science publications,” Knowl Based Syst, vol. 157, no. May, pp. 1–9, 2018, doi: 10.1016/j.knosys.2018.05.001.

G. van Capelleveen, C. Amrit, D. M. Yazan, and H. Zijm, “The recommender canvas: A model for developing and documenting recommender system design,” Expert Syst Appl, vol. 129, pp. 97–117, 2019, doi: 10.1016/j.eswa.2019.04.001.

Y. Afoudi, M. Lazaar, and M. Al Achhab, “recommendation system,” 2019 International Conference on Wireless Technologies, Embedded and Intelligent Systems (WITS), pp. 1–6, 2019.

H. Setiadi, R. Saptono, R. Anggrainingsih, and R. Andriani, “Recommendation Feature of Scientific Articles on Open Journal System Using Content-based Filtering,” 2019 6th IEEE International Conference on Engineering Technologies and Applied Science, no. 516, 2019.

Y. Kim, “Convolutional Neural Networks for Sentence Classification,” 2014.

Y. Song, Q. V. Hu, and L. He, “P-CNN: Enhancing text matching with positional convolutional neural network,” Knowl Based Syst, vol. 169, pp. 67–79, 2019, doi: 10.1016/j.knosys.2019.01.028.

A. Jacovi, O. S. Shalom, and Y. Goldberg, “Understanding convolutional neural networks for text classification,” arXiv preprint arXiv:1809.08037, 2018.

R. P. Kawiswara and F. Thalib, “Implementasi Algoritma Convolutional Neural Network Pada Algoritma K-Means Untuk Kategorisasi Data Teks,” J Teknol, vol. 7, no. 2, pp. 149–160, 2020.

R. Chatterjee, D. K. Sanyal, and A. Chatterjee, “Fuzzy-Discernibility Matrix-based an Efficient Feature Selection Technique for Improved Motor-Imagery EEG Signal Classification,” bioRxiv, 2021.

A. Tan, J. Liang, W. Z. Wu, J. Zhang, L. Sun, and C. Chen, “Fuzzy rough discrimination and label weighting for multi-label feature selection,” Neurocomputing, vol. 465, pp. 128–140, 2021, doi: 10.1016/j.neucom.2021.09.007.

T. J. Sejnowski, Neural Information Processing Systems. 2019. doi: 10.7551/mitpress/11474.003.0014.

X. Chen, D. Li, P. Wang, and X. Yang, “A Deep Convolutional Neural Network with Fuzzy Rough Sets for FER,” IEEE Access, vol. 8, pp. 2772–2779, 2020, doi: 10.1109/ACCESS.2019.2960769.

W. Qian, C. Xiong, and Y. Wang, “A ranking-based feature selection for multi-label classification with fuzzy relative discernibility,” Applied Soft Computing Journal, vol. 102, p. 106995, 2021, doi: 10.1016/j.asoc.2020.106995.

J. M. Moyano, E. L. Gibaja, K. J. Cios, and S. Ventura, “Review of ensembles of multi-label classifiers: Models, experimental study and prospects,” Information Fusion, vol. 44, pp. 33–45, 2018, doi: 10.1016/j.inffus.2017.12.001.




DOI: https://doi.org/10.31449/inf.v49i13.6848

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.