A Hybrid Investment Risk Prediction Framework Integrating ADASYN-RF, CS-SVM, PCA-BP, and ARIMA Models
Abstract
Investment risk is often the result of a long-term interplay among multiple factors, such as market volatility and business performance. Predicting investment risk in advance helps investors avoid losses and protect their assets. However, current prediction methods rely heavily on historical data, leading to poor timeliness and unreliable results. Therefore, this study proposes an investment risk prediction model based on the adaptive comprehensive sampling algorithm and the random forest algorithm to predict investment risks accurately. This model makes full use of the adaptive comprehensive sampling algorithm to balance the categories of risk data, and the random forest captures the risk characteristics to achieve investment risk prediction. CS-SVM is introduced to improve the prediction model and prevent overfitting. Meanwhile, principal component analysis and backpropagation networks are combined to solve the problems of unstructured data processing and time series dependency modeling. In the experiment, the study took historical investment data as the dataset and compared it with the investment risk prediction models constructed by three algorithms: CatBoost-GJO, TS-GA, and GAN-Stacking. Evaluate the prediction accuracy of each method in investment risk prediction, including the Sharpe ratio, value at risk, volatility, win rate and profit and loss. The results show that the prediction accuracy of this model reaches 99.0%, the Sharpe ratio is 1.9, and the maximum drawdown is 5.9%, all superior to the comparison models. Moreover, in the actual investment risk prediction, its volatility and value at risk are only 8.2% and 7.3% respectively, while the winning rate and profit and loss ratio can reach 88.9%, 6:1. These results indicate that the proposed model achieves high accuracy in investment risk prediction, effectively addressing the limitations of existing methods and providing a new approach to improve prediction performance. It contributes to the development of intelligent and efficient investment risk forecasting.
Full Text:
PDFReferences
Qin G, Juan M, Rui M H. Iot-based intelligent power supply management using ensemble learning for seismic observation stations. Informatica, 2025, 49(8). 1-18. DOI: 10.31449/inf.v49i8.6502
Mahmood S A, Hamadi S S. Ensemble machine learning algorithms for predicting thyroid disorders in diabetic patients: a comparative analysis. Informatica, 2025, 49(24). 7-25. DOI: 10.31449/inf.v49i24.8373
Ileberi E, Sun Y. Advancing model performance With ADASYN and recurrent feature elimination and cross-validation in machine learning-assisted credit card fraud detection: A comparative analysis. IEEE access, 2024. 12(1): 133315-133327. DOI: 10.1109/ACCESS.2024.3457922
Aish M A. Predictive modeling of cerebral strokes: an ADASYN-RF approach for imbalanced data. VFAST Transactions on Software Engineering, 2024, 12(4): 12-26. DOI: 10.21015/vtse.v12i4.1932
Zhang M. Enhanced load forecasting for electric vehicle charging stations using a hybrid random forest-convolutional neural network algorithm. Informatica, 2024, 48(19): 89-102. DOI: 10.31449/inf.v48i19.6649
Saiyed A. AI-Driven Innovations in Fintech: Applications, challenges, and future trends. International Journal of Electrical and Computer Engineering Research, 2025, 5(1): 8-15. DOI: 10.53375/ijecer.2025.437
Song M, Ma H, Zhu Y, Zhang M. Credit risk prediction based on improved ADASYN sampling and optimized LightGBM. Journal of Social Computing, 2024, 5(3): 232-241. DOI: 10.23919/JSC.2024.0019.
Salehpour A, Norouzi M, Balafar M A, SamadZamini K. A cloud‐based hybrid intrusion detection framework using XGBoost and ADASYN‐Augmented random forest for IoMT. IET Communications, 2024, 18(19): 1371-1390. DOI: 10.1049/cmu2.12833
Dawei Y, Bing Z, Bingbing G, Xibo G, Razzaghzadeh B. Predicting the CPT-based pile set-up parameters using HHO-RF and PSO-RF hybrid models. Structural Engineering and Mechanics, 2023, 86(5): 673-686. DOI: 10.12989/sem.2023.86.5.673
Ding W, Sun H. Prediction of PM2. 5 concentration based on the weighted RF-LSTM model. Earth Science Informatics, 2023, 16(4): 3023-3037. DOI: 10.1007/s12145-023-01111-7
Jiao Z. Dynamic Financial Distress Prediction Using Combined LASSO and GBDT Algorithms. Informatica, 2024, 48(17): 139-152. DOI: 10.31449/inf.v48il7.6493
Belhadi A, Kamble S S, Mani V, Benkhati I, Touriki F E. An ensemble machine learning approach for forecasting credit risk of agricultural SMEs' investments in agriculture 4.0 through supply chain finance. Annals of Operations Research, 2025, 345(2): 779-807. DOI: 10.1007/s10479-021-04366-9
Li X, Wang J, Yang C. Risk prediction in financial management of listed companies based on optimized BP neural network under digital economy. Neural Computing and Applications, 2023, 35(3): 2045-2058. DOI: 10.1007/s00521-022-07377-0
Doğru A, Buyrukoğlu S, Arı M. A hybrid super ensemble learning model for the early-stage prediction of diabetes risk. Medical & Biological Engineering & Computing, 2023, 61(3): 785-797. DOI: 10.1007/s11517-022-02749-z
Zheng G, Kong L, Brintrup A. Federated machine learning for privacy preserving, collective supply chain risk prediction. International Journal of Production Research, 2023, 61(23): 8115-8132. DOI: 10.1080/00207543.2022.2164628
Rhodes J S, Cutler A, Moon K R. Geometry-and accuracy-preserving random forest proximities. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 45(9): 10947-10959. DOI: 10.1109/TPAMI.2023.3263774
Kurani A, Doshi P, Vakharia A, Shah M. A comprehensive comparative study of artificial neural network (ANN) and support vector machines (SVM) on stock forecasting. Annals of Data Science, 2023, 10(1): 183-208. DOI: 10.1007/s40745-021-00344-x
Cheng J, Xiong Y. Multi-strategy adaptive cuckoo search algorithm for numerical optimization. Artificial Intelligence Review, 2023, 56(3): 2031-2055. DOI: 10.1007/s10462-022-10222-4
Fei R, Guo Y, Li J, Hu B, Yang L. An improved BPNN method based on probability density for indoor location. IEICE TRANSACTIONS on Information and Systems, 2023, 106(5): 773-785. DOI: 10.1587/transinf.2022DLP0073
Marukatat S. Tutorial on PCA and approximate PCA and approximate kernel PCA. Artificial Intelligence Review, 2023, 56(6): 5445-5477. DOI: 10.1007/s10462-022-10297-z
DOI: https://doi.org/10.31449/inf.v49i21.10550
This work is licensed under a Creative Commons Attribution 3.0 License.








