Research on HSE Risk Assessment Method for Multi-source Heterogeneous Data Driven by Transformer-FL Framework

Wenlei Sun, Min Yang

Abstract


With the widespread application of multi-source heterogeneous data in HSE risk assessment, how to efficiently integrate different data sources and improve assessment accuracy has become an urgent problem to be solved. This paper proposes an HSE risk assessment method based on Transformer and federated learning, aiming to improve the accuracy of risk prediction through the effective integration of multi-source data. In this method, the Transformer model captures deep correlations in multi-source data through self-attention mechanism, and federated learning ensures cross-device collaborative training while protecting data privacy. Experimental results using a multi-source heterogeneous dataset from the chemical, manufacturing, and energy industries show that the Transformer-FL framework significantly improves risk assessment accuracy. The dataset includes real-time environmental data, accident records, and operation logs. Experiments on high-performance workstations with Nvidia RTX A6000 GPUs and Intel Xeon processors reported accuracy improvements: chemical industry (58.9% to 41.2%), manufacturing (35.6% to 23.4%), and energy industry (50.1% to 36.8%). The Transformer-FL framework has reduced the HSE risk value of traditional methods from 58.9% to 41.2%, indicating a lower risk, while the accuracy of risk assessment has improved by 17.7%. It is important to note that the percentages in this context refer to the risk value, where a lower value signifies reduced risk, and the accuracy improvement refers to the increase in correct predictions. It is important to note that the percentages in this context refer to the risk level; a lower percentage represents a lower residual risk, which indicates improvement. In contrast, accuracy improvements are calculated as the percentage increase in correct predictions. In the manufacturing industry, despite strong data homogeneity, the accuracy rate has increased from 35.6% to 23.4%, demonstrating the advantages of this framework in heterogeneous data environments. The experimental results show that the Transformer-FL framework has significant advantages in different HSE scenarios, especially when the amount of data is large, and the fusion effect far exceeds that of traditional risk assessment methods. Overall, this framework provides an intelligent, efficient and privacy-protected solution for HSE risk assessment, which can meet the dual needs of multi-source heterogeneous data processing and security in the industrial field.


Full Text:

PDF


DOI: https://doi.org/10.31449/inf.v49i28.9994

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.