Accurate prediction of the endpoint mass fraction of carbon in the molten steel of RH (Ruhrstahl Heraeus) can effectively improve the quality of continuously cast products. In order to realize this goal, data mining was firstly applied to preprocess the RH industrial data. Then, grey correlation analysis, Spearman correlation coefficient, and random forest out-of-bag error scoring were used to select the features that had a strong correlation with the endpoint mass fraction of carbon in the molten steel. Next, the principal component analysis method was applied to reduce the dimensions. Finally, the XGBoost model, the XGBoost model optimized by the particle swarm optimization algorithm, and the XGBoost model optimized by the whale optimization algorithm were applied to predict the endpoint mass fraction of carbon in the molten steel. The results show that grey correlation analysis is better than Spearman rank correlation coefficient and random forest in analyzing the selected features. After the optimization of the particle swarm optimization algorithm and whale optimization algorithm, the XGBoost model has a greater prediction hit rate. The XGBoost model optimized by the whale optimization algorithm is better than that by the particle swarm optimization algorithm. In the case of the XGBoost model optimized by the whale optimization algorithm, the hit rate reaches 91.26% and 98.97% if the error range is within ±5×10-6, and ±7×10-6.
RaoJiang-ping, YangZhi-zheng, LiGuang-qiang, et al. Theoretical study and process optimization of RH refining for IF steel[J]. Steelmaking, 2022, 38(5): 59-66.
[3]
FengK, XuA J, WuP F, et al. Case-based reasoning model based on attribute weights optimized by genetic algorithm for predicting end temperature of molten steel in RH[J]. Journal of Iron and Steel Research International, 2019, 26(6): 585-592.
[4]
LuW, MaoZ Z, YuanP. Ladle furnace liquid steel temperature prediction model based on optimally pruned bagging[J]. Journal of Iron and Steel Research International, 2012, 19(12): 21-28.
[5]
ChenG J, YangJ, LiY G, et al. Mathematical simulation of decarburization with CO2 injection during RH refining of ultra-low-carbon steel[J]. Metals and Materials International, 2025, 31: 167-181.
[6]
LiY W, LiuB G, PengJ H, et al. Prediction model of microwave calcining of ammonium diuranate using incremental improved back-propagation neural network[J]. Acta Metallugica Sinica(English Letters), 2011, 24(1): 34-42.
[7]
WangS F, TangY, LiX B, et al. Analyses and predictions of rock cuttabilities under different confining stresses and rock properties based on rock indentation tests by conical pick[J]. Transactions of Nonferrous Metals Society of China, 2021, 31(6): 1766-1783.
[8]
LeeE H, KimK, KhoS Y, et al. Estimating express train preference of urban railway passengers based on extreme gradient boosting (XGBoost) using smart card data[J]. Transportation Research Record, 2021, 2675: 64-76.
LiuZhi-ming, ZhanDong-ping, GeQi-zhen, et al. Prediction model of mass fraction of endpoint carbon of electric furnace based on BP neural network[J]. Industrial Heating, 2018, 47(4): 28-31.
ChenChao, NongWei-min, WangNan. Prediction on end-point carbon content and temperature of Consteel electric arc furnace based on machine learning model[J]. Metallurgical Industry Automation, 2023, 47(6): 37-44.
[18]
SunY, BrownM B, PrapopoulouM, et al. The application of stochastic machine learning methods in the prediction of skin penetration[J]. Applied Soft Computing, 2011, 11(2): 2367-2375.
[19]
AchourA, KammounM A, HajejZ. Towards optimizing multi-level selective maintenance via machine learning predictive models[J]. Applied Sciences, 2024, 14(1): 313-318.
[20]
QuZ, GentonM G. Sparse functional boxplots for multivariate curves[J]. Journal of Computational and Graphical Statistics, 2022, 31(4): 976-989.
[21]
ErO, KülekciM K, EsmeU, et al. Multi response optimization of friction stir spot welding process using Taguchi based grey relational analysis[J]. Cukurova University Journal of the Faculty of Engineering, 2021, 36(2): 421-432.
[22]
AydinH, BayramA, EsmeU, et al. Application of grey relation analysis (GRA) and Taguchi method for the parametric optimization of friction stir welding (FSW) process[J]. Materials and Technology, 2010, 44(4): 205-211.
ChaiBao-tang, LeiHong, XuMeng, et al. Predicted temperature of molten steel at the end of RH refining on the base of BP neural network[J]. Steelmaking, 2023, 39(5): 33-40,47.
[25]
GaïffasS, MeradI, YuY Y. WildWood: a new random forest algorithm[J]. IEEE Transactions on Information Theory, 2023, 69(10): 6586-6604.
[26]
TarchouneI, DjebbaA, MerouaniH F, et al. An improved random forest based on feature selection and feature weighting for case retrieval in CBR system application to medical data[J]. International Journal of Software Innovation, 2022, 10(1): 14-16.
[27]
GewersF L, FerreiraG R, ArrudaH F, et al. Principal component analysis: a natural approach to data exploration[J]. ACM Computing Surveys, 2021, 54(4): 1-34.
[28]
IzoninI, TkachenkoR, ShakhovskaN, et al. A two-step data normalization approach for improving classification accuracy in the medical diagnosis domain[J]. Mathematics, 2022, 10(11): 1942-1947.
[29]
LiZ H, QinL, GuoB S, et al. Characterization of the convoluted 3D intermetallic phases in a recycled Al alloy by synchrotron X-ray tomography and machine learning[J]. Acta Metallugica Sinica(English Letters), 2022, 35(1): 115-123.
[30]
ZhengQ, FengB W, LiuZ Y, et al. Application of improved particle swarm optimisation algorithm in hull form optimisation[J]. Journal of Marine Science and Engineering, 2021, 9(9): 955-962.