Laser & Optoelectronics Progress, Volume. 59, Issue 12, 1215011(2022)

Improved XGBoost Stray Current Prediction and Interpretable Model

Zhaoliang Meng1,2,3, Zetao Zhang1、*, Yuan Yang2, Guofeng Li3, Chongbo Tao3, and Yijiang Niu3
Author Affiliations
  • 1College of Electronic Information, Xi’an Polytechnic University, Xi’an 710600, Shaanxi , China
  • 2International Engineering College, Xi’an University of Technology, Xi’an 710048, Shaanxi , China
  • 3Power Electronics Division of CRRC Yongji Motor Co., Ltd., Xi’an 710000, Shaanxi , China
  • show less

    To address the issue that there are many characteristics influencing the stray current of a subway track, the conventional feature selection method affects the prediction accuracy of the model, and the interpretability of the model results is poor, this paper proposes a stray current prediction model based on optimal feature improved eXtreme Gradient Boosting (XGBoost). Using the flexibility and the strong searchability of the genetic algorithm, we found the first M features that minimizing the mean square error (MSE) of the objective function generation by generation in the set containing the original V features. Simultaneously, the stray current prediction model under the optimal feature selection method (OFS-XGBoost) is established. To address the issue that the prediction results of the OFS-XGBoost are good, however, the machine learning black-box model has an insufficient explanatory ability for the prediction results, an attribution analysis framework based on SHAP theory is proposed to show the influence of feature set on the prediction results of the model in an understandable way based on the marginal contribution of stray current feature samples to improve the inference accuracy. The results show that the prediction error of the proposed model is only 1.684%, which is lower than the prediction models such as random forest and back propagation (BP) neural network under the same optimization strategy. The attribution analysis method based on SHAP value explains the impact of input characteristics on stray current prediction results from a global and individual perspective, helping intelligent subway health management based on improving model interpretability.

    Tools

    Get Citation

    Copy Citation Text

    Zhaoliang Meng, Zetao Zhang, Yuan Yang, Guofeng Li, Chongbo Tao, Yijiang Niu. Improved XGBoost Stray Current Prediction and Interpretable Model[J]. Laser & Optoelectronics Progress, 2022, 59(12): 1215011

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Machine Vision

    Received: Jul. 16, 2021

    Accepted: Aug. 17, 2021

    Published Online: May. 23, 2022

    The Author Email: Zhang Zetao (zz667892021@163.com)

    DOI:10.3788/LOP202259.1215011

    Topics