首页»
最新录用
Petroleum Science > DOI: http://doi.org/10.1016/j.petsci.2025.08.011
A novel fusion of interpretable boosting algorithm and feature selection for predicting casing damage Open Access
文章信息
作者:Juan Li, Mandella Ali M. Fargalla, Wei Yan, Zi-Xu Zhang, Wei Zhang, Zi-Chen Zou, Tang Qing, Tao Yang, Chao-Dong Tan, Guang-Cong Li
作者单位:
投稿时间:
引用方式:Juan Li, Mandella Ali M. Fargalla, Wei Yan, Zi-Xu Zhang, Wei Zhang, Zi-Chen Zou, Tang Qing, Tao Yang, Chao-Dong Tan, Guang-Cong Li, A novel fusion of interpretable boosting algorithm and feature selection for predicting casing damage, Petroleum Science, 2025, http://doi.org/10.1016/j.petsci.2025.08.011.
文章摘要
Abstract: Casing damage resulting from sand production in unconsolidated sandstone reservoirs can significantly impact the average production of oil wells. However, the prediction task remains challenging due to the complex damage mechanism caused by sand production. This paper presents an innovative approach that combines feature selection (FS) with boosting algorithms to accurately predict casing damage in unconsolidated sandstone reservoirs. A novel TriScore FS technique is developed, combining mRMR, Random Forest, and F-test. The approach integrates three distinct feature selection approaches—TriScore, Wrapper, and hybrid TriScore-wrapper and four interpretable Boosting models (AdaBoost, XGBoost, LightGBM, CatBoost). Moreover, shapley additive explanations (SHAP) was used to identify the most significant features across engineering, geological, and production features. The CatBoost model, using the Hybrid TriScore-rapper G1G2 FS method, showed exceptional performance in analyzing data from the Gangxi Oilfield. It achieved the highestaccuracy (95.5%) and recall rate (89.7%) compared to other tested models. Casing service time, casing wall thickness, and perforation density were selected as the top three most important features. This framework enhances predictive robustness and is an effective tool for policymakers and energy analysts, confirming its capability to deliver reliable casing damage forecasts.
关键词
-
Keywords: Casing damage; Machine learning; Feature selection; Sand production; Boosting algorithm