Research on Control Factors and Parameter Optimization of Surfactant Flooding in Low-Permeability Reservoirs Using Random Forest Algorithm
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
As oil and gas development increasingly targets low and ultra-low permeability reservoirs, conventional recovery techniques often prove insufficient for mobilizing residual oil. Surfactant flooding, a key chemical enhanced oil recovery (EOR) technology, thus requires careful system optimization and mechanistic investigation. This study focuses on low-permeability reservoirs in the Changqing Oilfield, evaluating three surfactant systems—YHS-Z1 (a 7:3 mass ratio blend of hydroxypropyl sulfobetaine and cocamide, YHS-Z2 (a polyether carboxylate, a nonionic-anionic composite) and a middle-phase microemulsion system (Heavy alkylbenzene sulfonate and hydroxysulfobetaine were combined with a mass ratio of 7:3)—through a series of experiments including interfacial tension measurement, contact angle analysis, static and dynamic oil displacement tests, as well as emulsion transport/retention index assessments, to comprehensively characterize their oil displacement properties. Based on the experimental data, five machine learning models—Linear Regression, Support Vector Regression, XGBoost, Gradient Boosting Regression, and Random Forest—were constructed and comparatively analyzed for their predictive performance. The results demonstrate that the Random Forest (RF) model achieved the optimal prediction performance (MAE = 2.8007%, MAPE = 7.4107%, R² = 0.0454). Further analysis using the SHAP algorithm revealed that the retention index is the primary global factor (accounting for 49.79% of the variance), while significant intergroup differences exist in the primary factors across different surfactant systems. Concurrently, single-factor and multi-factor sensitivity analyses were conducted to elucidate synergistic effects and threshold behaviors among parameters. The optimal parameter combination, identified via a random search method, achieved a predicted recovery ratio of 45.61%, representing a 6.57% improvement over the highest experimental value. This study demonstrates that machine learning methods can effectively identify the dominant factors in oil displacement and enable synergistic parameter optimization, thereby providing a theoretical foundation for the efficient development of surfactant flooding in low-permeability reservoirs.