Shear wave travel time (also known as Delta-T Shear and commonly abbreviated as DTS) is an important parameter in petroleum for exploration, production, and characterization of borehole stability. Direct measurement of DTS is often limited by high costs and a constraint of geography, making machine learning (ML) predictive approaches necessary. This study aims to explore the effectiveness of ML models in predicting DTS, emphasizing the importance of data preprocessing techniques to improve prediction accuracy. Preprocessing techniques include Yeo-Johnson transformation to handle non-normality, outlier elimination using z-score, and data smoothing using the Savitzky-Golay filter and median filter. Incorporating smoothing techniques can fill important gaps in some existing studies and may improve the performance of machine learning models in predicting DTS, particularly in situations with limited or noisy data. Four ML models were tested in this study, namely Linear Regression (LR), K-Nearest Neighbors (KNN), Extreme Gradient Boosting (XGBoost), and Random Forest (RF), with performance evaluation based on metrics RMSE (Root Mean Squared Error), MAE (Mean Absolute Error), and R2 (coefficient of determination). The results showed that the RF model produced the best performance with RMSE of 9.41, MAE of 6.35, and R2 of 0.90 in scenarios with Yeo-Johnson transformation, outlier elimination, and smoothing techniques using a median filter with a window size of 5.
                        
                        
                        
                        
                            
                                Copyrights © 2024