Liu Mochen,Yang Kuankuan,Yan Yinfa,Song Zhanhua,Tian Fuyang, Li Fade,Yu Zhenwei,Rongyao Zhang,Yang Qinglu,Lu Yao
Abstract
Accurate and non-destructive detection of total nitrogen (TN), total phosphorus (TP), and total potassium (TK) levels in soil is crucial for precise soil testing and fertilization in modernized precision agriculture. Traditional methods for soil composition analysis are expensive, time-consuming, and destructive. This research aims to establish a low-cost, high-precision, and non-destructive method for soil nutrient detection based on visible-near-infrared (Vis-NIR) spectroscopy (350–2500 nm) combined with improved machine learning algorithms. The Vis-NIR spectra of soil samples were acquired using the RS-5400 high-resolution ground feature spectrometer. Subsequently, the Monte Carlo sampling cross-validation (MCCV) algorithm was used to eliminate abnormal samples, and then different preprocessing methods were performed on the spectral data including first-derivative (FD), Savitzky-Golay smoothing (SG) and others. The optimal preprocessing method was selected from these options. In order to remove redundant information and increase the speed of calculation, five algorithms such as competitive adaptive reweighted sampling (CARS), iteratively retains informative variables (IRIV) and the variable iterative space shrinkage approach (VISSA)-IRIV algorithm were used to select feature variables. The characteristic wavelengths closely related to TN, TP, and TK in the soil have been extracted. Then, the RBF kernel (radial basis function) and poly kernel were mixed to obtain the RBF-poly hybrid kernel function, and then the hybrid kernel function support vector machine (RBF-poly-SVM) and the radial basis kernel function support vector machine (RBF-SVM) were applied respectively. Establish prediction models and introduce the whale optimization algorithm (WOA) to optimize the g (kernel function parameter), c (penalty factor) and k-rbf (weight coefficient) parameters in the two models. The performance of the developed models was tested using the coefficient of determination (R2), the root mean squared error (RMSE) and the ratio of performance to deviation (RPD). The results demonstrated that among all models, the RBF-poly -SVM modeling methods were superior to the RBF-SVM model. The best results for estimation of TN, TP, and TK elements were achieved by the models of SG-square-FD + IRIV + RBF-poly-SVM (R2C=0.960, R2V=0.902, RPD=3.206), square-FD + IRIV + RBF-poly-SVM (R2C=0.999, R2V=0.937, RPD=3.939), square root + VISSA-IRIV + RBF-poly-SVM (R2C=0.955, R2V=0.904, RPD=2.608), respectively. The findings of the current approach own practical implications for agriculture and environmental management, as they enable more efficient and accurate soil nutrient monitoring and management.
Paper Linkage:https://www.sciencedirect.com/science/article/pii/S0167198725001217