TY - JOUR
T1 - Soil Salinity Mapping of Plowed Agriculture Lands Combining Radar Sentinel-1 and Optical Sentinel-2 with Topographic Data in Machine Learning Models
AU - Tola, Diego
AU - Satgé, Frédéric
AU - Pillco Zolá, Ramiro
AU - Sainz, Humberto
AU - Condori, Bruno
AU - Miranda, Roberto
AU - Yujra, Elizabeth
AU - Molina-Carpio, Jorge
AU - Hostache, Renaud
AU - Espinoza-Villar, Raúl
N1 - Publisher Copyright:
© 2024 by the authors.
PY - 2024/9
Y1 - 2024/9
N2 - This study assesses the relative performance of Sentinel-1 and -2 and their combination with topographic information for plow agricultural land soil salinity mapping. A learning database made of 255 soil samples’ electrical conductivity (EC) along with corresponding radar (R), optical (O), and topographic (T) information derived from Sentinel-2 (S2), Sentinel-1 (S1), and the SRTM digital elevation model, respectively, was used to train four machine learning models (Decision tree—DT, Random Forest—RF, Gradient Boosting—GB, Extreme Gradient Boosting—XGB). Each model was separately trained/validated for four scenarios based on four combinations of R, O, and T (R, O, R+O, R+O+T), with and without feature selection. The Recursive Feature Elimination with k-fold cross validation (RFEcv 10-fold) and the Variance Inflation Factor (VIF) were used for the feature selection process to minimize multicollinearity by selecting the most relevant features. The most reliable salinity estimates are obtained for the R+O+T scenario, considering the feature selection process, with R2 of 0.73, 0.74, 0.75, and 0.76 for DT, GB, RF, and XGB, respectively. Conversely, models based on R information led to unreliable soil salinity estimates due to the saturation of the C-band signal in plowed lands.
AB - This study assesses the relative performance of Sentinel-1 and -2 and their combination with topographic information for plow agricultural land soil salinity mapping. A learning database made of 255 soil samples’ electrical conductivity (EC) along with corresponding radar (R), optical (O), and topographic (T) information derived from Sentinel-2 (S2), Sentinel-1 (S1), and the SRTM digital elevation model, respectively, was used to train four machine learning models (Decision tree—DT, Random Forest—RF, Gradient Boosting—GB, Extreme Gradient Boosting—XGB). Each model was separately trained/validated for four scenarios based on four combinations of R, O, and T (R, O, R+O, R+O+T), with and without feature selection. The Recursive Feature Elimination with k-fold cross validation (RFEcv 10-fold) and the Variance Inflation Factor (VIF) were used for the feature selection process to minimize multicollinearity by selecting the most relevant features. The most reliable salinity estimates are obtained for the R+O+T scenario, considering the feature selection process, with R2 of 0.73, 0.74, 0.75, and 0.76 for DT, GB, RF, and XGB, respectively. Conversely, models based on R information led to unreliable soil salinity estimates due to the saturation of the C-band signal in plowed lands.
KW - machine learning
KW - plowed lands
KW - Sentinel-1
KW - Sentinel-2
KW - soil salinity mapping
UR - http://www.scopus.com/inward/record.url?scp=85205223003&partnerID=8YFLogxK
U2 - 10.3390/rs16183456
DO - 10.3390/rs16183456
M3 - Artículo
AN - SCOPUS:85205223003
SN - 2072-4292
VL - 16
JO - Remote Sensing
JF - Remote Sensing
IS - 18
M1 - 3456
ER -