Next Article in Journal
Multitemporal Field-Based Maize Plant Height Information Extraction and Verification Using Solid-State LiDAR
Previous Article in Journal
Ecology, Cultivation, and Utilization of the Dittany of Crete (Origanum dictamnus L.) from Ancient Times to the Present: A Short Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Improving the Spatiotemporal Transferability of Hyperspectral Remote Sensing for Estimating Soil Organic Matter by Minimizing the Coupling Effect of Soil Physical Properties on the Spectrum: A Case Study in Northeast China

1
College of Biological and Agricultural Engineering, Jilin University, Changchun 130012, China
2
School of Geomatics and Prospecting Engineering, Jilin Jianzhu University, Changchun 130118, China
*
Author to whom correspondence should be addressed.
Agronomy 2024, 14(5), 1067; https://doi.org/10.3390/agronomy14051067
Submission received: 16 April 2024 / Revised: 7 May 2024 / Accepted: 16 May 2024 / Published: 17 May 2024

Abstract

:
Soil organic matter (SOM) is important for the global carbon cycle, and hyperspectral remote sensing has proven to be a promising method for fast SOM content estimation. However, because of the neglect of the spectral response of soil physical properties, the accuracy and spatiotemporal transferability of the SOM prediction model are poor. This study aims to improve the spatiotemporal transferability of the SOM prediction model by alleviating the coupling effect of soil physical properties on spectra. Based on satellite hyperspectral images and soil physical variables, including soil moisture (SM), soil surface roughness (root-mean-square height, RMSH), and soil bulk weight (SBW), a soil spectral correction model was established based on the information unmixing method. Two important grain-producing areas in Northeast China were selected as study areas to verify the performance and transferability of the spectral correction model and SOM content prediction model. The results showed that soil spectral corrections based on fourth-order polynomials and the XG-Boost algorithm had excellent accuracy and generalization ability, with residual predictive deviations (RPDs) exceeding 1.4 in almost all the bands. In addition, when the soil spectral correction strategy was adopted, the accuracy of the SOM prediction model and the generalization ability after the model migration were significantly improved. The SOM prediction accuracy based on the XG-Boost-corrected spectrum was the highest, with a coefficient of determination (R2) of 0.76, a root-mean-square error (RMSE) of 5.74 g/kg, and an RPD of 1.68. The prediction accuracy, R2 value, RMSE, and RPD of the model after the migration were 0.72, 6.71 g/kg, and 1.53, respectively. Compared with the direct migration prediction of the model, adopting the soil spectral correction model based on fourth-order polynomials and XG-Boost reduced the RMSE of the SOM prediction results by 57.90% and 60.27%, respectively. This performance comparison highlighted the advantages for considering soil physical properties in regional-scale SOM predictions.

1. Introduction

With social development and the continuous growth of population and consumption, global food security issues have become increasingly prominent [1]. Accurately monitoring soil conditions is of great significance for ensuring sustainable agricultural development and global food security [2,3,4]. Soil organic matter (SOM) is an important component of soil, which not only provides nutrients for plant growth but also improves soil physical, chemical, and biological conditions [5]. Obtaining the spatial distribution information of the SOM in cultivated land is a key prerequisite for the utilization and management of soil resources and can provide key basic data for agricultural disaster assessment and food security scenario simulations [6,7]. In addition, a large portion of the SOM is composed of organic carbon, which constitutes the largest carbon pool in terrestrial ecosystems [8]. Minor soil carbon pool changes can significantly alter the atmospheric CO2 concentration, thereby affecting global carbon cycling and the climate [9]. Therefore, precisely understanding the SOM content and spatial distribution is crucial for promoting sustainable agricultural development, enhancing the soil carbon sequestration potential, and regulating global climate change [10,11].
Remote sensing is a low-cost, high-accuracy, real-time method for multi-angle, multi-temporal, and large-area Earth observations [12,13,14]. To date, the ability of hyperspectral remote sensing to predict and map SOM contents has been confirmed in many studies [15,16,17]. Imaging spectra are the most important data source, and their characteristic response to the soil chemical composition is an important foundation for hyperspectral remote-sensing-based SOM content prediction [18,19,20]. As an important dyeing material of the soil, SOM can significantly absorb the incident light in the visible-light band, or even the short-wave infrared band, and there is usually a significant negative correlation between its content and the soil spectral reflectance. However, imaging spectra are not affected by the soil composition alone but comprehensively reflect the soil physical properties and chemical composition within the ground sample, and the soil physical properties and chemical composition exert a coupling effect on the response to the spectrum [21,22,23]. Previous studies on SOM prediction lack consideration of soil physical properties, ignoring the spectral response of soil physical properties, such as soil moisture (SM), soil bulk weight (SBW), and surface roughness properties (e.g., root-mean-square height, RMSH), resulting in poor accuracy and poor spatiotemporal transferability of SOM prediction models. Research has shown that the scattering contribution of soil physical properties to spectral reflectance severely affects the sensitivity of hyperspectral data to the SOM content [24]. As the SM content increases continuously until saturation, the soil reflectance decreases first and then increases because of the specular reflection effect [25]. In addition, RMSH increments enhance light scattering and transmission on the soil surface, thus decreasing the reflectivity, especially in the visible and near-infrared wavelength ranges [26]. Meanwhile, long-term high-intensity mechanized planting increases the SBW of cultivated land. The concomitant changes in SM, SBW, and spectral reflectance exhibit a complex relationship. Generally, increases in SM, SBW, and RMSH decrease the spectral reflectance, showing a coupling effect [27,28]. Noteworthily, the effect of the SOM content on the soil spectrum is far weaker than that of soil physical properties [29]. Because of the satellite revisit period and soil physical condition uncertainties, the effect of the noise interference of soil physical properties on the spectrum greatly limits the accuracy and spatiotemporal transferability of the remote-sensing-based SOM evaluation model, which needs to be solved urgently.
Reducing the influence of soil physical properties on hyperspectral data is the key to improving the spatiotemporal transferability of SOM content prediction models. To this end, some scholars have tried to develop satellite hyperspectral image correction methods considering soil physical properties at the pixel scale to reduce the sensitivity of SOM prediction models to spectral differences caused by soil physical property changes [30]. Minasny et al. developed a spectral correction model based on the EOP method to eliminate the influence of the SM [31]. Ou et al. established a spectral correction model for SM removal based on the Kubelka–Munk method to improve the accuracy of SOM predictions [27]. Castaldi et al. synthesized the dry-soil spectrum by calculating the statistical variabilities of dry and wet soils, thereby improving the model prediction accuracy [21]. Although these methods have corrected hyperspectral data to a certain extent, they are mostly based on one physical parameter and ignore the coupling response of different soil physical properties to the spectrum. In particular, in cultivated soil affected by both natural and human factors, the complex physical properties of the soil have significant spatial and temporal heterogeneities and transient variability [20,32,33]. Spectral correction models based on a single soil physical parameter have limited improvements in SOM prediction accuracy and spatiotemporal transferability. To date, because of the scarcity of soil physical data from satellite ground synchronization experiments, the potential of hyperspectral correction methods comprehensively considering multiple soil physical properties has not been fully explored.
Developing a soil spectral correction method by alleviating the coupling effect of surface physical properties on soil pixel spectra is an effective solution to improve the spatiotemporal transferability of the SOM prediction model. Complex interactions between various surface physical properties and electromagnetic waves make it difficult to simulate the relationship between soil physical properties and the spectrum with physical models [27]. Therefore, this study aims to separate the physical and chemical soil information in the spectral data with data-driven methods. Some of the issues that must be addressed in advance include (i) quantifying the effect of soil physical properties on satellite hyperspectral data and (ii) generalizing the model in areas with scarce soil sample data. Studies have indicated various functional relationships between soil physical parameters (SM, RMSH, and SBW) and spectral reflectance, including logarithmic, exponential, and power exponential functions [26,34]. However, most previous studies were based on multispectral data, while the effect of soil physical properties on reflectance remain to be clarified based on hyperspectral data with more continuous and dense bands [35,36]. Although these functional relationships may change slightly because of differences in soil types, components, etc., they generally reflect the spectral characteristics induced by soil physical properties and are an essential basis for representing soil physical property information in spectral data [37,38,39]. Based on this, spectral forward modeling can be carried out using soil physical properties, providing prior data for soil spectral correction [40]. Moreover, to effectively decompose the soil physical and chemical information in the pixel spectrum, nonlinear parameter regression and machine-learning models have been used to simulate the coupling relationship between soil physical property spectra and pixel spectra, respectively. These two data-driven methods search for the rules between data through statistical analysis and machine-learning training, respectively. In this way, unknown data can be predicted, and the generalization ability of the soil spectral correction method can be guaranteed by the regression equation. For the SOM prediction model, the soil spectral correction redistributes the observational information of the original hyperspectral remote sensing to ensure the uniformity of the soil physical properties of all the pixels, which helps to improve the spatiotemporal transferability of the model [41,42,43].
This work seeks to establish a hyperspectral SOM prediction model with high spatiotemporal transferability, which can guide soil investigation and parameter prediction. The objectives of this study are (i) evaluating the impact of soil physical properties on the image pixel spectrum and their contribution to the bias in SOM content prediction, (ii) developing soil spectral correction methods by alleviating the coupling impact of soil physical properties on the satellite spectrum, and (iii) determining the spatiotemporal transferability potential of satellite hyperspectral data for SOM retrieval based on a soil spectral correction strategy. Data-poor regions might benefit from the proposed SOM prediction model with strong spatiotemporal transferability when mapping the SOM to develop appropriate policies.

2. Materials and Methods

This study comprehensively considered the coupling effects of soil physical properties, such as SM, RMSH, and SBW, on satellite hyperspectral images and proposed a soil spectral correction method. The workflow includes data collection, spectral analysis, spectral correction, and modeling evaluation, as shown in Figure 1. First, parameter estimation equations were used to establish empirical relationships between satellite hyperspectral data and the three main soil physical parameters: SM, RMSH, and SBW. The spectral data of the soil physical properties were obtained by spectral forward simulation using soil physical parameters. Then, a soil pixel spectral correction model was constructed based on the simulated spectrum, soil pixel spectrum, and ground spectrum using multi-order polynomials and various machine-learning models to remove the effects of the SM, RMSH, and SBW on the image spectrum. Finally, the SOM prediction model was constructed using XG-Boost, and the performance of the soil spectral correction model and its improvement effect on the SOM prediction accuracy were evaluated at two sites.

2.1. Study Area

Site 1 is in the protected black soil cultivated land of Heilongjiang Province, Northeast China (131°30′–132°03′ E, 46°36′–46°49′ N), as shown in Figure 2, which has an area of 1095 km2. This region features a temperate continental monsoon climate, with an annual precipitation of 450–650 mm. The precipitation is mainly concentrated in June–September, accounting for 80% of the annual total. The terrain of the study area is high in the south and low in the north, higher in the west than in the east, and most of the area is an accumulation plain [44]. This research area is in one of only four black soil areas in the world, characterized by a deep cultivated layer and fertile soil. The thickness of the soil layer containing humus is 25–80 cm, suitable for planting crops such as corn and soybeans [45].
Site 2 is in the protected black soil cultivated land of Jilin Province, China (125°24′–125°43′ E, 44°36′–44°46′ N), as shown in Figure 2, which has an area of 713 km2. Its terrain is flat, with an elevation between 189 and 237 m. This region is a transitional zone between the humid mountainous area in the east and the semi-arid plain area in the west. This study area features a temperate continental semi-humid monsoon climate, with an average annual temperature of 4.6 °C and an annual precipitation of 600–700 mm. This region has rich river systems, relatively abundant agricultural water resources, and strong SM spatial heterogeneity. The soil in this region is mainly phaeozems with a 0.6–1.0 m humus-containing layer [46,47]. Site 2 has a significantly different soil type, surface characteristics, and other environmental factors than Site 1, which can verify the spatiotemporal transferability of the SOM content prediction model in this study.

2.2. Datasets

2.2.1. Soil Sample Collection and Treatment

A total of 104 surface soil samples were collected from Site 1 between 29 and 30 October 2022 (Figure 2b). Among them, 80 soil samples were used as the training set for the models, and the remaining 24 samples were used as the validation set. Between 14 and 15 April 2023, 40 surface soil samples were collected from Site 2 (Figure 2c) for testing the spatiotemporal transferability of the models. First, a 3D laser scanner (Trimble TX8, Trimble, CA, USA; scanning speed: 1 million points per second) was installed at the midpoint of each edge in the quadrant to scan the soil surface structure (Figure 3). Next, nine subsamples were collected with a ring knife (depth: 5 cm and volume: 200 mL) in each 30 × 30 m quadrant. The real-time kinematic (RTK) survey technique was used to record the longitude and latitude of the quadrant’s midpoint.
After the samples were transported to the laboratory, the SM and SBW of the nine subsamples from each quadrant were obtained through weighing and drying, and the average of the subsamples was calculated. Then, the nine subsamples were mixed into a composite sample for laboratory spectral measurements (ASD FieldSpec 4 spectrometer, ASD Specialty Healthcare, LLC, Sioux Falls, SD, USA) (taking the average of 10 measurements) and SOM content determination using the potassium dichromate heating method [48]. To ensure the same SBW of each sample, soil samples were loaded in a disposable culture dish for spectral measurements. The soil surface point cloud data of each measurement site were spliced, cropped, and filtered. The processed point cloud data were used to establish a three-dimensional relative coordinate system (Figure 3b); the Z coordinates of all the point cloud data were extracted, and the RMSH of the quadrant was calculated.

2.2.2. Image Acquisition and Treatment

The Ziyuan-1-02D (ZY1-02D) hyperspectral image data were acquired from the Aerospace Information Research Institute at the Chinese Academy of Sciences. The image generation time is synchronized with the soil-sampling time. All the images have less than 1% cloud coverage. The technical parameters of the ZY1-02D satellite hyperspectral camera are listed in Table 1. The sensor exhibits strong noises in wavelength ranges from 400 to 450 nm and from 2460 to 2500 nm and is affected by atmospheric water vapor absorption in wavelength ranges from 1290 to 1408 nm and from 1828 to 1963 nm [49]. Therefore, the 450 to 1290 nm, 1408 to 1828 nm, and 1963 to 2460 nm bands were selected as the spectral bands in this study. The images were subjected to stripe removal, geometric correction, and atmospheric correction. To correct the bidirectional reflectance distribution function effect of the image, the zenith angle and azimuth angle of the sun (and satellite) are calculated, and the image is corrected by the kernel-driven bidirectional reflectance distribution function model [50].

2.3. Spectral Correction Strategy

The image pixel spectrum comprehensively reflects the soil physical properties (e.g., SM, RMSH, and SBW) and chemical composition within the ground quadrant. Spectral correction aims to separate reflection features attributed to physical and chemical properties of the soil in pixel spectral data, thus alleviating the coupling effect of the soil physical properties on the spectrum. First, linear, exponential, power exponential, and logarithmic parameter estimation equations were used to establish empirical relationships between satellite hyperspectral data and the three soil physical parameters, SM, RMSH, and SBW, on a band-by-band basis. These parameter estimation methods for fitting the relationship between the soil physical properties and spectral reflectance have been verified in several studies [34,36].
Parameter estimation equations were used to establish empirical relationships between satellite hyperspectral data and the three main soil physical parameters: SM, RMSH, and SBW. The spectral data of the soil physical properties were obtained by spectral forward simulation using soil physical parameters. The soil ground spectrum, measured with dried and ground soil samples, is regarded as a “pure spectrum” that only reflects the soil’s chemical composition information [51]. Based on this, a spectral correction model was constructed, which took the pixel spectrum and three sets of soil physical parametric simulated spectra as input and laboratory-measured spectra as training targets. Through multi-order polynomials and various machine-learning algorithms, correction relations between the pixel spectrum and the ground spectrum were established to strip the reflection information attributed to soil physical properties in the pixel spectrum. The multi-order polynomial equation is as follows:
R G = i = 1 n ( a i R S M + b i R R M S H + c i R S B W + d i R P ) i + e
where R G is the laboratory-measured soil reflectance, R S M is the soil reflectance simulated based on SM, R R M S H is the soil reflectance simulated based on RMSH, R S B W is the soil reflectance simulated based on SBW, R P is the soil reflectance of the image pixel, i is the polynomial order, and a i , b i , c i , d i , and e are regression coefficients.

2.4. Machine-Learning Models

2.4.1. Competitive Adaptive Reweighted Sampling (CARS)

CARS is adopted to extract sensitive bands corresponding to SOM in the hyperspectral data. CARS imitates the “survival of the fittest” principle of Darwin’s evolutionary theory. Through adaptive weighted sampling, it screens out the wavelengths with large absolute coefficients of the PLS model and removes the wavelengths with small weights, thus obtaining many subsets of wavelength variables. Next, the subset of wavelengths with the lowest root-mean-square error is selected via cross-validation as the optimal subset [42,52].

2.4.2. eXtreme Gradient Boosting (XG-Boost)

XG-Boost is an ensemble-learning model based on the boosting strategy, which combines several CART trees into a strong learner. As an ensemble algorithm framework, it supports the parallel gradient lifting of the base learner, thus greatly improving the model-training speed. The Newton method is used to solve the extreme value of the loss function, which is expanded to the second order using the Taylor formula. The loss function is optimized with the first-order gradient function and second-order gradient function to reduce the model complexity [53]. Simultaneously, the probability for over-fitting is reduced through regularization, significantly improving the model’s generalization ability.

2.4.3. Model Validation

In this study, the coefficient of determination (R2), root-mean-square error (RMSE), and residual predictive deviation (RPD) were selected as evaluation indices, as expressed below:
R 2 = 1 i = 1 n ( y i Y i ) 2 / i = 1 n ( y i y ¯ ) 2
R M S E = 1 n i = 1 n ( y i Y i ) 2
R P D = S D / R M S E
where n is the number of samples, yi is the measured value of sample i, Yi is the predicted value of sample i, and y ¯ is the average value of the measurements.

3. Results

3.1. Descriptive Statistics of Soil Parameters

The basic statistical analysis results of the soil parameters are listed in Table 2. At Site 1, the mean SM, RMSH, and SBW values were 0.25 cm3/cm3, 2.49 cm, and 0.98 g/cm3, respectively, with standard deviations (SDs) of 0.08 cm3/cm3, 0.77 cm, and 0.15 g/cm3. The SOM content varied significantly from 25.84 to 75.97 g/kg, with an SD of 10.51 g/kg and a coefficient of variation (CV) of 24.30%. Site 2 had significantly different soil properties from Site 1. The average SM, RMSH, and SBW were 0.37 cm3/cm3, 3.65 cm, and 1.13 g/cm3, respectively, which were significantly higher than those at Site 1 and had stronger variabilities. The SOM content at Site 2 ranged from 27.40 to 72.97 g/kg, averaging 41.57 g/kg, which was lower than that at Site 1.

3.2. Effect of Soil Physical Properties on Soil Spectra

To verify the reliability of the ZY1-02D hyperspectral image, the soil pixel spectrum was compared with the soil ground spectrum (Figure 4). Although the soil pixel spectrum has a similar shape to that of the soil ground spectrum, it has some noise and a relatively low smoothness, especially in the VNIR wavelength range. In addition, the spectral reflectance in the soil pixels was slightly lower than that measured in the laboratory. The Spearman correlation coefficients (SCCs) and Pearson correlation coefficients (PCCs) between the pixel reflectance and ground reflectance were calculated. The results showed that the PCCs were below 0.5 at most wavelengths, while the SCCs in the 480 to 680 nm and 2000 to 2500 nm wavelength ranges were basically greater than 0.5, indicating a possible nonlinear relationship. To reveal the factors affecting the pixel spectrum, the differences in the soil reflectance between different physical property gradients were compared. With the increase in SM, the soil spectral reflectance decreased significantly, especially in the 500 to 1300 nm and 1450 to 1700 nm wavelength ranges (Figure 5). The soil spectral reflectance decreased relatively slightly with the increase in SBW. The effect of the RMSH on the soil spectrum was the most significant, and the reflectance decreased significantly with the increase in RMSH. In summary, the coupling effect of the SM, SBW, and RMSH on the spectrum is an important reason for the deviation in the two sets of spectral data, which severely limits the acquisition of the soil’s “pure spectrum” by the imaging spectrometer. Therefore, it is necessary to separate the soil physical and chemical information in the pixel spectral data and improve the SOM prediction accuracy of hyperspectral remote sensing.

3.3. Empirical Relationship between Satellite Hyperspectral Image and Soil Physical Properties

The empirical coefficients were regressed based on the field data and soil pixel spectrum to determine the relationship of the soil reflectance with SM, RMSH, and SBW (Figure 6). Among the fitting equations between the SM and soil reflectance, the exponential equation has the best fitting effect. Except for the 2000 to 2500 nm wavelength range, the fitting results were good, with R2 ranging from 0.49 to 0.68. In the 2000 to 2500 nm wavelength range, the fitting between the SM and reflectance was not good, possibly because of the effect of the absorption of clay minerals on the spectral characteristics, caused by the hydroxyl groups in the soil. With higher clay mineral contents, the water retention capacity of the soil was greater. According to the fitting results between the SBW and soil reflectance, the exponential equation fitted the best in the 450 to 1800 nm wavelength range, with R2 ranging from 0.50 to 0.69, while the power exponential equation fitted the best in the 2000 to 2500 nm wavelength range. In terms of the entire wavelength range, RMSH had the strongest fitting relationship with the soil reflectance among the three groups of soil physical parameters, implying its most significant effect on the soil spectrum. Among the four equations, the logarithmic equation had the best fitting effect, with R2 ranging from 0.55 to 0.69. In general, the best empirical relationship of the soil reflectance with SM is exponential, that with RMSH is logarithmic, and that with SBW is exponential in the 450 to 1800 nm wavelength range and power exponential in the 2000 to 2500 nm wavelength range. Three sets of soil reflectance data were simulated based on the empirical relationship between soil physical parameters and soil spectra (Figure 7). The soil reflectance simulated based on SM showed an almost uniform trend between 2000 and 2500 nm, implying that the effect of the SM on the reflectance in this wavelength range was suppressed by other factors, resulting in insignificant spectral features. Other than that, the remaining simulated soil spectra showed significant differences.

3.4. Modeling of Soil Spectral Correction

Empirical coefficient models and machine-learning models were employed to establish the correction relationship between the soil pixel spectrum and soil “pure spectrum”. The multi-order polynomial-based soil spectral correction model showed improved accuracy with the increasing order, and its RPD and RMSE were optimal in all the bands at the fourth order (Figure 8). When the order reached the fifth order, R2 still maintained an increasing trend, but RPD decreased significantly, indicating that the model was over-fitting. The RMSE of the fourth-order polynomial model was less than 0.04, and the RPD was above 1.5, indicating good correction accuracy.
Four machine-learning algorithms, namely, support vector machine regression (SVR), an extreme learning machine (ELM), a backpropagation neural network (BPNN), and XG-Boost, were used to construct soil spectral correction models in the same way. The best soil spectral correction model was determined by comparing the mapping ability of the different machine-learning algorithms to the coupling relationship between multiple soil spectra (Figure 9). The correction results showed that the four machine-learning algorithms differed significantly in the soil spectral correction accuracy. The accuracy fluctuations around the 1000 nm wavelength may be caused by other noise in the spectral data. Other than that, all the soil spectral correction results were good, and the overall accuracy was high relative to that of the polynomial-based models. As a representative algorithm of the ensemble-learning model, XG-Boost achieved the best spectral correction results, with R2 above 0.6 and RPD above 1.6 for all the bands. The second best was ELM, while SVM and BPNN performed poorly.
The soil spectral correction accuracy based on the fourth-order-polynomial model and XG-Boost model was high, and these two sets of spectral data were selected for in-depth analysis. The results showed that the corrected spectrum based on XG-Boost was smoother than the corrected spectrum based on the fourth-order polynomial and fit the shape of the soil’s “pure spectrum” more closely (Figure 10). The calculated correlation coefficients between the soil’s “pure spectrum” and the corrected spectrum showed that the PCCs at most wavelengths were above 0.8, which was greatly improved compared with the correlation between the soil’s “pure spectrum” and the original pixel spectrum. Therefore, after the spectral correction, the spectral response, induced by soil physical properties, in the soil pixel spectrum was alleviated, and the proportion of the information on the soil’s chemical-composition-response signals in the image spectral data was significantly increased. In general, the correction results of the XG-Boost model are slightly better than those of the fourth-order-polynomial model. However, the accuracy improvement effect of these methods on hyperspectral SOM prediction needs further analysis through modeling.

3.5. SOM Content Prediction Accuracy

Four types of soil spectral data, namely, a pixel spectrum, a fourth-order-polynomial-corrected spectrum, an XG-Boost-corrected spectrum, and a ground-based spectrum, were used to establish the SOM content prediction models. To reduce the data dimensionality and improve the computational efficiency of the model, the spectral data were first subjected to feature extraction. Pearson’s correlation coefficient threshold was used to determine the sensitive bands of the SOM. The correlation coefficient distribution between the four sets of soil spectral data and SOM content showed a relatively consistent trend. Specifically, the correlation coefficient decreased with the increasing wavelength before the 800 nm wavelength and increased after the 800 nm wavelength (Figure 11a). The bands with absolute correlation coefficients above 0.5 were selected as the sensitive bands of the SOM. The sensitive spectral bands corresponding to the SOM in the four spectral datasets of the pixel spectrum, fourth-order-polynomial-corrected spectrum, XG-Boost-corrected spectrum, and ground-based spectrum were concentrated in wavelength ranges from 628 to 1023 nm, from 524 to 1223 nm, from 542 to 1560 nm, and from 550 to 1762 nm, respectively. CARS was adopted to further extract the optimal subset of features containing the least-redundant information in the sensitive bands. The optimal number of CARS iterations was determined by the RMSECV of the multiple regression (Figure 11b). The bands listed in Table 3 are the spectral bands selected by CARS for SOM inversion modeling and validation analysis.
The four sets of spectral bands selected through CARS and the SOM contents were used as the input data of the model. The XG-Boost algorithm is used to construct the SOM prediction model. The results indicated that these two spectral correction methods significantly improved the SOM prediction accuracy based on the original pixel spectrum. Among all the SOM prediction results, the ground spectral data had the highest prediction accuracy, with R P 2 , RMSEP, and RPD values of 0.79, 4.89 g/kg, and 1.97, respectively (Figure 12). The prediction accuracy evaluated based on R P 2 was 0.64 when the original pixel spectral dataset was used as model input data. Adopting the soil spectral correction strategy based on fourth-order polynomials increased the prediction accuracy ( R P 2 ) by 0.05, decreased RMSEP by 2.28 g/kg, and increased RPD by 0.38. The soil spectral correction strategy based on the XG-Boost model had a greater SOM prediction accuracy improvement, with an R P 2 increase of 0.12, an RMSEP decrease of 3.10 g/kg, and an RPD increase of 0.59. The SOM prediction accuracy with the corrected spectrum came close to that with the ground spectrum, implying that the spectral correction model can effectively improve the hyperspectral SOM prediction accuracy.

4. Discussion

4.1. The Spatiotemporal Transferability of the Soil Spectral Correction Model and SOM Prediction Model

The soil pixel spectral correction methods based on empirical coefficient and machine-learning models provided new strategies for improving the SOM prediction accuracy based on hyperspectral images. The soil spectral correction method based on XG-Boost significantly affected the SOM prediction accuracy improvement, but its correction process and principles were difficult to express mathematically. Despite its weak SOM prediction accuracy improvement, the soil spectral correction method based on the fourth-order-polynomial model expressed the improved method with the coefficient equation, which was more conducive to its promotion. The high transferability of the soil spectral correction method is a key prerequisite for constructing an SOM prediction model with strong generalization ability [54]. To verify their spatiotemporal transferability, 40 groups of Site 2 soil pixel spectra and ground experimental data were imported into the two spectral correction models. The spectral correction results showed that the corrected soil spectra were very consistent with the shape of the soil’s “pure spectra” (Figure 13). Compared with the soil spectrum after the fourth-order-polynomial correction, the soil spectrum corrected with the XG-Boost model is smoother. According to the accuracy of the model migration test results, the soil spectral correction model based on XG-Boost showed better migration performance, with RPD above 1.4 for all the bands. The machine-learning algorithms represented by XG-Boost were much better than the coefficient models in terms of the calculation ability to establish the coupling relationship between multiple soil spectra and its adaptability to new data. The reason is that this ensemble-learning model comprehensively utilizes all the eigenvalues of each soil sample point and continuously adjusts the weight of the tree through iteration to explore the optimal solution of the coupling relationship between the soil physical properties and pixel spectrum [43,55].
The poor transferability of SOM remote-sensing prediction models is mainly attributed to their poor applicability to different spatiotemporal spectral data [56,57]. To evaluate the improvement effect of the soil spectral correction methods on the spatiotemporal transferability of SOM prediction models, the SOM prediction model established using Site 1 data was used to predict the SOM content at Site 2. The SOM prediction model based on ground spectra exhibited the best transferability, with RMSEP only increasing by 0.39 g/kg (Figure 14). However, the transferability of SOM prediction models based on the original pixel spectrum is extremely poor, as surface physical property changes cause spectral reflectance deviations. After the transferability verification, RMSEP increased by 8.05 g/kg, while RDP decreased by 44.04%. Adopting the two soil spectral correction strategies significantly improved the transferability of the prediction model, with an RPD of over 1.4 for the model transferability validation. The SOM prediction model based on the XG-Boost-corrected spectrum had greater transferability. Compared with the model transferability validation based on the pixel spectrum, RMSEP was reduced by 60.27%, and RPD was increased by 150.82%. These findings proved the effectiveness of the soil spectral correction methods and the feasibility of the corrected satellite hyperspectral data in SOM content predictions. The SOM prediction model based on corrected satellite hyperspectral data can be used even at two sites with different soil types, soil physical properties, SOM contents, and spatiotemporal features. The core of this soil spectral correction method is to comprehensively consider the coupling effect of various soil properties on spectral reflectance to restore the true spectral characteristics of the research target. For different research objectives, the main factors affecting the spectral response of the target can be analyzed according to the actual environment and imaging conditions [35]. Therefore, the proposed method is not limited to SOM predictions and can provide valuable insights for soil property predictions based on satellite hyperspectral data.

4.2. Contribution of Soil Physical Properties to SOM Content Prediction Bias

The soil spectral correction method proved to greatly improve the SOM content prediction accuracy and spatiotemporal transferability of the pixel spectrum. In other words, soil properties (SM, RMSH, and SBW) may be the main factors leading to errors in SOM estimation based on original pixel spectra. This section investigated the error dependence of the original pixel spectrum and two sets of corrected spectral data on SM, RMSH, and SBW, and their contribution to the SOM content prediction bias was estimated through the stepwise regression method. The results showed that the cumulative deviation contribution rate of these three soil properties to the SOM prediction results based on the original pixel spectrum was over 70% at both sites (Figure 15). Thus, soil physical properties are the main error source of SOM predictions [50]. The contribution of the SM to the SOM bias was the highest, followed by RMSH and SBW. This is related to the most significant response of the pixel spectrum to SM within the sensitive wavelength range of the SOM and possibly the stronger spatial heterogeneity of the SM compared to RMSH and SBW. Adopting the spectral correction strategy significantly reduced the bias of the SM, RMSH, and SBW in SOM predictions. The soil spectral correction based on XG-Boost more significantly reduced the SOM prediction bias caused by soil physical properties than the soil spectral correction based on fourth-order polynomials. Despite the increased relative contribution of random errors, the total bias in terms of the accuracy of the predicted results was significantly reduced. Therefore, the spectral correction strategy did not introduce more error sources but only improved the relative contribution of other error factors to the SOM prediction bias, such as hyperspectral-image-processing uncertainty and field data acquisition uncertainty [58,59]. Judging from the relative contribution rate of the soil physical properties to the SOM prediction deviation before and after the spectral correction, the contribution rate of the SM to the SOM prediction bias decreased the most, by over 10%, followed by RMSH. The spectral correction method based on polynomials and XG-Boost reduced the average relative contribution rate of the RMSH at the two sites by 10% and 14.5%, respectively. Although the declined contribution rate of the SBW to the SOM prediction bias was the smallest, the reduction exceeded 6%. By comparing the improvement effects of different input variables on the SOM prediction bias, the improvement effects of soil physical properties on the SOM content prediction accuracy ranked as SM > RMSH > SBW, consistent with the order of the spatial heterogeneity of these three soil physical properties and the order of the sensitivity of soil spectra to them in the VNIR range. Thus, soil physical properties with strong spatial heterogeneity and a sensitive spectral response should be prioritized in soil spectral correction.

4.3. The Potential and Limitations of the Soil Spectral Correction Model

As soil spectral correction methods are designed to address the coupled effects of surface physical properties on hyperspectral images, they are suitable for the remote-sensing image processing of various soil chemical composition predictions. Such methods suppress the sensitivity of spectral data to SM, RMSH, and SBW and reduce the possibility of SOM prediction results falling into the local optimum. Another advantage is that determining the empirical relationship of SM, RMSH, and SBW with the hyperspectral soil reflectance of ZY1-02D satellite spectra improves the generalization ability and application efficiency of the method. In addition, this study has great potential application value and is expected to realize the combined application of optical and radar remote sensing in the estimation of soil physical and chemical properties. Although the soil spectral correction model can restore most soil “pure spectrum” characteristics, some uncertainties may remain. The applicability of the proposed method to airborne hyperspectral sensors or other hyperspectral satellites requires evaluation with more data. In addition, this experiment only considered the influence of soil properties within 5 cm of the surface layer on the spectrum, while the spectrum may have different sensing depths for different soil properties [60]. Even though the spectrum only directly detects SM changes in shallow soil (from about 0 to 2 cm), this depth also changes under different SBW and RMSH conditions [20,61]. The vertical heterogeneity of the SM and SBW may be the main factor causing soil spectral correction errors. Adopting a hierarchical strategy to establish spectral correction models for soil properties at different depths or assigning different weights to soil physical properties at different depths may maximize the effectiveness of the model within the specified depth and range.

4.4. Future Work and Suggested Next Steps

This study, for the first time, used the “pure spectrum” derived from soil spectral correction, considering SM, RMSH, and SBW, for SOM content prediction and confirmed its excellent spatiotemporal transferability. The strategy for alleviating the influence of soil physical properties on spectral coupling may provide a paradigm for remote-sensing-based soil element content predictions in the future. The soil spectral correction of the whole hyperspectral image requires real-time high-resolution soil physical parameter data at the regional scale. The strong sensitivity of synthetic aperture radar remote sensing to soil physical parameters makes it possible to obtain rich information on soil physical parameters in practical applications [62]. Future research may combine the advantages of hyperspectral imaging and radar remote sensing to improve the prediction accuracy of soil physicochemical parameters [63]. Given the type and dimensional heterogeneity of optical and radar data, a new link to combine these two data types was provided. Meanwhile, some interesting new directions have also emerged. Because of the difference in the imaging time between radar and hyperspectral images, the corresponding surface physical properties also change, especially SM, which is highly susceptible to weather. Therefore, combining optical and radar data to eliminate the bias due to the temporal phase may be the optimal strategy to solve this problem. In addition, the depth perception of optical and radar remote sensing for soil properties needs further clarification. As radar sensors have better penetrability than optical sensors, quantifying the vertical heterogeneity of SM and SBW with radar remote sensing could be the key to further improving the soil spectral correction accuracy [64,65]. These strategies reduce errors in large-scale soil physical and chemical parameter surveys to support regional strategic arrangements for sustainable agricultural development.

5. Conclusions

This study utilized satellite and ground hyperspectral data and soil physical parameter data to construct two soil spectral correction models based on fourth-order polynomials and XG-Boost, respectively, to alleviate the coupling effect of soil physical properties on the pixel spectrum. The performance of the soil spectral correction models and their influence on the accuracy and spatiotemporal transferability of the SOM prediction model were evaluated using data from two sites. The main conclusions are as follows. (1) The soil pixel spectral reflectance is nonlinearly related to the soil ground spectral reflectance. The difference in the surface physical properties is the main factor for the deviation in these two spectral data types. RMSH has the most significant effect on the soil pixel spectrum, followed by SM and SBW. (2) The fourth-order-polynomial and XG-Boost models have good soil spectral correction accuracies. The soil spectral correction model based on XG-Boost has higher accuracy and stronger spatiotemporal transferability, as it considers all the features to continuously adjust the weight of the tree and prevent the result from falling into the local optimum. (3) Soil spectral correction significantly alleviates the coupling effect of soil physical properties on soil pixel spectra, effectively improves the accuracy of the SOM prediction model, and, more importantly, greatly enhances the spatiotemporal transferability of the SOM prediction model based on the pixel spectrum. In the future, more accurate SOM prediction results can be obtained by fully considering more soil properties. This work provides a new research paradigm for predicting soil property parameters in other regions.

Author Contributions

Conceptualization, N.L. and Y.S.; methodology, R.J.; software, H.Y.; validation, X.Z.; formal analysis, R.J.; investigation, N.L. and R.J.; resources, R.J.; data curation, N.L., R.J. and X.Z.; writing—original draft preparation, R.J. and Y.S.; writing—review and editing, R.J. and Y.S.; visualization, N.L. and Y.S.; supervision, R.J.; project administration, R.J.; funding acquisition, N.L., H.Y. and X.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Jilin Provincial Scientific and Technological Development Program (20230101373JC and 20240303035NC), the 14th Five-Year National Key Research and Development Plan of China (2022YFD1500504), and the Common Application Support Platform for Land Observation Satellites of China’s Civil Space Infrastructure Project, China (CASPLOS-CCSI).

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Luo, C.; Zhang, W.; Zhang, X.; Liu, H. Mapping the soil organic matter content in a typical black-soil area using optical data, radar data and environmental covariates. Soil Tillage Res. 2024, 235, 105912. [Google Scholar] [CrossRef]
  2. Sun, W.; Liu, S.; Zhang, X.; Li, Y. Estimation of soil organic matter content using selected spectral subset of hyperspectral data. Geoderma 2022, 409, 115653. [Google Scholar] [CrossRef]
  3. Mirzaee, S.; Ghorbani-Dashtaki, S.; Mohammadi, J.; Asadi, H.; Asadzadeh, F. Spatial variability of soil organic matter using remote sensing data. Catena 2016, 145, 118–127. [Google Scholar] [CrossRef]
  4. Obade, V.d.P.; Lal, R. Assessing land cover and soil quality by remote sensing and geographical information systems (GIS). Catena 2013, 104, 77–92. [Google Scholar] [CrossRef]
  5. Lehmann, J.; Kleber, M. The contentious nature of soil organic matter. Nature 2015, 528, 60–68. [Google Scholar] [CrossRef] [PubMed]
  6. Wang, J.; He, T.; Lv, C.; Chen, Y.; Jian, W. Mapping soil organic matter based on land degradation spectral response units using Hyperion images. Int. J. Appl. Earth Obs. Geoinf. 2010, 12, S171–S180. [Google Scholar] [CrossRef]
  7. Wu, J.; Jin, S.; Zhu, G.; Guo, J. Monitoring of Cropland Abandonment Based on Long Time Series Remote Sensing Data: A Case Study of Fujian Province, China. Agronomy 2023, 13, 1585. [Google Scholar] [CrossRef]
  8. He, X.; Yang, L.; Li, A.; Zhang, L.; Shen, F.; Cai, Y.; Zhou, C. Soil organic carbon prediction using phenological parameters and remote sensing variables generated from Sentinel-2 images. Catena 2021, 205, 105442. [Google Scholar] [CrossRef]
  9. Crowther, T.W.; van den Hoogen, J.; Wan, J.; Mayes, M.A.; Keiser, A.D.; Mo, L.; Averill, C.; Maynard, D.S. The global soil community and its influence on biogeochemistry. Science 2019, 365, eaav0550. [Google Scholar] [CrossRef]
  10. Rahmani, S.R.; Ackerson, J.P.; Schulze, D.; Adhikari, K.; Libohova, Z. Digital Mapping of Soil Organic Matter and Cation Exchange Capacity in a Low Relief Landscape Using LiDAR Data. Agronomy 2022, 12, 1338. [Google Scholar] [CrossRef]
  11. Schuster, J.; Hagn, L.; Mittermayer, M.; Maidl, F.-X.; Huelsbergen, K.-J. Using Remote and Proximal Sensing in Organic Agriculture to Assess Yield and Environmental Performance. Agronomy 2023, 13, 1868. [Google Scholar] [CrossRef]
  12. Angelopoulou, T.; Tziolas, N.; Balafoutis, A.; Zalidis, G.; Bochtis, D. Remote Sensing Techniques for Soil Organic Carbon Estimation: A Review. Remote Sens. 2019, 11, 676. [Google Scholar] [CrossRef]
  13. Li, T.; Mu, T.; Liu, G.; Yang, X.; Zhu, G.; Shang, C. A Method of Soil Moisture Content Estimation at Various Soil Organic Matter Conditions Based on Soil Reflectance. Remote Sens. 2022, 14, 2411. [Google Scholar] [CrossRef]
  14. Wang, S.; Guan, K.; Zhang, C.; Lee, D.; Margenot, A.J.; Ge, Y.; Peng, J.; Zhou, W.; Zhou, Q.; Huang, Y. Using soil library hyperspectral reflectance and machine learning to predict soil organic carbon: Assessing potential of airborne and spaceborne optical soil sensing. Remote Sens. Environ. 2022, 271, 112914. [Google Scholar] [CrossRef]
  15. Suleymanov, A.; Gabbasova, I.; Komissarov, M.; Suleymanov, R.; Garipov, T.; Tuktarova, I.; Belan, L. Random Forest Modeling of Soil Properties in Saline Semi-Arid Areas. Agriculture 2023, 13, 976. [Google Scholar] [CrossRef]
  16. Wang, L.; Zhou, Y. Combining Multitemporal Sentinel-2A Spectral Imaging and Random Forest to Improve the Accuracy of Soil Organic Matter Estimates in the Plough Layer for Cultivated Land. Agriculture 2023, 13, 8. [Google Scholar] [CrossRef]
  17. Zhan, D.; Mu, Y.; Duan, W.; Ye, M.; Song, Y.; Song, Z.; Yao, K.; Sun, D.; Ding, Z. Spatial Prediction and Mapping of Soil Water Content by TPE-GBDT Model in Chinese Coastal Delta Farmland with Sentinel-2 Remote Sensing Data. Agriculture 2023, 13, 1088. [Google Scholar] [CrossRef]
  18. Luo, C.; Wang, Y.; Zhang, X.; Zhang, W.; Liu, H. Spatial prediction of soil organic matter content using multiyear synthetic images and partitioning algorithms. Catena 2022, 211, 106023. [Google Scholar] [CrossRef]
  19. Luo, C.; Zhang, W.; Zhang, X.; Liu, H. Mapping of soil organic matter in a typical black soil area using Landsat-8 synthetic images at different time periods. Catena 2023, 231, 107336. [Google Scholar] [CrossRef]
  20. Croft, H.; Anderson, K.; Kuhn, N.J. Evaluating the influence of surface soil moisture and soil surface roughness on optical directional reflectance factors. Eur. J. Soil Sci. 2014, 65, 605–612. [Google Scholar] [CrossRef]
  21. Castaldi, F.; Palombo, A.; Pascucci, S.; Pignatti, S.; Santini, F.; Casa, R. Reducing the Influence of Soil Moisture on the Estimation of Clay from Hyperspectral Data: A Case Study Using Simulated PRISMA Data. Remote Sens. 2015, 7, 15561–15582. [Google Scholar] [CrossRef]
  22. Prudnikova, E.; Savin, I. Some Peculiarities of Arable Soil Organic Matter Detection Using Optical Remote Sensing Data. Remote Sens. 2021, 13, 2313. [Google Scholar] [CrossRef]
  23. Wang, S.; Gao, J.; Zhuang, Q.; Lu, Y.; Gu, H.; Jin, X. Multispectral Remote Sensing Data Are Effective and Robust in Mapping Regional Forest Soil Organic Carbon Stocks in a Northeast Forest Region in China. Remote Sens. 2020, 12, 393. [Google Scholar] [CrossRef]
  24. Castaldi, F.; Hueni, A.; Chabrillat, S.; Ward, K.; Buttafuoco, G.; Bomans, B.; Vreys, K.; Brell, M.; van Wesemael, B. Evaluating the capability of the Sentinel 2 data for soil organic carbon prediction in croplands. ISPRS-J. Photogramm. Remote Sens. 2019, 147, 267–282. [Google Scholar] [CrossRef]
  25. Rienzi, E.A.; Mijatovic, B.; Mueller, T.G.; Matocha, C.J.; Sikora, F.J.; Castrignano, A. Prediction of Soil Organic Carbon under Varying Moisture Levels using Reflectance Spectroscopy. Soil Sci. Soc. Am. J. 2014, 78, 958–967. [Google Scholar] [CrossRef]
  26. Zhang, Y.; Zhou, L.; Zhou, Y.; Zhang, L.; Yao, X.; Shi, K.; Jeppesen, E.; Yu, Q.; Zhu, W. Chromophoric dissolved organic matter in inland waters: Present knowledge and future challenges. Sci. Total Environ. 2021, 759, 143550. [Google Scholar] [CrossRef] [PubMed]
  27. Ou, D.; Tan, K.; Li, J.; Wu, Z.; Zhao, L.; Ding, J.; Wang, X.; Zou, B. Prediction of soil organic matter by Kubelka-Munk based airborne hyperspectral moisture removal model. Int. J. Appl. Earth Obs. Geoinf. 2023, 124, 103493. [Google Scholar] [CrossRef]
  28. Yue, J.; Tian, Q.; Tang, S.; Xu, K.; Zhou, C. A dynamic soil endmember spectrum selection approach for soil and crop residue linear spectral unmixing analysis. Int. J. Appl. Earth Obs. Geoinf. 2019, 78, 306–317. [Google Scholar] [CrossRef]
  29. Lin, C.; Zhu, A.X.; Wang, Z.; Wang, X.; Ma, R. The refined spatiotemporal representation of soil organic matter based on remote images fusion of Sentinel-2 and Sentinel-3. Int. J. Appl. Earth Obs. Geoinf. 2020, 89, 102094. [Google Scholar] [CrossRef]
  30. Jiang, R.; Sui, Y.; Zhang, X.; Lin, N.; Zheng, X.; Li, B.; Zhang, L.; Li, X.; Yu, H. Estimation of soil organic carbon by combining hyperspectral and radar remote sensing to reduce coupling effects of soil surface moisture and roughness. Geoderma 2024, 444, 116874. [Google Scholar] [CrossRef]
  31. Minasny, B.; McBratney, A.B.; Bellon-Maurel, V.; Roger, J.-M.; Gobrecht, A.; Ferrand, L.; Joalland, S. Removing the effect of soil moisture from NIR diffuse reflectance spectra for the prediction of soil organic carbon. Geoderma 2011, 167–168, 118–124. [Google Scholar] [CrossRef]
  32. Zhou, T.; Geng, Y.; Chen, J.; Liu, M.; Haase, D.; Lausch, A. Mapping soil organic carbon content using multi-source remote sensing variables in the Heihe River Basin in China. Ecol. Indic. 2020, 114, 106288. [Google Scholar] [CrossRef]
  33. Wang, X.; Zhang, F.; Kung, H.-t.; Johnson, V.C. New methods for improving the remote sensing estimation of soil organic matter content (SOMC) in the Ebinur Lake Wetland National Nature Reserve (ELWNNR) in northwest China. Remote Sens. Environ. 2018, 218, 104–118. [Google Scholar] [CrossRef]
  34. Chen, S.; Zhao, K.; Jiang, T.; Li, X.; Zheng, X.; Wan, X.; Zhao, X. Predicting Surface Roughness and Moisture of Bare Soils Using Multiband Spectral Reflectance Under Field Conditions. Chin. Geogr. Sci. 2018, 28, 986–997. [Google Scholar] [CrossRef]
  35. Palmisano, D.; Satalino, G.; Balenzano, A.; Mattia, F. Coherent and Incoherent Change Detection for Soil Moisture Retrieval From Sentinel-1 Data. IEEE Geosci. Remote Sens. Lett. 2022, 19, 2503805. [Google Scholar] [CrossRef]
  36. Chen, K.S.; Wu, T.D.; Tsang, L.; Li, Q.; Shi, J.C.; Fung, A.K. Emission of rough surfaces calculated by the integral equation method with comparison to three-dimensional moment method Simulations. IEEE Trans. Geosci. Remote Sens. 2003, 41, 90–101. [Google Scholar] [CrossRef]
  37. Jiang, C.; Fang, H. GSV: A general model for hyperspectral soil reflectance simulation. Int. J. Appl. Earth Obs. Geoinf. 2019, 83, 101932. [Google Scholar] [CrossRef]
  38. Wang, Q.; Li, P.; Pu, Z.; Chen, X. Calibration and validation of salt-resistant hyperspectral indices for estimating soil moisture in arid land. J. Hydrol. 2011, 408, 276–285. [Google Scholar] [CrossRef]
  39. Yuan, J.; Wang, X.; Yan, C.-X.; Wang, S.-R.; Ju, X.-P.; Li, Y. Soil Moisture Retrieval Model for Remote Sensing Using Reflected Hyperspectral Information. Remote Sens. 2019, 11, 366. [Google Scholar] [CrossRef]
  40. Chi, J.; Crawford, M.M. Spectral Unmixing-Based Crop Residue Estimation Using Hyperspectral Remote Sensing Data: A Case Study at Purdue University. IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens. 2014, 7, 2531–2539. [Google Scholar] [CrossRef]
  41. Esteves, C.; Fangueiro, D.; Braga, R.P.; Martins, M.; Botelho, M.; Ribeiro, H. Assessing the Contribution of ECa and NDVI in the Delineation of Management Zones in a Vineyard. Agronomy 2022, 12, 1331. [Google Scholar] [CrossRef]
  42. Xu, S.; Zhao, Y.; Wang, M.; Shi, X. Determination of rice root density from Vis-NIR spectroscopy by support vector machine regression and spectral variable selection techniques. Catena 2017, 157, 12–23. [Google Scholar] [CrossRef]
  43. Yang, L.; He, X.; Shen, F.; Zhou, C.; Zhu, A.X.; Gao, B.; Chen, Z.; Li, M. Improving prediction of soil organic carbon content in croplands using phenological parameters extracted from NDVI time series data. Soil Tillage Res. 2020, 196, 104465. [Google Scholar] [CrossRef]
  44. Meng, X.; Bao, Y.; Wang, Y.; Zhang, X.; Liu, H. An advanced soil organic carbon content prediction model via fused temporal-spatial-spectral (TSS) information based on machine learning and deep learning algorithms. Remote Sens. Environ. 2022, 280, 113166. [Google Scholar] [CrossRef]
  45. Cui, M.; Cai, Q.; Zhu, A.; Fan, H. Soil erosion along a long slope in the gentle hilly areas of black soil region in Northeast China. J. Geogr. Sci. 2007, 17, 375–383. [Google Scholar] [CrossRef]
  46. Ou, Y.; Rousseau, A.N.; Wang, L.; Yan, B. Spatio-temporal patterns of soil organic carbon and pH in relation to environmental factors-A case study of the Black Soil Region of Northeastern China. Agric. Ecosyst. Environ. 2017, 245, 22–31. [Google Scholar] [CrossRef]
  47. Suleman, M.M.; Xu, H.; Zhang, W.; Nizamuddin, D.; Xu, M. Soil microbial biomass carbon and carbon dioxide response by glucose-C addition in black soil of China. Soil Environ. 2019, 38, 48–56. [Google Scholar] [CrossRef]
  48. Koegel-Knabner, I.; Guggenberger, G.; Kleber, M.; Kandeler, E.; Kalbitz, K.; Scheu, S.; Eusterhues, K.; Leinweber, P. Organo-mineral associations in temperate soils: Integrating biology, mineralogy, and organic matter chemistry. J. Plant Nutr. Soil Sci. 2008, 171, 61–82. [Google Scholar] [CrossRef]
  49. Lin, N.; Jiang, R.; Li, G.; Yang, Q.; Li, D.; Yang, X. Estimating the heavy metal contents in farmland soil from hyperspectral images based on Stacked AdaBoost ensemble learning. Ecol. Indic. 2022, 143, 109330. [Google Scholar] [CrossRef]
  50. Zheng, X.; Feng, Z.; Li, L.; Li, B.; Jiang, T.; Li, X.; Li, X.; Chen, S. Simultaneously estimating surface soil moisture and roughness of bare soils by combining optical and radar data. Int. J. Appl. Earth Obs. Geoinf. 2021, 100, 102345. [Google Scholar] [CrossRef]
  51. Xu, Y.; Tan, Y.; Abd-Elrahman, A.; Fan, T.; Wang, Q. Incorporation of Fused Remote Sensing Imagery to Enhance Soil Organic Carbon Spatial Prediction in an Agricultural Area in Yellow River Basin, China. Remote Sens. 2023, 15, 2017. [Google Scholar] [CrossRef]
  52. Tan, K.; Ma, W.; Chen, L.; Wang, H.; Du, Q.; Du, P.; Yan, B.; Liu, R.; Li, H. Estimating the distribution trend of soil heavy metals in mining area from HyMap airborne hyperspectral imagery based on ensemble learning. J. Hazard. Mater. 2021, 401, 123288. [Google Scholar] [CrossRef]
  53. Lin, N.; Fu, J.; Jiang, R.; Li, G.; Yang, Q. Lithological Classification by Hyperspectral Images Based on a Two-Layer XGBoost Model, Combined with a Greedy Algorithm. Remote Sens. 2023, 15, 3764. [Google Scholar] [CrossRef]
  54. Ge, X.; Ding, J.; Teng, D.; Xie, B.; Zhang, X.; Wang, J.; Han, L.; Bao, Q.; Wang, J. Exploring the capability of Gaofen-5 hyperspectral data for assessing soil salinity risks. Int. J. Appl. Earth Obs. Geoinf. 2022, 112, 102969. [Google Scholar] [CrossRef]
  55. Guo, L.; Sun, X.; Fu, P.; Shi, T.; Dang, L.; Chen, Y.; Linderman, M.; Zhang, G.; Zhang, Y.; Jiang, Q.; et al. Mapping soil organic carbon stock by hyperspectral and time-series multispectral remote sensing images in low-relief agricultural areas. Geoderma 2021, 398, 115118. [Google Scholar] [CrossRef]
  56. Liu, S.; Chen, J.; Guo, L.; Wang, J.; Zhou, Z.; Luo, J.; Yang, R. Prediction of soil organic carbon in soil profiles based on visible-near-infrared hyperspectral imaging spectroscopy. Soil Tillage Res. 2023, 232, 105736. [Google Scholar] [CrossRef]
  57. Honarbakhsh, A.; Tahmoures, M.; Afzali, S.F.; Khajehzadeh, M.; Ali, M.S. Remote sensing and relief data to predict soil saturated hydraulic conductivity in a calcareous watershed, Iran. Catena 2022, 212, 106046. [Google Scholar] [CrossRef]
  58. Yang, R.-M.; Guo, W.-W. Modelling of soil organic carbon and bulk density in invaded coastal wetlands using Sentinel-1 imagery. Int. J. Appl. Earth Obs. Geoinf. 2019, 82, 101906. [Google Scholar] [CrossRef]
  59. Gao, L.; Zhu, X.; Han, Z.; Wang, L.; Zhao, G.; Jiang, Y. Spectroscopy-Based Soil Organic Matter Estimation in Brown Forest Soil Areas of the Shandong Peninsula, China. Pedosphere 2019, 29, 810–818. [Google Scholar] [CrossRef]
  60. Selige, T.; Boehner, J.; Schmidhalter, U. High resolution topsoil mapping using hyperspectral image and field data in multivariate regression modeling procedures. Geoderma 2006, 136, 235–244. [Google Scholar] [CrossRef]
  61. Krzyszczak, J.; Baranowski, P.; Pastuszka, J.; Wesolowska, M.; Cymerman, J.; Slawinski, C.; Siedliska, A. Assessment of soil water retention characteristics based on VNIR/SWIR hyperspectral imaging of soil surface. Soil Tillage Res. 2023, 233, 105789. [Google Scholar] [CrossRef]
  62. Eshqi Molan, Y.; Lu, Z. Modeling InSAR Phase and SAR Intensity Changes Induced by Soil Moisture. IEEE Trans. Geosci. Remote Sens. 2020, 58, 4967–4975. [Google Scholar] [CrossRef]
  63. Van Hateren, T.C.C.; Chini, M.; Matgen, P.; Pulvirenti, L.; Pierdicca, N.; Teuling, A.J.J. On the Use of Native Resolution Backscatter Intensity Data for Optimal Soil Moisture Retrieval. IEEE Geosci. Remote Sens. Lett. 2023, 20, 3001605. [Google Scholar] [CrossRef]
  64. Shilpa, K.; Raju, C.S.; Mandal, D.; Rao, Y.S.; Shetty, A. Soil Moisture Retrieval Over Crop Fields from Multi-polarization SAR Data. J. Indian Soc. Remote Sens. 2023, 51, 949–962. [Google Scholar] [CrossRef]
  65. Zhang, Z.; Lin, H.; Wang, M.; Liu, X.; Chen, Q.; Wang, C.; Zhang, H. A Review of Satellite Synthetic Aperture Radar Interferometry Applications in Permafrost Regions: Current Status, Challenges, and Trends. IEEE Geosci. Remote Sens. Mag. 2022, 10, 93–114. [Google Scholar] [CrossRef]
Figure 1. Framework of the proposed SOM estimation model.
Figure 1. Framework of the proposed SOM estimation model.
Agronomy 14 01067 g001
Figure 2. Study area overview. (a) The geographical location of the study area; (b,c) the soil-sampling points at Site 1 and Site 2, respectively; (d,e) the soil surfaces during the “bare soil period”.
Figure 2. Study area overview. (a) The geographical location of the study area; (b,c) the soil-sampling points at Site 1 and Site 2, respectively; (d,e) the soil surfaces during the “bare soil period”.
Agronomy 14 01067 g002
Figure 3. The schematic diagram of the soil sample collection and parameter measurement in the sample square. (a) Schematic diagram of the sampling strategy in the quadrant; (b) soil surface point cloud data measurement.
Figure 3. The schematic diagram of the soil sample collection and parameter measurement in the sample square. (a) Schematic diagram of the sampling strategy in the quadrant; (b) soil surface point cloud data measurement.
Agronomy 14 01067 g003
Figure 4. Imaging spectrum, laboratory spectrum, and their correlation coefficients.
Figure 4. Imaging spectrum, laboratory spectrum, and their correlation coefficients.
Agronomy 14 01067 g004
Figure 5. Spectral characteristics of soils with different physical properties.
Figure 5. Spectral characteristics of soils with different physical properties.
Agronomy 14 01067 g005
Figure 6. R2 for fitting soil physical parameters to soil pixel spectrum based on multiple parameter estimation models.
Figure 6. R2 for fitting soil physical parameters to soil pixel spectrum based on multiple parameter estimation models.
Agronomy 14 01067 g006
Figure 7. Soil spectrum simulated through empirical equations based on SM (a), RMSH (b), and SBW (c).
Figure 7. Soil spectrum simulated through empirical equations based on SM (a), RMSH (b), and SBW (c).
Agronomy 14 01067 g007aAgronomy 14 01067 g007b
Figure 8. Soil spectral correction accuracy based on multi-order polynomial regression.
Figure 8. Soil spectral correction accuracy based on multi-order polynomial regression.
Agronomy 14 01067 g008
Figure 9. Soil spectral correction accuracy based on machine-learning models.
Figure 9. Soil spectral correction accuracy based on machine-learning models.
Agronomy 14 01067 g009
Figure 10. Soil pixel spectra corrected with the XG-Boost model and the fourth-order-polynomial model and the correlation coefficient between the corrected spectrum and the laboratory spectrum.
Figure 10. Soil pixel spectra corrected with the XG-Boost model and the fourth-order-polynomial model and the correlation coefficient between the corrected spectrum and the laboratory spectrum.
Agronomy 14 01067 g010
Figure 11. (a) Pearson’s correlation coefficients between SOM contents and spectral reflectance of each band. (b) RMSECV (unit: g/kg) of multiple regression with different numbers of CARS iterations.
Figure 11. (a) Pearson’s correlation coefficients between SOM contents and spectral reflectance of each band. (b) RMSECV (unit: g/kg) of multiple regression with different numbers of CARS iterations.
Agronomy 14 01067 g011
Figure 12. Scatter plots of predicted and measured SOM contents based on the four spectral datasets.
Figure 12. Scatter plots of predicted and measured SOM contents based on the four spectral datasets.
Agronomy 14 01067 g012
Figure 13. Soil pixel spectra corrected with the XG-Boost and fourth-order-polynomial models and the RPD of two correction models.
Figure 13. Soil pixel spectra corrected with the XG-Boost and fourth-order-polynomial models and the RPD of two correction models.
Agronomy 14 01067 g013
Figure 14. Scatter plots of the measured and predicted SOM contents based on (a) original pixel spectrum, (b) ground spectrum, (c) fourth-order-polynomial-corrected spectrum, and (d) XG-Boost-corrected spectrum with Site 2 data, using the XG-Boost model established using Site 1 data.
Figure 14. Scatter plots of the measured and predicted SOM contents based on (a) original pixel spectrum, (b) ground spectrum, (c) fourth-order-polynomial-corrected spectrum, and (d) XG-Boost-corrected spectrum with Site 2 data, using the XG-Boost model established using Site 1 data.
Agronomy 14 01067 g014
Figure 15. Contribution rates of soil properties (SM, RMSH, and SBW) to the estimated SOM bias at Site 1 (a) and Site 2 (b). “Random” denotes the part that these three variables cannot explain.
Figure 15. Contribution rates of soil properties (SM, RMSH, and SBW) to the estimated SOM bias at Site 1 (a) and Site 2 (b). “Random” denotes the part that these three variables cannot explain.
Agronomy 14 01067 g015
Table 1. ZY1-02D satellite hyperspectral camera parameters.
Table 1. ZY1-02D satellite hyperspectral camera parameters.
SpecificationParameter
Spectral range (nm)400–2500
Channels76 (VNIR); 90 (SWIR)
Spectral resolution (nm)10 (VNIR); 20 (SWIR)
Swath width (km)60
Spatial resolution (m)30
Revisit cycle (d)3
Lateral swing capacity (°)±26
Table 2. Descriptive statistics of soil parameters and SOM content.
Table 2. Descriptive statistics of soil parameters and SOM content.
DatasetUnitSite 1Site 2
Min.Max.MeanSDCV%Min.Max.MeanSDCV%
SMcm3/cm30.140.470.250.0831.990.210.630.370.1437.93
RMSHcm1.324.992.490.7730.922.045.783.651.3436.71
SBWg/cm30.711.410.980.1515.310.851.511.130.1815.92
SOMg/kg25.8475.9743.2510.5124.3027.4072.9741.5710.2824.72
Table 3. Features’ band statistics selected based on CARS.
Table 3. Features’ band statistics selected based on CARS.
Spectral Correction MethodWavelength (Unit: um)Total
Pixel spectrum0.67; 0.68; 0.70; 0.72; 0.74; 0.77; 0.79; 0.84; 0.87; 0.90; 0.9311
Fourth-order-polynomial-corrected spectrum0.55; 0.60; 0.62; 0.68; 0.73; 0.76; 0.78; 0.82; 0.85; 0.87; 0.91; 0.96; 0.99; 1.0714
XG-Boost-corrected spectrum0.55; 0.62; 0.64; 0.69; 0.73; 0.77; 0.81; 0.83; 0.87; 0.89; 0.92; 0.94; 0.99; 1.05; 1.1715
Ground-based spectrum060; 0.63; 0.67; 0.70; 0.73; 0.77; 0.81; 0.85; 0.87; 0.90; 0.91; 0.96; 0.99; 1.03; 1.08; 1.2216
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sui, Y.; Jiang, R.; Lin, N.; Yu, H.; Zhang, X. Improving the Spatiotemporal Transferability of Hyperspectral Remote Sensing for Estimating Soil Organic Matter by Minimizing the Coupling Effect of Soil Physical Properties on the Spectrum: A Case Study in Northeast China. Agronomy 2024, 14, 1067. https://doi.org/10.3390/agronomy14051067

AMA Style

Sui Y, Jiang R, Lin N, Yu H, Zhang X. Improving the Spatiotemporal Transferability of Hyperspectral Remote Sensing for Estimating Soil Organic Matter by Minimizing the Coupling Effect of Soil Physical Properties on the Spectrum: A Case Study in Northeast China. Agronomy. 2024; 14(5):1067. https://doi.org/10.3390/agronomy14051067

Chicago/Turabian Style

Sui, Yuanyuan, Ranzhe Jiang, Nan Lin, Haiye Yu, and Xin Zhang. 2024. "Improving the Spatiotemporal Transferability of Hyperspectral Remote Sensing for Estimating Soil Organic Matter by Minimizing the Coupling Effect of Soil Physical Properties on the Spectrum: A Case Study in Northeast China" Agronomy 14, no. 5: 1067. https://doi.org/10.3390/agronomy14051067

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop