Assessing the potential of complex artificial neural networks for modelling small-scale soil erosion by water

Barthel, Nils; Ott, Simone; Burkhard, Benjamin; Steinhoff-Knopp, Bastian

doi:10.5194/soil-12-321-2026

Articles | Volume 12, issue 1

https://doi.org/10.5194/soil-12-321-2026

Articles | Volume 12, issue 1

Original research article

30 Mar 2026

Original research article |

| 30 Mar 2026

Assessing the potential of complex artificial neural networks for modelling small-scale soil erosion by water

Nils Barthel, Simone Ott, Benjamin Burkhard, and Bastian Steinhoff-Knopp

Abstract

Modelling soil erosion by water is essential for developing effective mitigation strategies and preventing on- and off-site damages in agricultural areas. So far, complex artificial neural networks have rarely been applied in small-scale soil erosion modelling, and their potential still remains unclear. This study compares the performance of different neural network architectures for modelling soil erosion by water at a small spatial scale in agricultural cropland. The analysis was based on erosion rate data (in t ha⁻¹ yr⁻¹) at a 5 m × 5 m resolution, derived from a 20-year monitoring programme, and covers 458 ha of cropland across seven investigation areas in northern Germany. Nineteen predictor variables related to topography, climate, management, and soil properties were selected as inputs to assess their interrelationships with observed erosion patterns. A single-layer neural network (SNN), a deep neural network (DNN), and a convolutional neural network (CNN) were applied and evaluated against a random forest (RF) model used as a benchmark. A leave-one-area-out validation was applied to evaluate how well the models generalize to areas withheld entirely during training. While all models tended to underestimate high erosion rates, they often successfully captured the underlying spatial patterns. All tested models exhibited comparable root mean squared errors (RMSE: 2.2 t ha⁻¹ yr⁻¹). With respect to mean absolute error (MAE), the neural network models achieved slightly lower values (MAE: 0.9 t ha⁻¹ yr⁻¹) than the random forest model (MAE: 1.0 t ha⁻¹ yr⁻¹). Clearer differences between models were observed for the F₁ scores, which reflect performance across soil loss classes. Here, the CNN achieved the highest F₁ score (0.46) among the tested models. This study demonstrates the potential of complex neural networks to capture erosion patterns at the field-to-landscape scale and provides insights into the relevance of the chosen predictor variables, as well as key modelling limitations, such as the underestimation of very high erosion rates in unseen areas. It also highlights the need for more comprehensive datasets to improve generalization capabilities of the models.

Download & links

Article (PDF, 7731 KB)

Download & links

How to cite.

Received: 24 Jul 2025 – Discussion started: 08 Aug 2025 – Revised: 04 Mar 2026 – Accepted: 10 Mar 2026 – Published: 30 Mar 2026

1 Introduction

Soil erosion by water causes the loss of topsoil, along with the displacement of organic carbon and nutrients, degrading agricultural land and contributing to off-site damages such as water eutrophication and road blockages (Issaka and Ashraf, 2017). These processes have significant and long-lasting environmental and economic consequences in affected areas worldwide. A combination of anthropogenic, climatic, topographic, and pedological factors unique to each region defines the extent and distribution of soil erosion by water. Modelling the interactions among these factors at fine spatial scales can help reveal key drivers and the resulting erosion patterns. The resulting insights can support the development of targeted strategies to mitigate further soil degradation (Borrelli et al., 2018; Igwe et al., 2017).

Various approaches have been applied to estimate soil erosion by water across different spatial scales, ranging from global (Guerra et al., 2020), continental (Panagos et al., 2021), national (Plambeck, 2020), to small field plots (Anache et al., 2018). The respective modelling approaches vary in data requirements, input variables, underlying principles, and complexity. Model outputs are typically classified into erosion severity, susceptibility categories or expressed as continuous predictions of soil erosion rates over space and time. Depending on the intended output and available data, a variety of model types have been developed, including process-based, empirical, and, more recently, machine learning models (Borrelli et al., 2021).

Examples of physical process-based models are WEPP (Nearing et al., 1989; Pieri et al., 2007), Erosion 3D (Schmidt et al., 1999), and EUROSEM (Morgan et al., 1998). Empirical models include the Universal Soil Loss Equation (USLE; Wischmeier and Smith, 1978), its revised version (RUSLE; Renard et al., 1997), and further adaptations (Borrelli et al., 2021). One reason for the popularity of the USLE and its variations is their relatively low complexity, making these models time- and cost-efficient (Alewell et al., 2019; Avand et al., 2023; Kumar et al., 2022). The apparent simplicity of the USLE contrasts with the complexity of calculating its individual factors, which results in a wide range of individual USLE-based applications and estimated loss rates due to variations in input data and methods (Fiener et al., 2020).

Machine learning approaches have been increasingly used to identify areas at risk of soil erosion by water and to capture the complex relationships between environmental predictors and observed erosion patterns. This includes the application of methods such as random forests (RF; Garosi et al., 2019; Ghosh and Maiti, 2021; Jaafari et al., 2022), with several studies reporting that RF can outperform methods such as support vector machines (SVMs) and generalized additive models (GAMs). While these studies highlight the potential of RF models, their main focus has been on predicting and classifying gully-erosion susceptibility.

Another promising machine learning approach for soil erosion modelling is the application of artificial neural networks (ANNs). However, existing studies in this field have often employed simple neural network architectures, typically with only a single hidden layer. These single-layer networks have not only been used to model soil erosion susceptibility (De la Rosa et al., 1999) but also to predict quantitative soil erosion rates at the plot scale (Licznar and Nearing, 2003). These early applications demonstrate the potential of ANNs, but single-layer networks may not fully utilize the capacity of more complex architectures to capture non-linear relationships between influencing variables and the spatial distribution and magnitude of soil erosion (Avand et al., 2023).

More complex, multi-layered ANNs have rarely been applied to predict soil erosion rates quantitatively. Instead, they have been mostly used to model a classified erosion susceptibility (Golkarian et al., 2023; Khosravi et al., 2023; Sarkar and Mishra, 2018) or have been restricted to gully erosion (Ghorbanzadeh et al., 2020; Saha et al., 2021). In cases where ANNs have been applied to quantify continuous soil erosion rates, studies often rely on limited datasets, such as those collected using erosion pins over just one year (Gholami et al., 2021; Sahour et al., 2021).

While machine learning methods have shown promise in modelling soil erosion, their application to predict continuous erosion rates at fine spatial scales remains limited, particularly when using neural networks. Currently, most studies rely on input data with spatial resolutions between 5 and 300 m, with higher-resolution datasets often restricted to small areas, such as individual plots (Borrelli et al., 2021; Parsons, 2019). This lack of research combining neural networks with high-resolution data is partly due to the limited availability of long-term monitoring schemes at the landscape scale, which provides spatially explicit erosion data as ground truth for model training and validation (Batista et al., 2025).

To address this gap, this study utilized long-term soil erosion monitoring data spanning more than two decades and collected across seven study areas in northern Germany (Steinhoff-Knopp and Burkhard, 2018). This monitoring dataset, together with high-spatial-resolution predictor variables, served as input for different machine learning models. Specifically, three neural networks of increasing complexity and a random forest, used as a benchmark model, were systematically compared to evaluate their ability to predict continuous soil erosion rates. To assess model robustness and generalizability, we applied a leave-one-area-out validation strategy, in which each study area was withheld in turn for testing while the models were trained on the remaining areas.

The goal of this study was to contribute to a more comprehensive understanding of the strengths and limitations of different machine learning approaches for soil erosion modelling by aiming to answer the following research questions:

Which machine learning approach achieves the highest predictive performance in estimating spatial patterns of continuous soil erosion rates at the field-to-landscape scale?
What are the challenges and limitations when reproducing erosion patterns within the same areas and when extrapolating to previously unseen areas?
Which predictor variables are most important for predicting soil erosion by water at the field-to-landscape scale across different machine learning models?

2 Methods

2.1 Study area

This study uses data from seven distinct study areas in Lower Saxony, Germany, located between 53.1 and 51.9°N latitude and 8.3 and 10.5° E longitude. Five of these regions are clustered in the southeastern part of Lower Saxony, and named Leine-Innerstebergland, while the other two are located in the Northeast and Southwest (Fig. 1). The cultivated cropland within all the areas covers 458 ha, is prone to erosion, and represents different soil types, relief characteristics, and management conditions (Capelle and Lüders, 1985; Capelle, 1990).

https://soil.copernicus.org/articles/12/321/2026/soil-12-321-2026-f01

Figure 1Overview of study areas in Lower Saxony and mapped soil erosion rates in (a) Küingdorf, (b) Barum, (c) Lamspringe, (d) Klein Ilde, (e) Nette, (f) Adenstedt, and (g) Brüggen.

A large part of the soils – predominantly Luvisols and Cambisols – has a high silt content due to loess or sand-loess deposits, which results in higher topsoil erodibility (Steinhoff-Knopp and Burkhard, 2018). The mean slope across all study areas is 4.01°, with the southern regions having a steeper relief (4.81°) compared to the northern (2.35°) and the western region (3.68°). The primary crops by cultivated area are winter wheat, followed by winter barley, rapeseed, sugar beet, maize, and potatoes. All farmers within the monitored areas practice conventional farming and implement various soil conservation practices, including reduced tillage, contour-parallel tillage, cover crops, grassed tramlines, and drainage systems.

2.2 Data collection

The soil erosion dataset used in this study was derived from a long-term monitoring programme funded by the Lower Saxony State Authority for Mining, Energy and Geology (LBEG), covering the years 2000 to 2020. Exceptions are the investigation areas Adenstedt, where monitoring began in 2002, and Klein Ilde, where monitoring stopped in 2015. The methodology of the field surveys is based on the recommendations by Rohr et al. (1990) and the instructions by DVWK (1996) and Botschek et al. (2021). To implement a more efficient workflow, the mobile mapping application EroPad was used since 2010 (see Steinhoff et al., 2013). The field surveys were carried out each year after the snowmelt (usually in February or March) throughout the vegetation period and typically within one week after each erosive rainfall event (≥10 mm total rainfall or >10 mm h⁻¹ within 30 min), following the German definition of erosive rainfall events in place during the monitoring period (see DIN, 2005, 2017; Schwertmann et al., 1987). Mapping in early spring (after the snowmelt) records cumulative erosion caused by snowmelt and single precipitation events during the winter agricultural dormant period. Surveys were also conducted when farmers reported erosion features.

The field surveys aimed to collect data on the spatial distribution and extent of three main types of soil erosion by water: linear (i.e. rill) erosion, sheet erosion, and sheet-to-linear erosion (erosion systems showing both features of sheet and linear erosion). The losses by linear erosion features were surveyed by measuring their volume and extrapolated to an 8 m buffer around their occurrence to account for the dispersion of soil loss due to tillage practices by farmers. Deposition sites were also recorded, but only qualitatively, and were therefore not included in the modelling. A detailed description and evaluation of the survey dataset is provided in Steinhoff-Knopp and Burkhard (2018). For the modelling process, soil erosion rates (t ha⁻¹ yr⁻¹) were calculated by overlaying all surveyed erosion features in a grid resolution of 5 m × 5 m (Fig. 1).

Table 1Overview of the variables used in the soil erosion modelling process. A detailed description, including processing steps, can be found in Appendix A1.

Download Print Version | Download XLSX

Nineteen predictor variables (i.e. features or covariates) potentially influencing soil erosion were selected for the modelling process (Table 1). Further information on the methods to determine each predictor variable is provided in Appendix A1. The variables include factors related to topography, climate, soil properties, and agricultural land management. The topographic variables were derived from a digital elevation model (DEM) with a native resolution of 1 m × 1 m, resampled to 5 m × 5 m, which serves as the basis for all derived topographic variables. The USLE R factor (erosivity of rainfall), originally calculated at a grid resolution of 1 km × 1 km, was aligned with the DEM-based variables by directly assigning the values of the corresponding 1 km pixels without further modification. The USLE K factor (erodibility of topsoil), derived from a soil map at a scale of 1 : 50 000 (LBEG, 2017), and the USLE C factor (crop cover and management factor) were rasterized to the 5 m × 5 m grid. The C factor was determined for each field using agricultural land management data collected during the field surveys and through interviews with the respective farmers.

Several of the variables, especially the topographic variables, were derived from each other or capture related aspects, which could lead to a strong correlation between them (Avand et al., 2023; Jaafari et al., 2022). Therefore, the degree of the linear relationship between variables was assessed by using the Pearson correlation coefficient and the variance inflation factor (VIF). The Pearson correlation coefficient is used to gain information on the pairwise linear correlation between the variables, with ±0.7 being considered a threshold for strong correlation (Schober et al., 2018). The VIF is used to assess multicollinearity across all variables, evaluating whether variables with high pairwise correlations can still provide valuable information for the final predictions (Ebrahimi-Khusfi et al., 2021; O'Brien, 2007). Variables with a VIF below 5 are considered to have a low level of multicollinearity and are used for the modelling (Daoud, 2017).

2.3 Machine learning models

This section provides a brief introduction to the four distinct machine learning models employed in this study. The model implementations and hyperparameter selection process are described in Sect. 2.5.

2.3.1 Random forest

A random forest (RF) model combines multiple decision trees to create an ensemble model to improve prediction accuracy (Breiman, 2001). A decision tree splits data based on feature values across multiple levels of nodes, with each branch representing different decision paths leading to predictions (Kingsford and Salzberg, 2008). Each tree is trained on a random subset of the variables, and, in the case of regression tasks, as applied in this study, the final prediction is obtained by averaging the outputs of all trees.

Although mostly limited to classification outputs, RF has shown promise in previous studies on soil erosion modelling (Garosi et al., 2019; Ghosh and Maiti, 2021; Jaafari et al., 2022). Therefore, in this study, RF serves as a benchmark for evaluating the performance of neural networks against other machine learning models. The RF model used in this study consisted of 400 decision trees, with a maximum tree depth of 20, a minimum of two samples required to split an internal node, and at least one sample per leaf.

2.3.2 Single-hidden layer neural network

Artificial neural networks consist of an input layer, an output layer, and one or more so-called hidden layers of neurons (or nodes) in between (Rumelhart et al., 1986). Each neuron in these layers is interconnected through weights. During training, these weights are updated through an optimization process involving multiple iterations, a loss function, and, most commonly, backpropagation. The weighted sum of the inputs is passed through an activation function, which applies a non-linear transformation to determine the neuron's output, which is then passed on to the subsequent layer. In a fully connected (or dense) layer, each neuron is connected to every neuron in the subsequent layer through individual weights.

Three different types of neural networks were compared in this study, each with a distinct and increasingly complex architecture. The first type is a neural network with a single dense layer (SNN) of 512 neurons between the input and the output layer (see Wythoff, 1993). The Rectified Linear Unit (ReLU) activation function was applied in both the hidden and output layers in this and the following two variations of neural network models, to introduce non-linear relationships and constrain predictions to positive values (see Krizhevsky et al., 2017; Nair and Hinton, 2010). Additionally, L₂ regularization, also known as Ridge regularization, was employed across all neural networks to reduce overfitting (see Hoerl and Kennard, 1970; Ng, 2004). The Adam optimizer was used to determine the learning rate during training (see Kingma and Ba, 2014).

2.3.3 Deep neural network

The interconnected neurons and the corresponding weights enable neural networks to capture complex non-linear relationships. Increasing the number of hidden layers enhances this capability while increasing training time and the likelihood of overfitting the model to the training data. Such a neural network with multiple hidden layers is commonly known as a deep neural network (DNN) or a deep learning model (LeCun et al., 2015). In this case, the DNN was modified to have three hidden layers, each with 256 neurons.

2.3.4 Convolutional neural network

Convolutional neural networks (CNNs) are specifically designed to capture spatial relationships in grid-like data structures, such as images (LeCun et al., 2015). This is achieved through convolutional layers that receive input patches containing the neighbouring grid cells around each target pixel. These patches provide local spatial context, enabling the network to learn spatial patterns and relationships relevant for predicting soil erosion (Krizhevsky et al., 2017; LeCun et al., 2015). In this study, each input consisted of a 7 × 7 grid-cell patch. Within these patches, the convolutional layers extracted spatial features using 256 filters per layer. Two dense layers with 256 neurons each were added following the four convolutional layers. The dense layers utilize the learned spatial patterns from the convolutional layers to produce the final predicted output.

2.4 Validation metrics

The results were compared using different validation metrics. To determine the overall differences between the modelled and the mapped soil erosion rate, the root mean squared error (RMSE) and the mean absolute error (MAE) were used. As soil erosion is commonly reported in severity classes (e.g. Borrelli et al., 2018; Steinhoff-Knopp and Burkhard, 2018), we additionally evaluated classification performance using the F₁ score (harmonic mean of precision and recall) to assess the ability of the models to predict different ranges of soil erosion severity (Chinchor, 1992). For this, the continuous predicted output was categorized into six discrete classes: no erosion (0 t ha⁻¹ yr⁻¹), very low erosion (>0 to <0.25 t ha⁻¹ yr⁻¹), low erosion (0.25 to <1 t ha⁻¹ yr⁻¹), medium erosion (1 to <2 t ha⁻¹ yr⁻¹), high erosion (2 to <5 t ha⁻¹ yr⁻¹), and very high erosion (≥5 t ha⁻¹ yr⁻¹). A weighted average F₁ score was calculated to account for the imbalanced distribution of the mapped soil erosion dataset across the classes, with the majority (81.2 %) of the data falling below 1 t ha⁻¹ yr⁻¹. The resulting F₁ score ranges from 0 to 1, with values closer to 1 indicating a better overall alignment between the prediction and the different classes (Taha and Hanbury, 2015).

The importance of the predictor variables was evaluated through permutation importance, which reflects the influence of each variable on the final model output. This is done by randomly permuting one variable at a time and measuring the resulting increase in the mean squared error. A larger increase indicates that the variable has a greater impact on the predictive performance of the model. The results are normalized and expressed as percentages (Altmann et al., 2010).

2.5 Model implementation

To evaluate model predictions, we employed a leave-one-area-out validation strategy, in which one study area was excluded entirely during model training and subsequently used for validation. In each iteration, the models were trained on the remaining six areas and then used to predict soil erosion rates in the left-out area. Afterwards, the predictions were validated against the mapped erosion rates of that area. This procedure was repeated for all seven study areas, and validation metrics were calculated as weighted averages based on the relative size of the excluded area. Permutation importance values were also averaged across all seven runs. Using this validation approach enabled a comprehensive assessment of model robustness by evaluating their capacity to generalize to unseen areas with varying landscape conditions and management practices.

The predictive performance of each model depends strongly on its hyperparameters, which determine the model architecture and learning behaviour. The most important hyperparameters (e.g. number of trees or neurons) and their optimal values were described along with the respective models above (Sect. 2.3). To determine the optimal hyperparameters without any data leakage, a nested cross-validation procedure was implemented, where the left-out area was reserved exclusively for final evaluation. Hyperparameter tuning was performed on the remaining six areas using an inner leave-one-area-out procedure. This process was repeated for each study area. The final hyperparameter configuration was determined from the aggregated inner validation performance and subsequently used to retrain the model on the six training areas before evaluation on the left-out test area. To limit computational demand, hyperparameters were optimized using a random search strategy (Bergstra and Bengio, 2012; Yu and Zhu, 2020). The full search space and tuning scripts are provided in the accompanying repository.

All predictor variables were standardized and the soil loss target was log-transformed to reduce skewness and stabilize model training. Predicted values were back-transformed to the original scale for evaluation and visualization. All models were built using the scikit-learn (Pedregosa et al., 2011) and TensorFlow (Abadi et al., 2015) Python packages.

3 Results

3.1 Pairwise correlation and multicollinearity analysis

To assess relationships between predictor variables and to identify potential redundancy due to multicollinearity, the Pearson correlation coefficient (r) and the variance inflation factor (VIF) were calculated. Five pairs of variables (see Fig. B1 in Appendix B) had a strong linear correlation with Pearson coefficient close to or exceeding ±0.7: aspect 360 and slope length (r=0.69), slope and machining direction ( $r = - 0.69$ ), altitude and R factor ( $r = - 0.75$ ), topographic position index and divergence-convergence index 10 (r=0.82), divergence-convergence index and wetness index (r=0.82). However, all variables had a VIF < 5, indicating that there is not a high level of multicollinearity between the variables (see Fig. B2). Consequently, no variables were removed for the modelling process.

3.2 Model performance

Overall, the neural networks produced nearly identical error metrics when predicting soil erosion rates in previously unseen areas (Table 2). All models achieved an RMSE value of 2.2 t ha⁻¹ yr⁻¹, with corresponding 95 % confidence intervals (CI) across areas ranging from 1.2–2.6 for RF and DNN and 1.3–2.6 for SNN and CNN. The MAE ranged from 0.9 t ha⁻¹ yr⁻¹ for the neural networks to 1.0 t ha⁻¹ yr⁻¹ for the RF, with overlapping 95 % confidence intervals showing no meaningful differences among models in this regard (RF: 0.7–1.2; SNN and DNN: 0.6–1.1; CNN: 0.6–1.2).

Table 2RMSE and MAE (in t ha⁻¹ yr⁻¹) differentiated by soil erosion rate class for the random forest (RF), single-layer neural network (SNN), deep neural network (DNN), and convolutional neural network (CNN) models.

Download Print Version | Download XLSX

Table 3F₁ score evaluating the performance of the continuous model predictions (in t ha⁻¹ yr⁻¹) within the defined soil erosion rate classes, comparing random forest (RF), single-layer neural network (SNN), deep neural network (DNN), and convolutional neural network (CNN) models. The weighted average F₁ scores were computed using the number of mapped grid cells (n cells) within each class.

Download Print Version | Download XLSX

The F₁ scores shown in Table 3 illustrate some differences in model performance for low soil erosion rates by measuring how well the continuous predictions align within the defined severity classes. Although F₁ scores were generally low, the CNN achieved the highest weighted average value (0.46; 95 % CI: 0.39–0.55), followed closely by the DNN (0.44; 95 % CI: 0.38–0.55)), whereas the RF and SNN performed substantially worse in this regard, with an F₁ score of 0.14 (95 % CI: 0.09–0.17) and 0.25 (95 % CI: 0.19–0.27), respectively. The differences are largely linked to the <0.25 t ha⁻¹ yr⁻¹ class, which contained the majority of mapped grid cells. Here, the CNN and DNN achieved high F₁ scores of 0.63 and 0.61, compared with 0.12 for the RF and 0.29 for the SNN. For erosion rates equal and above 5 t ha⁻¹ yr⁻¹, all models had an F₁ score of 0.

Across all study areas, the maximum predicted soil erosion rates ranged from 2.7 t ha⁻¹ yr⁻¹ predicted by the RF to 12.1 t ha⁻¹ yr⁻¹ predicted by the CNN (Table C1 in Appendix C). The soil erosion predictions of the DNN are shown in Fig. 2. The corresponding prediction maps for all models are provided in Appendix C (Figs. C1–C8). The predictions show that all models were partially able to predict the spatial distribution of erosion features in the unseen areas. In several areas, such as Küingdorf, Lamspringe, and Adenstedt (Fig. 2a, c, f), the prediction shows spatial patterns comparable to those in the mapped data. However, the models did not consistently reproduce erosion structures in the other areas, and the severity of some high-erosion features was underestimated (e.g. Barum, Fig. 2b).

https://soil.copernicus.org/articles/12/321/2026/soil-12-321-2026-f02

Figure 2Predicted soil erosion rates using the leave-one-area-out approach by the deep neural network (DNN) in (a) Küingdorf, (b) Barum, (c) Lamspringe, (d) Klein Ilde, (e) Nette, (f) Adenstedt, and (g) Brüggen.

The difference between the predicted soil erosion rate by the DNN and mapped erosion rates is shown in Fig. 3. The visualization shows that the few high erosion features within the study areas of Barum, Lamspringe, and Klein Ilde (Fig. 2b, c, d) are strongly underestimated. In contrast, in the study areas of Küingdorf, Nette, Adenstedt, and Brüggen (Fig. 2a, e, f, g), the difference between modelled and mapped soil erosion was mostly below 5 t ha⁻¹ yr⁻¹, with no large over- and underestimations. Locations with low mapped erosion rates (<1 t ha⁻¹ yr⁻¹) were generally predicted within the same order of magnitude as the mapped values, which is supported by the F₁ scores (Table 3).

https://soil.copernicus.org/articles/12/321/2026/soil-12-321-2026-f03

Figure 3Difference between soil erosion rates predicted by the deep neural network (DNN) and the mapped values in (a) Küingdorf, (b) Barum, (c) Lamspringe, (d) Klein Ilde, (e) Nette, (f) Adenstedt, and (g) Brüggen, where positive values indicate overestimation of the model and negative values indicate underestimation.

3.3 Variable importance

The permutation importance results are shown in Fig. 4. While slope showed consistently high importance across all models, with an average contribution of 18.6 %, the ranking of several other predictors varied noticeably between models. For instance, the CNN assigned comparatively high importance to the Wetness index (20.2 %) and the RF attributed its highest importance to the LS factor (18.2 %). The SNN and DNN relied strongly on slope (≥26.5 %). Machining direction vs. aspect was the second most important variable for RF, SNN and DNN.

Several variables exhibit consistently low importance (<5 %) across all models, including aspect (360 and 180), the divergence–convergence index, flowline curvature, plan curvature, and profile curvature. These predictors contributed to a small extent, if at all, to the model performance.

https://soil.copernicus.org/articles/12/321/2026/soil-12-321-2026-f04

Figure 4Heatmap of the permutation importance of all predictor variables, shown for the overall average across models and for the random forest (RF), single-layer neural network (SNN), deep neural network (DNN), and convolutional neural network (CNN). Values represent the normalized increase in mean squared error (MSE) after permuting each variable.

Download

4 Discussion

4.1 Comparison of model performance

Large neural networks have the capacity to capture complex non-linear relationships between predictor variables and soil erosion rates through an extensive set of weights. This flexibility allows neural networks to model these relationships in greater detail. In our study, the neural network models showed comparable or slightly improved predictive performance relative to the RF. However, in terms of class-based validation metrics (Table 3), the CNN and DNN outperformed the RF and SNN predictions. In this regard, the results align with previous studies, such as Sarkar and Mishra (2018), indicating that increasing model complexity can improve predictive performance.

A large proportion (81.2 %) of the mapped data is below the threshold of 1 t ha⁻¹ yr⁻¹, and this value range therefore had a strong influence on the weighted average F₁ score. As a result of underestimating these low erosion rates, the RF achieved a lower weighted average F₁ score than the neural network based models. The neural networks, and particularly the CNN and DNN, were able to predict these low erosion rates moderately well, capturing the underlying relationships in this dominant class.

This strong class imbalance and the dominance of low soil erosion values within the dataset also affect the capability of the models to predict higher erosion rates. Additionally, while the applied log-transformation stabilises model training, it further reduces the influence of extreme erosion values. As shown in Table 3, all models fail to reproduce very high erosion rates (≥5 t ha⁻¹ yr⁻¹). Since these high values are rare, representing only 3.9 % of the overall mapped training data, they provide too few examples for the models to learn the corresponding relationships reliably, which is a common problem in soil science for prediction tasks using machine learning (Sharififar and Sarmadian, 2023). In addition, the left-out areas have different landscape structures, land management practices, and topographic patterns, leading to combinations of predictor variables that the models have not encountered during training. This distribution shift places many erosion features outside the range of conditions represented during training, making accurate prediction in previously unseen areas particularly challenging. For example, the underestimation in Barum can be explained by substantial differences in crop rotations compared to other monitoring areas, resulting in a higher C factor that was not represented in the respective training data.

As shown in the visualizations of the predictions (e.g. Figs. 2 and 3), all models were able to reproduce parts of the spatial erosion patterns in the unseen areas, although the predicted magnitudes were often underestimated. Only the neural network models predicted very high soil erosion rates, with maximum values of 6.0 and 12.1 t ha⁻¹ yr⁻¹, respectively. Both SNN and DNN produced very similar soil erosion patterns, likely due to the comparable architectures of the two modelling approaches, despite differences in their complexity. The CNN output appeared notably smoother than that of the solely pixel-based models, albeit with a lower maximum predicted erosion rate. This smoothness can be attributed to the integration of local spatial context by the convolutional layers (Fu et al., 2019; Hilburn, 2023).

While the RF, SNN, and DNN train on a pixel by pixel basis, the CNN potentially benefits from incorporating local spatial context through input patches that include neighbouring grid cells during training. However, the results show only very small differences between the evaluation metrics of all models. Although the CNN and DNN achieved the highest F₁ scores, all models exhibited nearly identical overall RMSE and MAE.

It is also important to note that the DNN, and particularly the SNN and RF models, require significantly fewer computational resources than the CNN, which can be beneficial when trained on big datasets or when the model is applied to predict soil erosion for larger areas. Therefore, depending on the specific use case, the potential trade-off between potentially improved predictive performance and the increased complexity and computational demands of large neural networks, such as the CNN should be carefully considered.

The permutation-importance analysis highlights the combined influence of topographic and anthropogenic (i.e. land management) variables on soil erosion. For example, the high importance of the predictor variable machining direction vs. aspect is likely explained by the large proportion of mapped erosion features occurring in tramlines aligned with the slope direction (Steinhoff-Knopp and Burkhard, 2018). The permutation analysis also reveals clear differences between the modelling approaches. These differences can be attributed to the respective model architectures. The RF relied on a hierarchical split structure, which caused it to prioritize variables that repeatedly improve node splits across trees, resulting in an importance pattern that differs from that of the neural networks (Breiman, 2001). The SNN and DNN relied strongly on similar predictor variables (e.g. slope and machining direction vs. aspect), indicating that their similar architectures resulted in comparable learned non-linear relationships between predictors and soil erosion (LeCun et al., 2015; Nielsen, 2015). The CNN, by applying convolutional filters over spatial neighbourhoods, prioritizes variables that contribute to local spatial patterns rather than relying solely on single-pixel attributes (LeCun et al., 2015). Consequently, the models emphasize different predictors when optimizing performance in previously unseen areas. Despite these architectural differences, a consistent result across all models is that aspect, the divergence–convergence index, flowline curvature, plan curvature, and profile curvature show very low importance and contribute little to predicting soil erosion in this study.

4.2 Limitations and future research

Typical challenges and limitations in soil erosion modelling at the landscape scale include the high range of input variables from different domains for parametrization, leading to simplification in variable estimation and mismatches in the scale of input and output data, the application of models outside of the validation range, and inherent restrictions of the used models. This study addressed several of these limitations by utilizing a dataset based on a long-term soil erosion monitoring programme across multiple study areas, capturing long-term patterns of soil erosion by water within these regions. Additionally, input variables with relatively high spatial resolution were incorporated into machine learning models capable of capturing non-linear relationships at varying levels of complexity. Despite these approaches, several challenges and limitations remain. These challenges and limitations should be considered when interpreting the results and highlight key areas for future research to enhance small-scale soil erosion predictions.

The results demonstrate that complex neural networks can partially reproduce erosion patterns when extrapolating to previously unseen areas. However, all models exhibited a strong tendency to underestimate soil erosion. This was mainly caused by the strong imbalance in the dataset, which provided too few examples for the models to learn robust relationships for high erosion rates. These findings suggest that a larger representation of high-erosion events within the same landscape may improve the learning capabilities of such extremes. However, whether this would improve spatial pattern prediction requires further testing.

The results indicate that prediction performance shows only a very slight increase with model complexity (Tables 2 and 3), and the differences between the models were small. In particular, differences were observed only for the F₁ score, where CNN and DNN achieved higher values. Whether this generally justifies the use of a more complex architecture, such as a CNN or DNN in soil erosion modelling, must be carefully evaluated based on the aim and context of the respective study. Future research could investigate this further by comparing a broader range of machine learning approaches, such as gradient boosting machines, or by evaluating ensemble methods. In addition, more recent neural network architectures such as transformers (e.g. Liu et al., 2024) may offer more effective prediction capabilities.

This study provides a useful overview of the importance of various variables in modelling soil erosion by water. Since the primary aim was to compare machine-learning approaches rather than to establish a comprehensive predictor set, some factors affecting soil erosion may not have been included or may influence soil erosion by water at spatial scales not fully represented by the selected variables. As a result, the predictions may not fully capture all relevant processes, and further research is needed to assess the impact of additional variables on soil erosion modelling. On the other hand, this study gives applicable guidance on which predictors might be omitted in soil erosion by water modelling.

5 Conclusions

By comparing different artificial neural network architectures and using a random forest model as a benchmark, this study highlights the potential of neural networks for modelling non-linear relationships between predictor variables and soil erosion at a field-to-landscape scale. Overall, all models showed very similar RMSE and MAE values, indicating comparable performance in predicting continuous soil erosion rates. The convolutional neural network achieved a marginally higher F₁ score, followed closely by the deep neural network. When applied to previously unseen areas, all models were able to partially reproduce spatial erosion patterns but tended to strongly underestimate features with high erosion rates. The permutation importance analysis further indicates the relevance of variables such as slope and machining direction vs. aspect. Furthermore, the analysis provides guidance on which predictors may be omitted in soil erosion modelling. To build upon these findings, future research should focus on further identifying and quantifying model uncertainties, as well as improving the generalizability of these models and their scalability to larger areas.

Appendix A: Data collection

(LGLN, 2024)Zevenbergen and Thorne, 1987Guisan et al., 1999Böhner et al., 2002(Fischer et al., 2019)(LBEG, 2017)Desmet and Govers (1996)Moore and Nieber, 1989Schwertmann et al. (1987)Winterrath et al. (2018)

Table A1Overview of terrain and USLE-related variables, including data sources, processing steps, and summary statistics. Grid cell size of all variables: 5 × 5 m. See column source and data processing for applied resampling methods.

^* Field block: A contiguous agricultural reference parcel bounded by stable landscape features (roads, paths, hedges, ditches, rivers, forests, etc.).

Download XLSX

Appendix B: Pairwise correlation and multicollinearity analysis

https://soil.copernicus.org/articles/12/321/2026/soil-12-321-2026-f05

Figure B1Pairwise linear correlation analysis of all predictor variables used in this study based on the Pearson correlation coefficient.

Download

https://soil.copernicus.org/articles/12/321/2026/soil-12-321-2026-f06

Figure B2Variance inflation analysis of predictor variables highlights the level of multicollinearity, with higher VIF values indicating greater multicollinearity among predictors.

Download

Appendix C: Results

C1 Raw model predictions

https://soil.copernicus.org/articles/12/321/2026/soil-12-321-2026-f07

Figure C1Predicted soil erosion rates using the leave-one-area-out approach by the random Forest (RF) in (a) Küingdorf, (b) Barum, (c) Lamspringe, (d) Klein Ilde, (e) Nette, (f) Adenstedt and (g) Brüggen.

https://soil.copernicus.org/articles/12/321/2026/soil-12-321-2026-f08

Figure C2Predicted soil erosion rates using the leave-one-area-out approach by the single-hidden layer neural network (SNN) in (a) Küingdorf, (b) Barum, (c) Lamspringe, (d) Klein Ilde, (e) Nette, (f) Adenstedt and (g) Brüggen.

https://soil.copernicus.org/articles/12/321/2026/soil-12-321-2026-f09

Figure C3Predicted soil erosion rates using the leave-one-area-out approach by the deep neural network (DNN) in (a) Küingdorf, (b) Barum, (c) Lamspringe, (d) Klein Ilde, (e) Nette, (f) Adenstedt and (g) Brüggen.

https://soil.copernicus.org/articles/12/321/2026/soil-12-321-2026-f10

Figure C4Predicted soil erosion rates using the leave-one-area-out approach by the convolutional neural network (CNN) in (a) Küingdorf, (b) Barum, (c) Lamspringe, (d) Klein Ilde, (e) Nette, (f) Adenstedt and (g) Brüggen.

C2 Differences between predicted and mapped soil erosion rates

https://soil.copernicus.org/articles/12/321/2026/soil-12-321-2026-f11

Figure C5Difference between soil erosion rates predicted by the random forest (RF) and the mapped values in (a) Küingdorf, (b) Barum, (c) Lamspringe, (d) Klein Ilde, (e) Nette, (f) Adenstedt, and (g) Brüggen, where positive values indicate overestimation of the model and negative values indicate underestimation.

https://soil.copernicus.org/articles/12/321/2026/soil-12-321-2026-f12

Figure C6Difference between soil erosion rates predicted by the single-hidden layer neural network (SNN) and the mapped values in (a) Küingdorf, (b) Barum, (c) Lamspringe, (d) Klein Ilde, (e) Nette, (f) Adenstedt, and (g) Brüggen, where positive values indicate overestimation of the model and negative values indicate underestimation.

https://soil.copernicus.org/articles/12/321/2026/soil-12-321-2026-f13

Figure C7Difference between soil erosion rates predicted by the deep neural network (DNN) and the mapped values in (a) Küingdorf, (b) Barum, (c) Lamspringe, (d) Klein Ilde, (e) Nette, (f) Adenstedt, and (g) Brüggen, where positive values indicate overestimation of the model and negative values indicate underestimation.

https://soil.copernicus.org/articles/12/321/2026/soil-12-321-2026-f14

Figure C8Difference between soil erosion rates predicted by the convolutional neural network (CNN) and the mapped values in (a) Küingdorf, (b) Barum, (c) Lamspringe, (d) Klein Ilde, (e) Nette, (f) Adenstedt, and (g) Brüggen, where positive values indicate overestimation of the model and negative values indicate underestimation.

Table C1Per-study-area summary statistics of mapped soil erosion rates and model predictions (in t ha⁻¹ yr⁻¹) from the random forest (RF), single-layer neural network (SNN), deep neural network (DNN), and convolutional neural network (CNN).

Download Print Version | Download XLSX

Code and data availability

The complete dataset and Python scripts used for model training, prediction, and evaluation are available at: https://doi.org/10.5281/zenodo.16032628 (Barthel, 2026).

Author contributions

NB: conceptualization, methodology, investigation, visualization, writing (original draft), writing (review and editing). SO: conceptualization, data curation, writing (review and editing). BB: conceptualization, project administration, funding acquisition, writing (review and editing). BSK: conceptualization, data curation, visualization, writing (review and editing).

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

We would also like to acknowledge the many years of work by all those who have contributed to the creation of this long-term dataset through fieldwork since 2000. In particular, we thank Frank Beisiegel and Heiko van Wensen for their continuous monitoring efforts since the beginning of the programme. We thank Angie Faust for proofreading this work.

Financial support

The research and collection of the associated data were made possible through the continuous funding provided by the Lower Saxony State Authority for Mining, Energy and Geology (LBEG; grant no. 2023-10-18/00456).

The publication of this article was funded by the open-access fund of Leibniz Universität Hannover.

Review statement

This paper was edited by Pedro Batista and reviewed by two anonymous referees.

References

Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G. S., Davis, A., Dean, J., Devin, M., Ghemawat, S., Goodfellow, I., Harp, A., Irving, G., Isard, M., Jia, Y., Jozefowicz, R., Kaiser, L., Kudlur, M., Levenberg, J., Mané, D., Monga, R., Moore, S., Murray, D., Olah, C., Schuster, M., Shlens, J., Steiner, B., Sutskever, I., Talwar, K., Tucker, P., Vanhoucke, V., Vasudevan, V., Viégas, F., Vinyals, O., Warden, P., Wattenberg, M., Wicke, M., Yu, Y., and Zheng, X.: TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems, https://www.tensorflow.org/ (last access: 4 March 2026), 2015. a

Alewell, C., Borrelli, P., Meusburger, K., and Panagos, P.: Using the USLE: Chances, challenges and limitations of soil erosion modelling, International Soil and Water Conservation Research, 7, 203–225, https://doi.org/10.1016/j.iswcr.2019.05.004, 2019. a

Altmann, A., Toloşi, L., Sander, O., and Lengauer, T.: Permutation importance: a corrected feature importance measure, Bioinformatics, 26, 1340–1347, https://doi.org/10.1093/bioinformatics/btq134, 2010. a

Anache, J. A., Flanagan, D. C., Srivastava, A., and Wendland, E. C.: Land use and climate change impacts on runoff and soil erosion at the hillslope scale in the Brazilian Cerrado, Sci. Total Environ., 622, 140–151, https://doi.org/10.1016/j.scitotenv.2017.11.257, 2018. a

Avand, M., Mohammadi, M., Mirchooli, F., Kavian, A., and Tiefenbacher, J. P.: A new approach for smart soil erosion modeling: integration of empirical and machine-learning models, Environ. Model. Assess., 28, 145–160, https://doi.org/10.1007/s10666-022-09858-x, 2023. a, b, c

Barthel, N.: Soil erosion rate modelling with machine learning v2.2.0, Zenodo [code, data set], https://doi.org/10.5281/zenodo.16032628, 2026. a

Batista, P. V., Möller, M., Schmidt, K., Waldau, T., Seufferheld, K., Htitiou, A., Golla, B., Ebertseder, F., Auerswald, K., and Fiener, P.: Soil-erosion events on arable land are nowcast by machine learning, Catena, 256, 109080, https://doi.org/10.1016/j.catena.2025.109080, 2025. a

Bergstra, J., Bengio, Y.: Random search for hyper-parameter optimization, J. Mach. Learn. Res., 13, 281–305, http://jmlr.org/papers/v13/bergstra12a.html (last access: 19 March 2026), 2012. a

Böhner, J., Köthe, R., Conrad, O., Gross, J., Ringeler, A., and Selige, T.: Soil regionalisation by means of terrain analysis and process parameterisation, in: Soil Classification 2001, edited by: Micheli, E., Nachtergaele, F., and Montanarella, L., no. 7, European Soil Bureau Research Report, 213–222, Office for Official Publications of the European Communities, Luxembourg, eUR 20398 EN, https://esdac.jrc.ec.europa.eu/ESDB_Archive/eusoils_docs/esb_rr/n07_ESBResRep07/601Bohner.pdf (last access: 4 March 2026), 2002. a

Borrelli, P., Van Oost, K., Meusburger, K., Alewell, C., Lugato, E., and Panagos, P.: A step towards a holistic assessment of soil degradation in Europe: Coupling on-site erosion with sediment transfer and carbon fluxes, Environ. Res., 161, 291–298, https://doi.org/10.1016/j.envres.2017.11.009, 2018. a, b

Borrelli, P., Alewell, C., Alvarez, P., Anache, J. A. A., Baartman, J., Ballabio, C., Bezak, N., Biddoccu, M., Cerdà, A., Chalise, D., Chen, S., Chen, W., De Girolamo, A. M., Gessesse, G. D., Deumlich, D., Diodato, N., Efthimiou, N., Erpul, G., Fiener, P., Freppaz, M., Gentile, F., Gericke, A., Haregeweyn, N., Hu, B., Jeanneau, A., Kaffas, K., Kiani-Harchegani, M., Lizaga Villuendas, I., Li, C., Lombardo, L., López-Vicente, M., Lucas-Borja, M. E., Märker, M., Matthews, F., Miao, C., Mikoš, M., Modugno, S., Möller, M., Naipal, V., Nearing, M., Owusu, S., Panday, D., Patault, E., Patriche, C. V., Poggio, L., Portes, R., Quijano, L., Rahdari, M. R., Renima, M., Ricci, G. F., Rodrigo-Comino, J., Saia, S., Nazari Samani, A., Schillaci, C., Syrris, V., Kim, H. S., Spinola, D. N., Oliveira, P. T., Teng, H., Thapa, R., Vantas, K., Vieira, D., Yang, J. E., Yin, S., Zema, D. A., Zhao, G., and Panagos, P.: Soil erosion modelling: A global review and statistical analysis, Sci. Total Environ., 780, https://doi.org/10.1016/j.scitotenv.2021.146494, 2021. a, b, c

Botschek, J., Billen, N., Brandhuber, R., Bug, J., Deumlich, D., Duttmann, R., Elhaus, D., Mollenhauer, K., Prasuhn, V., Röder, C., Schäfer, W., Unterseher, E., Wurbs, D., and Thiermann, A.: Bodenerosion durch Wasser–Kartieranleitung zur Erfassung aktueller Erosionsformen, DWA-Regelwerk, Merkblatt DWA-M, 921, Deutsche Vereinigung für Wasserwirtschaft, Abwasser und Abfall e. V. (DWA), urn:nbn:de:101:1-2021042311222349076815, 2021. a

Breiman, L.: Random forests, Mach. Learn., 45, 5–32, https://doi.org/10.1023/A:1010933404324, 2001. a, b

Capelle, A.: Die erosionsgefährdete Landesfläche in Niedersachsen und Bremen, Z. Kult. Landentwickl., 31, 11–17, 1990. a

Capelle, A. and Lüders, R.: Die potentielle Erosionsgefährdung der Böden in Niedersachsen, Göttinger Bodenkundliche Berichte, 83, 107–127, 1985. a

Chinchor, N.: MUC-4 Evaluation Metrics, in: Fourth Message Understanding Conference (MUC-4), Proceedings of a Conference Held in McLean, Virginia, 16–18 June 1992, https://aclanthology.org/M92-1002/ (last access: 19 March 2026), 1992. a

Daoud, J. I.: Multicollinearity and regression analysis, J. Phys. Conf. Ser. 949, 012009, https://doi.org/10.1088/1742-6596/949/1/012009, 2017. a

De la Rosa, D., Mayol, F., Moreno, J., Bonsón, T., and Lozano, S.: An expert system/neural network model (ImpelERO) for evaluating agricultural soil erosion in Andalucia region, southern Spain, Agriculture, Ecosyst. Environ., 73, 211–226, https://doi.org/10.1016/S0167-8809(99)00050-X, 1999. a

Desmet, P. J. and Govers, G.: A GIS procedure for automatically calculating the USLE LS factor on topographically complex landscape units, J. Soil Water Conserv., 51, 427–433, https://doi.org/10.1080/00224561.1996.12457102, 1996. a

DIN (Deutsches Institut für Normung): Bodenbeschaffenheit – Ermittlung der Erosionsgefährdung von Böden durch Wasser mit Hilfe der ABAG, Tech. rep., Beuth Verlag GmbH, Berlin, DIN 19708:2005-02, 2005. a

DVWK: Bodenerosion durch Wasser: DVWK-Merkblatt 239: Kartieranleitung zur Erfassung aktueller Erosionsformen, Wirtschafts- und Verl.-Ges. Gas und Wasser, Bonn, 1996. a

Ebrahimi-Khusfi, Z., Nafarzadegan, A. R., and Dargahian, F.: Predicting the number of dusty days around the desert wetlands in southeastern Iran using feature selection and machine learning techniques, Ecol. Indic., 125, 107499, https://doi.org/10.1016/j.ecolind.2021.107499, 2021. a

Fiener, P., Dostál, T., Krása, J., Schmaltz, E., Strauss, P., and Wilken, F.: Operational USLE-based modelling of soil erosion in Czech Republic, Austria, and Bavaria – Differences in model adaptation, parametrization, and data availability, Appl. Sci., 10, 3647, https://doi.org/10.3390/app10103647, 2020. a

Fischer, F., Winterrath, T., Junghänel, T., Walawender, E., and Auerswald, K.: Mean annual precipitation erosivity (R factor) based on RADKLIM Version 2017.002, Deutscher Wetterdienst (DWD), Offenbach/Main, Germany, https://doi.org/10.5676/DWD/RADKLIM_Rfct_V2017.002, 2019. a

Fu, B., Zhao, X., Li, Y., Wang, X., and Ren, Y.: A convolutional neural networks denoising approach for salt and pepper noise, Multimed. Tools Appl., 78, 30707–30721, https://doi.org/10.1007/s11042-018-6521-4, 2019. a

Garosi, Y., Sheklabadi, M., Conoscenti, C., Pourghasemi, H. R., and Van Oost, K.: Assessing the performance of GIS-based machine learning models with different accuracy measures for determining susceptibility to gully erosion, Sci. Total Environ., 664, 1117–1132, https://doi.org/10.1016/j.scitotenv.2019.02.093, 2019. a, b

Gholami, V., Sahour, H., and Amri, M. A. H.: Soil erosion modeling using erosion pins and artificial neural networks, Catena, 196, 104902, https://doi.org/10.1016/j.catena.2020.104902, 2021. a

Ghorbanzadeh, O., Shahabi, H., Mirchooli, F., Valizadeh Kamran, K., Lim, S., Aryal, J., Jarihani, B., and Blaschke, T.: Gully erosion susceptibility mapping (GESM) using machine learning methods optimized by the multi-collinearity analysis and K-fold cross-validation, Geomat. Nat. Haz. Risk, 11, 1653–1678, https://doi.org/10.1080/19475705.2020.1810138, 2020. a

Ghosh, A. and Maiti, R.: Soil erosion susceptibility assessment using logistic regression, decision tree and random forest: study on the Mayurakshi river basin of Eastern India, Environ. Earth Sci., 80, 328, https://doi.org/10.1007/s12665-021-09631-5, 2021. a, b

Golkarian, A., Khosravi, K., Panahi, M., and Clague, J. J.: Spatial variability of soil water erosion: Comparing empirical and intelligent techniques, Geosci. Front., 14, 101456, https://doi.org/10.1016/j.gsf.2022.101456, 2023. a

Guerra, C. A., Rosa, I. M., Valentini, E., Wolf, F., Filipponi, F., Karger, D. N., Nguyen Xuan, A., Mathieu, J., Lavelle, P., and Eisenhauer, N.: Global vulnerability of soil ecosystems to erosion, Landscape Ecol., 35, 823–842, https://doi.org/10.1007/s10980-020-00984-z, 2020. a

Guisan, A., Weiss, S. B., and Weiss, A. D.: GLM versus CCA spatial modeling of plant species distribution, Plant Ecol., 143, 107–122, https://doi.org/10.1023/A:1009841519580, 1999. a

Hilburn, K. A.: Understanding spatial context in convolutional neural networks using explainable methods: Application to interpretable GREMLIN, Artificial Intelligence for the Earth Systems, 2, 220093, https://doi.org/10.1175/AIES-D-22-0093.1, 2023. a

Hoerl, A. E. and Kennard, R. W.: Ridge regression: Biased estimation for nonorthogonal problems, Technometrics, 12, 55–67, 1970. a

Igwe, P. U., Onuigbo, A. A., Chinedu, O. C., Ezeaku, I. I., and Muoneke, M. M.: Soil erosion: A review of models and applications, International Journal of Advanced Engineering Research and Science, 4, 237341, https://doi.org/10.22161/ijaers.4.12.22, 2017. a

Issaka, S. and Ashraf, M. A.: Impact of soil erosion and degradation on water quality: a review, Geology, Ecology, and Landscapes, 1, 1–11, https://doi.org/10.1080/24749508.2017.1301053, 2017. a

Jaafari, A., Janizadeh, S., Abdo, H. G., Mafi-Gholami, D., and Adeli, B.: Understanding land degradation induced by gully erosion from the perspective of different geoenvironmental factors, J. Environ. Manage., 315, 115181, https://doi.org/10.1016/j.jenvman.2022.115181, 2022. a, b, c

Khosravi, K., Rezaie, F., Cooper, J. R., Kalantari, Z., Abolfathi, S., and Hatamiafkoueieh, J.: Soil water erosion susceptibility assessment using deep learning algorithms, J. Hydrol., 618, 129229, https://doi.org/10.1016/j.jhydrol.2023.129229, 2023. a

Kingma, D. P. and Ba, J.: Adam: A Method for Stochastic Optimization, arXiv [preprint], https://doi.org/10.48550/arxiv.1412.6980, 2014. a

Kingsford, C. and Salzberg, S. L.: What are decision trees?, Nat. Biotechnol., 26, 1011–1013, https://doi.org/10.1038/nbt0908-1011, 2008. a

Krizhevsky, A., Sutskever, I., and Hinton, G. E.: ImageNet classification with deep convolutional neural networks, Commun. ACM, 60, 84–90, https://doi.org/10.1145/3065386, 2017. a, b

Kumar, M., Sahu, A. P., Sahoo, N., Dash, S. S., Raul, S. K., and Panigrahi, B.: Global-scale application of the RUSLE model: a comprehensive review, Hydrolog. Sci. J., 67, 806–830, https://doi.org/10.1080/02626667.2021.2020277, 2022. a

LBEG: Bodenkarte von Niedersachsen 1 : 50 000 (BK50): Blattschnittfreie Vektordaten, Geodataset, Landesamt für Bergbau, Energie und Geologie (LBEG), https://nibis.lbeg.de/geonetwork/srv/api/records/611135b8-7168-4960-ad9d-3103ee96dcc6 (last access: 4 March 2026), 2017. a, b

LeCun, Y., Bengio, Y., and Hinton, G.: Deep learning, Nature, 521, 436–444, https://doi.org/10.1038/nature14539, 2015. a, b, c, d, e

LGLN: Digital Elevation Model (DEM1), Landesamt für Geoinformation und Landesvermessung Niedersachsen (LGLN), licensed under CC BY 4.0, https://arcg.is/aH4Cy (last access: 4 March 2026), 2024. a

Licznar, P. and Nearing, M.: Artificial neural networks of soil erosion and runoff prediction at the plot scale, Catena, 51, 89–114, https://doi.org/10.1016/S0341-8162(02)00147-9, 2003. a

Liu, C., Fan, H., and Wang, Y.: Gully erosion susceptibility assessment using three machine learning models in the black soil region of Northeast China, Catena, 245, 108275, https://doi.org/10.1016/j.catena.2024.108275, 2024. a

Moore, I. D. and Nieber, J. L.: Landscape assessment of soil erosion and nonpoint source pollution, Journal of the Minnesota Academy of Science, 55, 18–25, 1989. a

Morgan, R., Quinton, J., Smith, R., Govers, G., Poesen, J., Auerswald, K., Chisci, G., Torri, D., and Styczen, M.: The European Soil Erosion Model (EUROSEM): a dynamic approach for predicting sediment transport from fields and small catchments, Earth Surf. Proc. Land., 23, 527–544, https://doi.org/10.1002/(SICI)1096-9837(199806)23:6<527::AID-ESP868>3.0.CO;2-5, 1998. a

Nair, V. and Hinton, G. E.: Rectified linear units improve restricted boltzmann machines, Proceedings of the 27th international conference on machine learning (ICML-10), Omnipress, 807–814, https://www.cs.toronto.edu/~hinton/absps/reluICML.pdf (last access: 19 March 2026), 2010. a

Nearing, M. A., Foster, G. R., Lane, L., and Finkner, S.: A process-based soil erosion model for USDA-Water Erosion Prediction Project technology, T. ASAE, 32, 1587–1593, 1989. a

Ng, A. Y.: Feature selection, L₁ vs. L₂ regularization, and rotational invariance, Proceedings of the twenty-first international conference on Machine learning, ACM (Association for Computing Machinery), p. 78, https://doi.org/10.1145/1015330.1015435, 2004. a

Nielsen, M. A.: Neural networks and deep learning, Vol. 25, Determination press San Francisco, CA, USA, 2015. a

O'Brien, R. M.: A caution regarding rules of thumb for variance inflation factors, Qual. Quant., 41, 673–690, https://doi.org/10.1007/s11135-006-9018-6, 2007. a

Panagos, P., Ballabio, C., Himics, M., Scarpa, S., Matthews, F., Bogonos, M., Poesen, J., and Borrelli, P.: Projections of soil loss by water erosion in Europe by 2050, Environ. Sci. Policy, 124, 380–392, https://doi.org/10.1016/j.envsci.2021.07.012, 2021. a

Parsons, A. J.: How reliable are our methods for estimating soil erosion by water?, Sci. Total Environ., 676, 215–221, https://doi.org/10.1016/j.scitotenv.2019.04.307, 2019. a

Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E.: Scikit-learn: Machine Learning in Python, J. Mach. Learn. Res., 12, 2825–2830, 2011. a

Pieri, L., Bittelli, M., Wu, J. Q., Dun, S., Flanagan, D. C., Pisa, P. R., Ventura, F., and Salvatorelli, F.: Using the Water Erosion Prediction Project (WEPP) model to simulate field-observed runoff and erosion in the Apennines mountain range, Italy, J. Hydrol., 336, 84–97, https://doi.org/10.1016/j.jhydrol.2006.12.014, 2007. a

Plambeck, N. O.: Reassessment of the potential risk of soil erosion by water on agricultural land in Germany: Setting the stage for site-appropriate decision-making in soil and water resources management, Ecol. Indic., 118, 106732, https://doi.org/10.1016/j.ecolind.2020.106732, 2020. a

Renard, K. G., Foster, G. R., Weesies, G. A., McCool, D. K., and Yoder, D. C.: Predicting Soil Erosion by Water: A Guide to Conservation Planning with the Revised Universal Soil Loss Equation (RUSLE), U.S. Department of Agriculture, Agricultural Research Service, https://www.tucson.ars.ag.gov/unit/publications/PDFfiles/717.pdf (last access: 19 March 2026), 1997. a

Rohr, W., Mosimann, T., Bono, R., Rüttimann, M., and Prasuhn, V.: Kartieranleitung zur Aufnahme von Bodenerosionsformen und-schäden auf Ackerflächen, Legende, Erläuterungen zur Kartiertechnik, Schadensdokumentation und Fehlerabschätzung, Materialien zur Physiogeographie, 14, Geographisches Institut der Universität Basel, https://eth.swisscovery.slsp.ch/permalink/41SLSP_ETH/1puep7t/alma990005539670205503 (last access: 19 March 2026), 1990. a

Rumelhart, D. E., Hinton, G. E., and Williams, R. J.: Learning representations by back-propagating errors, Nature, 323, 533–536, https://doi.org/10.1038/323533a0, 1986. a

Saha, S., Sarkar, R., Thapa, G., and Roy, J.: Modeling gully erosion susceptibility in Phuentsholing, Bhutan using deep learning and basic machine learning algorithms, Environ. Earth Sci., 80, 295, https://doi.org/10.1007/s12665-021-09599-2, 2021. a

Sahour, H., Gholami, V., Vazifedan, M., and Saeedi, S.: Machine learning applications for water-induced soil erosion modeling and mapping, Soil Till. Res., 211, 105032, https://doi.org/10.1016/j.still.2021.105032, 2021. a

Sarkar, T. and Mishra, M.: Soil erosion susceptibility mapping with the application of logistic regression and artificial neural network, Journal of Geovisualization and Spatial Analysis, 2, 8, https://doi.org/10.1007/s41651-018-0015-9, 2018. a, b

Schmidt, J., Werner, M., and Michael, A.: Application of the EROSION 3D model to the CATSOP watershed, The Netherlands, Catena, 37, 449–456, https://doi.org/10.1016/S0341-8162(99)00032-6, 1999. a

Schober, P., Boer, C., and Schwarte, L. A.: Correlation coefficients: appropriate use and interpretation, Anesth. Analg., 126, 1763–1768, https://doi.org/10.1213/ANE.0000000000002864, 2018. a

Schwertmann, U., Vogl, W., and Kainz, M.: Bodenerosion durch wasser, Ulmer Verlag, 64 pp., ISBN 3800130882, 1987. a, b

Sharififar, A. and Sarmadian, F.: Coping with imbalanced data problem in digital mapping of soil classes, Eur. J. Soil Sci., 74, e13368, https://doi.org/10.1111/ejss.13368, 2023. a

Steinhoff, B., Bug, J., and Mosimann, T.: Einsatz eines mobilen GIS zur Kartierung von Bodenerosion durch Wasser, Neue Horizonte für Geodateninfrastrukturen – Open GeoData, Mobility D, 3, 27–32, 2013. a

Steinhoff-Knopp, B. and Burkhard, B.: Soil erosion by water in Northern Germany: long-term monitoring results from Lower Saxony, Catena, 165, 299–309, https://doi.org/10.1016/j.catena.2018.02.017, 2018. a, b, c, d, e

Taha, A. A. and Hanbury, A.: Metrics for evaluating 3D medical image segmentation: analysis, selection, and tool, BMC Med. Imaging, 15, 1–28, https://doi.org/10.1186/s12880-015-0068-x, 2015. a

Winterrath, T., Brendel, C., Hafer, M., Junghänel, T., Klameth, A., Lengfeld, K., Walawender, E., Weigl, E., and Becker, A.: Radar-based gauge-adjusted one-hour precipitation sum climatology Version 2017.002: Gridded Precipitation Data for Germany (v2017.02), Deutscher Wetterdienst (DWD) [data set], https://doi.org/10.5676/DWD/RADKLIM_RW_V2017.002, 2018. a

Wischmeier, W. H. and Smith, D. D.: Predicting Rainfall Erosion Losses: A Guide to Conservation Planning, U.S. Department of Agriculture, Science and Education Administration, https://www.ars.usda.gov/ARSUserFiles/60600505/RUSLE/AH_537%20Predicting%20Rainfall%20Soil%20Losses.pdf (last access: 20 March 2026), 1978. a

Wythoff, B. J.: Backpropagation neural networks: a tutorial, Chemometr. Intell. Lab., 18, 115–155, https://doi.org/10.1016/0169-7439(93)80052-J, 1993. a

Yu, T. and Zhu, H.: Hyper-parameter optimization: A review of algorithms and applications, arXiv [preprint], https://doi.org/10.48550/arXiv.2003.05689, 2020. a

Zevenbergen, L. W. and Thorne, C. R.: Quantitative analysis of land surface topography, Earth Surf. Proc. Land., 12, 47–56, https://doi.org/10.1002/esp.3290120107, 1987. a

Articles

Short summary

This study compares neural networks and a random forest model for predicting soil erosion in agricultural cropland using long-term data from northern Germany. All models captured general erosion patterns, while more complex neural networks slightly improved the distinction between soil loss classes. A permutation importance analysis identified slope and machine direction vs. aspect as the most influential predictors across all models.