Game theory interpretation of digital soil mapping convolutional neural networks

Padarian, José; McBratney, Alex B.; Minasny, Budiman

doi:https://doi.org/10.5194/soil-6-389-2020

Articles | Volume 6, issue 2

https://doi.org/10.5194/soil-6-389-2020

Articles | Volume 6, issue 2

Original research article

18 Aug 2020

Original research article |

| 18 Aug 2020

Game theory interpretation of digital soil mapping convolutional neural networks

José Padarian, Alex B. McBratney, and Budiman Minasny

Abstract

The use of complex models such as deep neural networks has yielded large improvements in predictive tasks in many fields including digital soil mapping. One of the concerns about using these models is that they are perceived as black boxes with low interpretability. In this paper we introduce the use of game theory, specifically Shapley additive explanations (SHAP) values, in order to interpret a digital soil mapping model. SHAP values represent the contribution of a covariate to the final model predictions. We applied this method to a multi-task convolutional neural network trained to predict soil organic carbon in Chile. The results show the contribution of each covariate to the model predictions in three different contexts: (a) at a local level, showing the contribution of the various covariates for a single prediction; (b) a global understanding of the covariate contribution; and (c) a spatial interpretation of their contributions. The latter constitutes a novel application of SHAP values and also the first detailed analysis of a model in a spatial context. The analysis of a SOC (soil organic carbon) model in Chile corroborated that the model is capturing sensible relationships between SOC and rainfall, temperature, elevation, slope, and topographic wetness index. The results agree with commonly reported relationships, highlighting environmental thresholds that coincide with significant areas within the study area. This contribution addresses the limitations of the current interpretation of models in digital soil mapping, especially in a spatial context. We believe that SHAP values are a valuable tool that should be included within the DSM (digital soil mapping) framework, since they address the important concerns regarding the interpretability of more complex models. The model interpretation is a crucial step that could lead to generating new knowledge to improve our understanding of soils.

Download & links

Article (PDF, 3772 KB)

Download & links

How to cite.

Received: 19 Mar 2020 – Discussion started: 30 Mar 2020 – Revised: 24 Jun 2020 – Accepted: 01 Jul 2020 – Published: 18 Aug 2020

1 Introduction

The use of statistical and machine learning (ML) methods to model the distribution of soil classes or properties with environmental factors is a core component of digital soil mapping (DSM). Since the processes driving pedogenesis are generally complex, there is a general trend in DSM studies to increase the complexity of the models. More advanced models such as tree-like models and neural networks tend to deal better with the complex non-linearities present in the data, usually outperforming more traditional methods such as generalised linear models (Lamichhane et al., 2019; Padarian et al., 2020). By increasing the complexity of the modelling framework, researchers are able to reduce the need to over-simplify the target process. With that in mind, we introduced the use of a multi-task convolutional neural network (Padarian et al., 2019), which uses a window of pixels around a punctual soil observation as input instead of the single pixel intercepting its location, in order to better capture its spatial context. In addition, thanks to its multi-task design, the model is capable of predicting multiple soil properties simultaneously, taking into account the interaction between them. The introduction of these two novel features yielded an error decrease of around 30 % compared with a more conventional ML model (Cubist tree model).

Complex models such as deep convolutional neural networks (CNNs) have been demonstrated to excel in predictive tasks in many fields of science (Anwar et al., 2018; Nash et al., 2018; Shen, 2018; Webb, 2018), but they have raised concerns about their interpretability and potential biases, especially since these type of algorithms are starting to be applied to assist decision-making in domains with high human impact (Dressel and Farid, 2018). Some of these concerns could be tackled by the General Data Protection Regulation, introduced by the European Union in 2016, which includes policies that give users the “right to explanation” when an algorithmic decision that affects them has been made (Goodman and Flaxman, 2017). To address the interpretability issue, the ML community has responded by focussing part of its research on improving this aspect of ML models, and many advances have been made more recently (Doshi-Velez and Kim, 2017).

In this paper, we explore one approach, namely game theory, to interpret the CNN model for soil organic carbon (SOC) prediction in Chile reported in Padarian et al. (2019). Our main objective is to corroborate that the model is capturing sensible relationships between the target soil property and the covariates used to train the model. First, we give the rationale behind using game theory to explain a soil predictive model, and we explain the main measure used to perform this task. Second, we introduce the CNN model and the data used to train it. Then, we describe the different levels of interpretability that we explore, from local (i.e. for a particular observation) to global explanations. Finally, we present and discuss the results obtained, exploring the effect of the different predictors, their interactions, and the correspondence to known geographical patterns in the study area.

2 Game theory and SHAP (Shapley additive explanation) values

From a game theory perspective, a modelling exercise may be rationalised as the superposition of multiple collaborative games where, in each game, agents (explanatory variables) strategically interact to achieve a goal – making a prediction for a single observation. As a result of this collaboration, the agents receive a “payout” proportionate to their contribution. In this context, the total gain (or loss) resulting from the collaboration is the deviation of the prediction from the mean of the predictions for the complete dataset (expected value). Game theory is the mathematical study of such “games” and the interactions and strategies between the involved agents (Nash, 1950; Rasmusen, 1989).

One method to estimate the expected marginal contribution of a covariate among all possible combinations is using Shapley values (Shapley, 1953). To compute these values, it is required to fit a model f_S∪{i} including that covariate i and another model f_S withholding it. The difference between both models' predictions on the input x is the marginal contribution of covariate i. When using more than one covariate, that contribution will depend on the interaction with the rest of the covariates; hence this procedure should be repeated for the complete power set of the covariates (all possible subsets S⊆F including the empty set and the set F of all the covariates). The final covariate contribution ϕ_i∈ℝ is the weighted average of all marginal contributions:

\begin{matrix} (1) & ϕ_{i} = \sum_{S \subseteq F ∖ {i}} \frac{| S |! (| F | - | S | - 1)!}{| F |!} [f_{S \cup {i}} (x_{S \cup {i}}) - f_{S} (x_{S})] . \end{matrix}

In this paper we use a modification of the original method proposed by Shapley (1953). Since the original method requires re-training $2^{| F |}$ models, it quickly becomes prohibitive to apply on complex models and large datasets. By using techniques such as sampling approximations to Eq. (1) and approximating the effect of removing covariates by using other samples from the training dataset, Lundberg and Lee (2017) developed a more efficient method (Shapley additive explanations; SHAP) to estimate the contribution of covariates preserving the properties of Shapley values. The SHAP values have been used to describe models developed in many fields, including medicine (Lundberg et al., 2018), engineering (Parsa et al., 2020), and finances (Mokhtari et al., 2019), showing a much stronger agreement with human explanations compared to alternative methods (Lundberg and Lee, 2017). Lundberg and Lee (2017) also adapted their method to be used on deep neural networks, which is what we used in this paper to interpret our CNN model. “Deep SHAP” solves the SHAP values of each component of the deep neural network by using linear approximations which are then aggregated by using a composition rule that enables the efficient approximation of SHAP values for the whole model.

https://soil.copernicus.org/articles/6/389/2020/soil-6-389-2020-f01

Figure 1Architecture of the multi-task network. “Shared layers” represent the layers shared by all the depth ranges. Each branch, one per depth range, first flattens the information to a 1D array and is followed by a series of two fully connected layers and a fully connected layer of size = 1, which corresponds to the final prediction.

Download

The method is based on the concept that, in order to explain a complex model f, it is necessary to use an explanation model g simpler than the original one. This simplification also includes the use of simplified inputs x^′ that map to the original inputs x through a mapping function $x = h_{x} (x^{'})$ , which is specific to x. Given a different set of inputs $z^{'} \approx x^{'}$ , the method ensures that $g (z^{'}) \approx f (h_{x} (z^{'}))$ . In order to estimate the effect of varying the input x, z^′ is obtained from reference values sampled from other observations from the training dataset. Then, the effect ϕ_i to each feature (Eq. 1) becomes part of a linear function of binary variables (i.e. explainable variables have two possible states – available or not):

\begin{matrix} (2) & g (z^{'}) = ϕ_{0} + \sum_{i = 1}^{M} ϕ_{i} z_{i}^{'}, \end{matrix}

where $z^{'} \in {\{0, 1\}}^{M}$ and M is the number of simplified inputs features. By summing all the feature attributions $ϕ_{i} z_{i}^{'}$ , including the attribution ϕ₀ of the empty set (i.e. mean of the predictions for the complete dataset), it is possible to approximate the prediction f(x) in what constitutes the additive feature attribution explanation model, which is on what many current explanation methods are based (Lundberg and Lee, 2017).

3 Methods

3.1 Model and data description

The national-scale model was trained using a dataset of 485 soil profiles from Chile. As covariates, we used (a) a digital elevation model (HydroSHEDS – Hydrological data and maps based on SHuttle Elevation Derivatives at multiple Scales; Lehner et al., 2008), which is provided at 3 arcsec resolution, in addition to its derived slope and topographic wetness index, calculated using SAGA (System for Automated Geoscientific Analyses Conrad et al., 2015), and (b) long-term mean annual temperature (MAT) and total annual rainfall (TAP) derived from information provided by WorldClim (Hijmans et al., 2005), at 30 arcsec resolution. All data layers were standardised to a 100 m grid size. For more details about the data and data preparation, including depth harmonisation and data augmentation, we refer the reader to Padarian et al. (2017) and Padarian et al. (2019).

The model that we explore in this paper corresponds to a multi-task CNN (Fig. 1), trained to simultaneously predict SOC content of Chile at five depth intervals (0–5, 5–15, 15–30, 30–60, and 60–100 cm). The input of the network are 3D arrays with shape (29, 29, 5), which represent a 29 pixel × 29 pixel window around each observation, for the five covariates used to explain the spatial distribution of SOC. This model is capable of taking the pixel values (3D array) and condense this contextual information as it moves to deeper layers of the network. The different branches of the network allow the model to generate different rules for the different depth intervals but also take into account all the outputs to regularise (guide) the predictions. By adding information surrounding an observation and by using information of all the depth intervals simultaneously, the model showed a SOC prediction error of around 2.5 % (RMSE of the 0–5 cm depth interval) compared to a 3.6 % obtained with a traditional Cubist model (around 30 % error reduction throughout the profile). Details about hyperparameter search and model training can be found in Padarian et al. (2019).

3.2 Applying SHAP to a CNN soil model

https://soil.copernicus.org/articles/6/389/2020/soil-6-389-2020-f02

Figure 2Spatial distribution of SHAP values for each covariate in two different samples (3D arrays used as inputs).

To estimate the SHAP values, we used the “shap” Python library (https://github.com/slundberg/shap, last access: 14 August 2020), using as background data (data to perform the approximations) the 436 samples corresponding to the CNN training dataset. In order to corroborate that the model is capturing sensible relationships between SOC and the different covariates, we illustrate the application of SHAP in four specific tasks, described below. All the figures correspond to the 0–5 cm depth range, unless stated otherwise.

Single-sample (spatial).: Given the nature of the CNN input (a 29 pixel × 29 pixel context window), the first application of SHAP focussed on the contribution of each pixel to the final prediction. The obtained SHAP values should highlight influential areas in the landscape and the local influence of each covariate. It is important to remember that the covariates where resampled to a 100 m grid; hence the detailed, per-pixel contribution of TAP and MAT should be considered an artefact, and it is probably better to only consider their aggregated value.
Single-sample.: Since SHAP values have an additive property (Lundberg and Lee, 2017), the per-pixel contributions can be aggregated into a single SHAP value per covariate. This aggregation provided a more intuitive measure of the contribution of the covariates, since their values, once added to the expected value, should equal the predicted SOC content. We would like to remind the reader that SHAP values are not the amount of SOC that would vary if we remove a specific covariate, since they are a weighted average of all marginal contributions (Eq. 1).
Model.: Besides per-sample explanations, SHAP values can also be interpreted in a global context by analysing all the samples simultaneously. This allowed us to explore the overall contribution of the different covariates and also the interactions between them. By having a global understanding of the relationships captured by the model, it was possible to analyse the model behaviour along its environmental gradient and identify significant thresholds.
Model (spatial).: To make a prediction at a single pixel, the CNN model uses a 29 pixel × 29 pixel context window around the target pixel. In order to visualise the spatial pattern of the SHAP values, we estimated the SHAP values for the context windows around each pixel of a larger area. Since there is a considerable overlap between context windows, the final pixel contribution is an average of all the instances where that pixel is used as context (up to 841 times for a context window with a size of 29 pixels × 29 pixels). Related to the previous task, this analysis allowed us to visualise the environmental thresholds in a spatial context. In order to compare the patterns captured by the CNN model, we also generated maps of the SHAP values obtained from a linear model and a tree-like model (Cubist). The Cubist model corresponds to the same model used to compare performance of the CNN reported in Padarian et al. (2019).

https://soil.copernicus.org/articles/6/389/2020/soil-6-389-2020-f03

Figure 3Force plots of the same two samples presented in Fig. 2. The figures depict the positive (green) and negative (purple) aggregated contributions that deviate the SOC content from the expected base SOC value (the mean of the predictions for the complete dataset; 5.7 % in this case). The sum of all the contributions plus the base value yield the model output (7.06 % and 8.49 % for panels a and b, respectively.)

Download

4 Results and discussion

If we only consider single samples that can be used as inputs to the model, it is possible to evaluate the contribution of each pixel to the model prediction. Fig. 2 shows two samples extracted from two contrasting locations. The first sample (top panels) is close to 2000 $m a . s . l .$ in a mountainous area, with a TAP of around 1000 mm yr⁻¹ and MAT of 6 ^∘C. Consequently, it was possible to observe a negative contribution of the elevation and slope and a positive, large contribution of MAT. The second sample is located in a low valley, closer to the coast, with MAT of 12 ^∘C. In this case, the topography had a positive contribution (gentle slopes favouring deposition), and the effect of MAT is positive but lower than the first sample. In both cases, the MAT is lower compared with the rest of the country, which favours SOC accumulation (Krull et al., 2003).

In the case of the covariates related to topography, this analysis gives an indication of influential areas in the landscape (within the context window), either positive or negative, describing interactions over long distances in what Trenberth et al. (1998) denominated teleconnections. This somewhat contradicts the opinion of Behrens et al. (2019), where they state that it is not possible to extract teleconnections using CNNs.

By aggregating the per-pixel contributions into a single SHAP value per sample (Fig. 3), it is possible to corroborate the contributions observed in Fig. 2, with the elevation and slope having a negative contribution in the first location (mountains) and an opposite effect in the second location (valley). Similar to MAT, the contribution of TAP in both cases is positive because this is a national model and both samples are located the southern part of the study area, which is more humid compared with the north.

If we consider multiple samples simultaneously, it is possible to have a general idea of what the model has learned. We were able to confirm that the model captured the direct relationship between TAP and SOC and the inverse relationship between MAT and SOC (Fig. 4). Due to the great north–south climatic gradient in Chile and the abrupt changes in topography from the Andes to the coast, the main two drivers are TAP and MAP, with a broader range of SHAP values, closely followed by elevation. These results are expected and in line with those of previous studies at the national to continental scale (Martin et al., 2011; Viscarra-Rossel et al., 2014; Akpa et al., 2016). In Fig. 4 it is also possible to corroborate that the predicted SOC content decreased in depth.

https://soil.copernicus.org/articles/6/389/2020/soil-6-389-2020-f04

Figure 4SHAP values for each covariate and soil depth interval for the CNN model. TAP: total annual precipitation; MAT: mean annual temperature; TWI: topographic wetness index.

Download

It was also possible to further inspect the effect of particular covariates and evaluate if the model captured interactions between them. For instance, the positive contribution of temperature peaked in the range between 4 and 8 ^∘C with a negative contribution above 12 ^∘C. Given the climatic conditions in the country, the decrease in MAT contribution is correlated with the decrease of precipitation, which is accentuated in areas with high temperatures (Fig. 5a) where carbon inputs are low (Ewing et al., 2008). The contribution of precipitation (Fig. 5a) showed a defined trend, with a mostly constant negative response up to around 1000 mm yr⁻¹, where it starts increasing, becoming positive around 1400 mm yr⁻¹. The model also captured an inverse relationship between SOC content and elevation (Fig. 5c) for which higher values are usually associated with steeper terrain, given the topography of the country. In general, this effect was exacerbated in areas with TAP higher than 400 mm yr⁻¹.

https://soil.copernicus.org/articles/6/389/2020/soil-6-389-2020-f05

Figure 5Dependency plots between SHAP values and selected covariates: (a) between SHAP values and mean annual temperature (MAT), showing the interaction with total annual precipitation (TAP; colour scale); (b) between SHAP values and total annual precipitation, showing the interaction with mean annual temperature (colour scale); and (c) between SHAP values and elevation, showing the interaction with total annual precipitation (colour scale).

Download

For both temperature and precipitation, the threshold values where their contributions turn positive (around 12 ^∘C and 1400 mm yr⁻¹, respectively) coincide with a significant area within the country (around 38^∘ S), where Andisols become more prevalent and there is a change from a xeric to an udic soil moisture regime (Luzio, 2010), associated with a sharp increase in the content of SOC (Padarian et al., 2017). In terms of the TAP threshold that seems to change the behaviour of the contribution of the elevation (around 400 mm yr⁻¹), it roughly corresponds to the transition between the arid and semi-arid zones of the country (Casanova et al., 2013), where the erosion processes by water start becoming more important.

All the previous interpretations are further corroborated by the resulting map of SHAP values (Fig. 6). It is possible to distinguish the valley close to the coast, which has lower TAP and slope and higher MAT compared with the surrounding areas. Those conditions contribute negatively to the model output, compared to the expected mean output. As mentioned before, even when those trends make sense in terms of SOC processes, they are probably interacting with the effect of human activity (agriculture) in the valley.

https://soil.copernicus.org/articles/6/389/2020/soil-6-389-2020-f06

Figure 6Spatial distribution of SHAP values for each covariate for the CNN model. The value of each pixel is an average of all the instances where that pixel is used as context (up to 841 times for a 29 pixel × 29 pixel context window). TAP: total annual precipitation; MAT: mean annual temperature; TWI: topographic wetness index.

The SHAP values can also be applied to other ML models and even to linear models. When comparing the SHAP value map of the CNN with the SHAP value map of a tree-like model and a linear model, it was possible to observe similar trends in all of them. In the tree-like model (RMSE 3.6 %; Fig. 7) it is also possible to distinguish the valley close to the coast, the increase in the contribution of TAP and MAT, and the east–west gradient of the elevation contribution. The most evident difference is the sharpness of the limits generated by the tree-like model, which is expected given the nature of the underlying algorithm. The linear model (RMSE 3.8 %; Fig. 8) also showed similar spatial patterns, but, as expected, they were much simpler and smoother than the tree-like and CNN models.

https://soil.copernicus.org/articles/6/389/2020/soil-6-389-2020-f07

Figure 7Spatial distribution of SHAP values for each covariate for the tree-like model (RMSE 3.6 %). TAP: total annual precipitation; MAT: mean annual temperature; TWI: topographic wetness index.

https://soil.copernicus.org/articles/6/389/2020/soil-6-389-2020-f08

Figure 8Spatial distribution of SHAP values for each covariate for the linear model (RMSE 3.8 %). TAP: total annual precipitation; MAT: mean annual temperature; TWI: topographic wetness index.

The results of this study show that it is possible to work towards interpretable deep learning models in DSM and that a complex model, generally perceived as a black box, can be inspected using SHAP values. It was possible to assess the covariate importance for the whole model, providing an alternative to the variables of importance of the random forest algorithm, which is commonly used in DSM (Wiesmeier et al., 2011; Heung et al., 2014; Dharumarajan et al., 2017; Ellili et al., 2019). More importantly, we were able to evaluate the variable importance in a spatial context, generating maps of the contributions of each covariate to the model prediction, in a novel application of SHAP values. Surprisingly, considering the spatial nature of DSM, efforts to understand the behaviour of a model in space are scarce. To the best of our knowledge, this has only been attempted with tree-like models (Bui et al., 2006). These models generate rules to partition the data in different groups, and, when a partition is no longer possible, a linear regression is fitted to the observations of that terminal node. That allows for the possibility of generating maps of (a) the rules that lead to the terminal nodes, (b) covariate thresholds defined in the partition rules, and (c) the parameters of the terminal linear regressions. Given the linear nature of the tree-like models, the spatial representation of their rules, covariate thresholds, and parameters, although useful, usually lead to maps with abrupt changes or large continuous areas of dominance by a single rule. Also, since the model parameters are specific for each terminal node, they are not comparable over the whole study area.

Using SHAP values shows promising results to interpret a DSM CNN, which it is not only necessary to corroborate that the model was trained properly, but it is also a fundamental part of the process leading to knowledge discovery (Fayyad et al., 1996). Here we presented a relatively simple model that only used five covariates but that already showed interesting insights, identifying environmental thresholds that can be attributed to specific geographical areas. Potentially, including other covariates such as land use and management practices could allow the model to extract knowledge related to how human intervention affects SOC content and perhaps could help us find solutions to prevent soil degradation. Of course, increasing the complexity of the problem generally translates into needing more data. In applications such as drug discovery, studies reporting novel discoveries through deep learning methods use extensive databases (Zhang et al., 2017; Ekins et al., 2019; Zhavoronkov et al., 2019). On the other hand, global and national soil datasets are sparse in space and time, with a number of profiles in the order of hundreds to tens of thousands. We think that it is important to keep generating more soil data to avoid limiting the applicability of more complex deep learning models and their capacity to extract new insights that could improve our knowledge and understanding of soils.

5 Conclusions

In this paper we introduced the use of game theory, specifically SHAP values, in order to interpret a multi-task convolutional neural network trained to simultaneously predict soil organic carbon (SOC) content at five depth intervals. We illustrated how this method can be used to provide insights about the model. The results corroborated that the model captured sensible relationships between the target soil property and the covariates used to train the model.

We were able to interpret the contribution of the different covariates in three contexts. First, this was at a local level, showing the contribution of the covariates for a single prediction. Second, by analysing multiple local interpretations simultaneously, a global understanding of the covariates contribution was determined. Third, a spatial interpretation of the contributions was performed, which is a novel application of SHAP values and also the first detailed spatial application of this kind.

After a more detailed inspection of the contributions at the global level, we were able to identify environmental thresholds consistent with significant areas within the study area. Those thresholds can be also inspected in a spatial context thanks to the map of SHAP values. This suggests that the modelling exercise, including data quality, model selection, and training, was successful.

Considering the limitations of the current interpretation of models in digital soil mapping, especially in a spatial context, we believe that SHAP values are a valuable tool that should be included in the DSM framework, since they address the important concerns regarding the interpretability of more complex models. Additionally, the insights provided from the ML models could also lead to knowledge discovery.

Data availability

The data are available upon request.

Author contributions

JP conceived and executed the research and wrote the paper. BM gave suggestions about the approach and wrote the paper. ABMB gave suggestions about the approach. All authors reviewed the paper.

Competing interests

The authors declare that they have no conflict of interest.

Acknowledgements

The authors acknowledge the University of Sydney HPC service at The University of Sydney for providing HPC resources that have contributed to the research results reported within this paper.

Financial support

This research has been supported by the Australian Research Council, project “Forecasting soil conditions” (grant no. DP200102542).

Review statement

This paper was edited by Olivier Evrard and reviewed by Matt Aitkenhead and one anonymous referee.

References

Akpa, S. I., Odeh, I. O., Bishop, T. F., Hartemink, A. E., and Amapu, I. Y.: Total soil organic carbon and carbon sequestration potential in Nigeria, Geoderma, 271, 202–215, 2016. a

Anwar, S. M., Majid, M., Qayyum, A., Awais, M., Alnowami, M., and Khan, M. K.: Medical image analysis using convolutional neural networks: a review, J. Med. Syst., 42, 226, https://doi.org/10.1007/s10916-018-1088-1, 2018. a

Behrens, T., MacMillan, R. A., Rossel, R. A. V., Schmidt, K., and Lee, J.: Teleconnections in spatial modelling, Geoderma, 354, 113854, https://doi.org/10.1016/j.geoderma.2019.07.012, 2019. a

Bui, E. N., Henderson, B. L., and Viergever, K.: Knowledge discovery from models of soil properties developed through data mining, Ecol. Model., 191, 431–446, 2006. a

Casanova, M., Salazar, O., Seguel, O., and Luzio, W.: The soils of Chile, Springer, London, https://doi.org/10.1007/978-94-007-5949-7, 2013. a

Conrad, O., Bechtel, B., Bock, M., Dietrich, H., Fischer, E., Gerlitz, L., Wehberg, J., Wichmann, V., and Böhner, J.: System for Automated Geoscientific Analyses (SAGA) v. 2.1.4, Geosci. Model Dev., 8, 1991–2007, https://doi.org/10.5194/gmd-8-1991-2015, 2015. a

Dharumarajan, S., Hegde, R., and Singh, S.: Spatial prediction of major soil properties using Random Forest techniques-A case study in semi-arid tropics of South India, Geoderma Regional, 10, 154–162, 2017. a

Doshi-Velez, F. and Kim, B.: Towards a rigorous science of interpretable machine learning, arXiv preprint, arXiv:1702.08608, 2017. a

Dressel, J. and Farid, H.: The accuracy, fairness, and limits of predicting recidivism, Science Advances, 4, eaao5580, https://doi.org/10.1126/sciadv.aao5580, 2018. a

Ekins, S., Puhl, A. C., Zorn, K. M., Lane, T. R., Russo, D. P., Klein, J. J., Hickey, A. J., and Clark, A. M.: Exploiting machine learning for end-to-end drug discovery and development, Nat. Mater., 18, 435–441, https://doi.org/10.1038/s41563-019-0338-z, 2019. a

Ellili, Y., Walter, C., Michot, D., Pichelin, P., and Lemercier, B.: Mapping soil organic carbon stock change by soil monitoring and digital soil mapping at the landscape scale, Geoderma, 351, 1–8, 2019. a

Ewing, S., Macalady, J., Warren-Rhodes, K., McKay, C., and Amundson, R.: Changes in the soil C cycle at the arid-hyperarid transition in the Atacama Desert, J. Geophys. Res.-Biogeo., 113, G02S90, https://doi.org/10.1029/2007JG000495, 2008. a

Fayyad, U., Piatetsky-Shapiro, G., and Smyth, P.: From data mining to knowledge discovery in databases, AI Mag., 17, 37–37, 1996. a

Goodman, B. and Flaxman, S.: European Union regulations on algorithmic decision-making and a “right to explanation”, AI Mag., 38, 50–57, 2017. a

Heung, B., Bulmer, C. E., and Schmidt, M. G.: Predictive soil parent material mapping at a regional-scale: a random forest approach, Geoderma, 214, 141–154, 2014. a

Hijmans, R. J., Cameron, S. E., Parra, J. L., Jones, P. G., and Jarvis, A.: Very high resolution interpolated climate surfaces for global land areas, Int. J. Climatol., 25, 1965–1978, 2005. a

Krull, E. S., Baldock, J. A., and Skjemstad, J. O.: Importance of mechanisms and processes of the stabilisation of soil organic matter for modelling carbon turnover, Funct. Plant Biol., 30, 207–222, 2003. a

Lamichhane, S., Kumar, L., and Wilson, B.: Digital soil mapping algorithms and covariates for soil organic carbon mapping and their implications: A review, Geoderma, 352, 395–413, 2019. a

Lehner, B., Verdin, K., and Jarvis, A.: New global hydrography derived from spaceborne elevation data, EOS T. Am. Geophys. Un., 89, 93–94, 2008. a

Lundberg, S. M. and Lee, S.-I.: A unified approach to interpreting model predictions, in: Advances in neural information processing systems, edited by: Guyon, I., Luxburg, U. V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., and Garnett, R., 4765–4774, 2017. a, b, c, d, e

Lundberg, S. M., Nair, B., Vavilala, M. S., Horibe, M., Eisses, M. J., Adams, T., Liston, D. E., Low, D. K.-W., Newman, S.-F., Kim, J., et al.: Explainable machine-learning predictions for the prevention of hypoxaemia during surgery, Nature Biomedical Engineering, Neural Information Processing Systems Foundation, Inc., San Diego, CA, USA, 2, 749–760, 2018. a

Luzio, W. (Ed.): Suelos de Chile, Universidad de Chile, Santiago, Chile, 2010. a

Martin, M. P., Wattenbach, M., Smith, P., Meersmans, J., Jolivet, C., Boulonne, L., and Arrouays, D.: Spatial distribution of soil organic carbon stocks in France, Biogeosciences, 8, 1053–1065, https://doi.org/10.5194/bg-8-1053-2011, 2011. a

Mokhtari, K. E., Higdon, B. P., and Başar, A.: Interpreting financial time series with SHAP values, in: Proceedings of the 29th Annual International Conference on Computer Science and Software Engineering, Nature Biomedical Engineering, 4–6 November 2019, Markham, Ontario, Canada, 166–172, 2019. a

Nash, J. F.: Equilibrium points in n-person games, P. Natl. Acad. Sci. USA, 36, 48–49, https://doi.org/10.1073/pnas.36.1.48, 1950. a

Nash, W., Drummond, T., and Birbilis, N.: A review of deep learning in the study of materials degradation, npj Materials Degradation, 2, 1–12, 2018. a

Padarian, J., Minasny, B., and McBratney, A.: Chile and the Chilean soil grid: a contribution to GlobalSoilMap, Geoderma Regional, 9, 17–28, 2017. a, b

Padarian, J., Minasny, B., and McBratney, A. B.: Using deep learning for digital soil mapping, SOIL, 5, 79–89, https://doi.org/10.5194/soil-5-79-2019, 2019. a, b, c, d, e

Padarian, J., Minasny, B., and McBratney, A. B.: Machine learning and soil sciences: a review aided by machine learning tools, SOIL, 6, 35–52, https://doi.org/10.5194/soil-6-35-2020, 2020. a

Parsa, A. B., Movahedi, A., Taghipour, H., Derrible, S., and Mohammadian, A. K.: Toward safer highways, application of XGBoost and SHAP for real-time accident detection and feature analysis, Accident Anal. Prev., 136, 105405, https://doi.org/10.1016/j.aap.2019.105405, 2020. a

Rasmusen, E.: Games and information: An introduction to game theory, 519.3/R22g, Blackwell, Oxford, 1989. a

Shapley, L. S.: A value for n-person games, Contributions to the Theory of Games, edited by: Kuhn, H. W. and Tucker, A. W., Princeton University Press, 2, 307–317, 1953. a, b

Shen, C.: A transdisciplinary review of deep learning research and its relevance for water resources scientists, Water Resour. Res., 54, 8558–8593, 2018. a

Trenberth, K. E., Branstator, G. W., Karoly, D., Kumar, A., Lau, N.-C., and Ropelewski, C.: Progress during TOGA in understanding and modeling global teleconnections associated with tropical sea surface temperatures, J. Geophys. Res.-Oceans, 103, 14291–14324, 1998. a

Viscarra-Rossel, R. A., Webster, R., Bui, E. N., and Baldock, J. A.: Baseline map of organic carbon in Australian soil to support national carbon accounting and monitoring under climate change, Glob. Change Biol., 20, 2953–2970, 2014. a

Webb, S.: Deep learning for biology, Nature, 554, 555–557, 2018. a

Wiesmeier, M., Barthold, F., Blank, B., and Kögel-Knabner, I.: Digital mapping of soil organic matter stocks using Random Forest modeling in a semi-arid steppe ecosystem, Plant Soil, 340, 7–24, 2011. a

Zhang, L., Tan, J., Han, D., and Zhu, H.: From machine learning to deep learning: progress in machine intelligence for rational drug discovery, Drug Discov. Today, 22, 1680–1685, 2017. a

Zhavoronkov, A., Ivanenkov, Y. A., Aliper, A., Veselov, M. S., Aladinskiy, V. A., Aladinskaya, A. V., Terentiev, V. A., Polykovskiy, D. A., Kuznetsov, M. D., Asadulaev, A., Volkov, Y., Zholus, A., Shayakhmetov, R. R., Zhebrak, A., Minaeva, L. I., Zagribelnyy, B. A., Lee, L. H., Soll, R., Madge, D., Xing, L., Guo, T., and Aspuru-Guzik, A.: Deep learning enables rapid identification of potent DDR1 kinase inhibitors, Nature Biotechnol., 37, 1038–1040, 2019. a

Articles

Short summary

In this paper we introduce the use of game theory to interpret a digital soil mapping (DSM) model to understand the contribution of environmental factors to the prediction of soil organic carbon (SOC) in Chile. The analysis corroborated that the SOC model is capturing sensible relationships between SOC and climatic and topographical factors. We were able to represent them spatially (map) addressing the limitations of the current interpretation of models in DSM.