<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">SOIL</journal-id><journal-title-group>
    <journal-title>SOIL</journal-title>
    <abbrev-journal-title abbrev-type="publisher">SOIL</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">SOIL</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">2199-398X</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/soil-4-173-2018</article-id><title-group><article-title>No silver bullet for digital soil mapping: country-specific soil organic carbon estimates across Latin America</article-title><alt-title>No silver bullet on digital soil mapping</alt-title>
      </title-group><?xmltex \runningtitle{No silver bullet on digital soil mapping}?><?xmltex \runningauthor{M. Guevara et al.}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Guevara</surname><given-names>Mario</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-9788-9947</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2 aff3">
          <name><surname>Olmedo</surname><given-names>Guillermo Federico</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-4482-3266</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Stell</surname><given-names>Emma</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff3">
          <name><surname>Yigini</surname><given-names>Yusuf</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff4">
          <name><surname>Aguilar Duarte</surname><given-names>Yameli</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-7007-1360</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff5">
          <name><surname>Arellano Hernández</surname><given-names>Carlos</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff6">
          <name><surname>Arévalo</surname><given-names>Gloria E.</given-names></name>
          
        <ext-link>https://orcid.org/0000-0001-7323-1238</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff7">
          <name><surname>Arroyo-Cruz</surname><given-names>Carlos Eduardo</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff8">
          <name><surname>Bolivar</surname><given-names>Adriana</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff9">
          <name><surname>Bunning</surname><given-names>Sally</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff10">
          <name><surname>Bustamante Cañas</surname><given-names>Nelson</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff5">
          <name><surname>Cruz-Gaistardo</surname><given-names>Carlos Omar</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff11">
          <name><surname>Davila</surname><given-names>Fabian</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff11">
          <name><surname>Dell Acqua</surname><given-names>Martin</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff12">
          <name><surname>Encina</surname><given-names>Arnulfo</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff13">
          <name><surname>Figueredo Tacona</surname><given-names>Hernán</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff11">
          <name><surname>Fontes</surname><given-names>Fernando</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff14">
          <name><surname>Hernández Herrera</surname><given-names>José Antonio</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff5">
          <name><surname>Ibelles Navarro</surname><given-names>Alejandro Roberto</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff15">
          <name><surname>Loayza</surname><given-names>Veronica</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-7928-3546</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff6">
          <name><surname>Manueles</surname><given-names>Alexandra M.</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff16">
          <name><surname>Mendoza Jara</surname><given-names>Fernando</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff17">
          <name><surname>Olivera</surname><given-names>Carolina</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff10">
          <name><surname>Osorio Hermosilla</surname><given-names>Rodrigo</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff11">
          <name><surname>Pereira</surname><given-names>Gonzalo</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff11">
          <name><surname>Prieto</surname><given-names>Pablo</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff18">
          <name><surname>Ramos</surname><given-names>Iván Alexis</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff19">
          <name><surname>Rey Brina</surname><given-names>Juan Carlos</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff20">
          <name><surname>Rivera</surname><given-names>Rafael</given-names></name>
          
        <ext-link>https://orcid.org/0000-0001-5764-3806</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff7">
          <name><surname>Rodríguez-Rodríguez</surname><given-names>Javier</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff21 aff22">
          <name><surname>Roopnarine</surname><given-names>Ronald</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff23">
          <name><surname>Rosales Ibarra</surname><given-names>Albán</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff24">
          <name><surname>Rosales Riveiro</surname><given-names>Kenset Amaury</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff25">
          <name><surname>Schulz</surname><given-names>Guillermo Andrés</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff26">
          <name><surname>Spence</surname><given-names>Adrian</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff27">
          <name><surname>Vasques</surname><given-names>Gustavo M.</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff3">
          <name><surname>Vargas</surname><given-names>Ronald R.</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Vargas</surname><given-names>Rodrigo</given-names></name>
          <email>rvargas@udel.edu</email>
        <ext-link>https://orcid.org/0000-0001-6829-5333</ext-link></contrib>
        <aff id="aff1"><label>1</label><institution>University of Delaware, Department of Plant and Soil Sciences, Newark, DE, USA</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>INTA EEA Mendoza, San Martín 3853, Luján de Cuyo, Mendoza, Argentina</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>FAO, Vialle de Terme di Caracalla, Rome, Italy</institution>
        </aff>
        <aff id="aff4"><label>4</label><institution>Instituto Nacional de Investigaciones Forestales, Agrícolas y Pecuarias, Mérida, Mexico</institution>
        </aff>
        <aff id="aff5"><label>5</label><institution>Instituto Nacional de Estadísitica y Geografía, Aguascalientes, Mexico</institution>
        </aff>
        <aff id="aff6"><label>6</label><institution>Zamorano University of Honduras and Asociación Hondureña de la Ciencia del Suelo, Tegucigalpa, Honduras</institution>
        </aff>
        <aff id="aff7"><label>7</label><institution>National Commission for the Knowledge and Use of Biodiversity, Mexico City, Mexico</institution>
        </aff>
        <aff id="aff8"><label>8</label><institution>Subdirección Agrología, Instituto Geográfico Agustín Codazzi, Bogotá, Colombia</institution>
        </aff>
        <aff id="aff9"><label>9</label><institution>Oficina Regional de la FAO para América Latina y el Caribe, Santiago de Chile, Chile</institution>
        </aff>
        <aff id="aff10"><label>10</label><institution>Servicio Agrícola y Ganadero, Santiago de Chile, Chile</institution>
        </aff>
        <aff id="aff11"><label>11</label><institution>Direccion General de Recursos Naturales, Ministerio de Ganaderia,<?xmltex \hack{\break}?> Agricultura y Pesca, Montevideo, Uruguay</institution>
        </aff>
        <aff id="aff12"><label>12</label><institution>Facultad de Ciencias Agrarias de la Universidad Nacional de Asunción, Asunción, Paraguay</institution>
        </aff>
        <aff id="aff13"><label>13</label><institution>Land Viceministry, Ministry of Rural Development and Land, La Paz, Bolivia</institution>
        </aff>
        <aff id="aff14"><label>14</label><institution>Universidad Autónoma Agraria Antonio Narro Unidad Laguna, Torreón, Mexico</institution>
        </aff>
        <aff id="aff15"><label>15</label><institution>Ministerio de Agricultura y Ganaderia, Quito, Ecuador</institution>
        </aff>
        <aff id="aff16"><label>16</label><institution>Universidad Nacional Agraria, Managua, Nicaragua</institution>
        </aff>
        <aff id="aff17"><label>17</label><institution>Oficina Regional de la FAO para América Latina y el Caribe, Bogotá, Colombia</institution>
        </aff>
        <aff id="aff18"><label>18</label><institution>Instituto de Investigación Agropecuaria de Panamá, Panamá, Panama</institution>
        </aff>
        <aff id="aff19"><label>19</label><institution>Sociedad Venezolana de la Ciencia del Suelo, Caracas, Venezuela</institution>
        </aff>
        <aff id="aff20"><label>20</label><institution>Ministerio de Medio Ambiente, Santo Domingo, Dominican Republic</institution>
        </aff>
        <aff id="aff21"><label>21</label><institution>Department of Natural and Life Sciences, COSTAATT, Port of Spain, Trinidad and Tobago</institution>
        </aff>
        <aff id="aff22"><label>22</label><institution>University of the West Indies, St. Augustine Campus, St. Augustine, Trinidad and Tobago</institution>
        </aff>
        <aff id="aff23"><label>23</label><institution>Instituto de Innovación en Transferencia y Tecnología Agropecuaria, San José, Costa Rica</institution>
        </aff>
        <aff id="aff24"><label>24</label><institution>Ministerio de Ambiente y Recursos Naturales de Guatemala, Ciudad Guatemala, Guatemala</institution>
        </aff>
        <aff id="aff25"><label>25</label><institution>INTA CNIA, Buenos Aires, Argentina</institution>
        </aff>
        <aff id="aff26"><label>26</label><institution>International Centre for Environmental and Nuclear Sciences,<?xmltex \hack{\break}?> University of the West Indies, Kingston, Jamaica</institution>
        </aff>
        <aff id="aff27"><label>27</label><institution>Embrapa Solos, Rio de Janeiro, Brazil</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Rodrigo Vargas (rvargas@udel.edu)</corresp></author-notes><pub-date><day>1</day><month>August</month><year>2018</year></pub-date>
      
      <volume>4</volume>
      <issue>3</issue>
      <fpage>173</fpage><lpage>193</lpage>
      <history>
        <date date-type="received"><day>15</day><month>December</month><year>2017</year></date>
           <date date-type="rev-request"><day>25</day><month>January</month><year>2018</year></date>
           <date date-type="rev-recd"><day>15</day><month>June</month><year>2018</year></date>
           <date date-type="accepted"><day>24</day><month>June</month><year>2018</year></date>
      </history>
      <permissions>
        
        
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://soil.copernicus.org/articles/4/173/2018/soil-4-173-2018.html">This article is available from https://soil.copernicus.org/articles/4/173/2018/soil-4-173-2018.html</self-uri><self-uri xlink:href="https://soil.copernicus.org/articles/4/173/2018/soil-4-173-2018.pdf">The full text article is available as a PDF file from https://soil.copernicus.org/articles/4/173/2018/soil-4-173-2018.pdf</self-uri>
      <abstract>
    <p id="d1e606">Country-specific soil organic carbon (SOC) estimates are the baseline for
the Global SOC Map of the Global Soil Partnership (GSOCmap-GSP). This
endeavor is key to explaining the uncertainty of global SOC estimates but
requires harmonizing heterogeneous datasets and building country-specific
capacities for digital soil mapping (DSM). We identified country-specific
predictors for SOC and tested the performance of five predictive algorithms
for mapping SOC across Latin America. The algorithms included support vector
machines (SVMs), random forest (RF), kernel-weighted nearest neighbors (KK),
partial least squares regression (PL), and regression kriging based on
stepwise multiple linear models (RK). Country-specific training data and SOC
predictors (5 <inline-formula><mml:math id="M1" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 5 km pixel resolution) were obtained from
ISRIC – World Soil Information. Temperature, soil type, vegetation
indices, and topographic constraints were the best predictors for SOC, but
country-specific predictors and their respective weights varied across Latin
America. We compared a large diversity of country-specific datasets and
models, and were able to explain SOC variability in a range between <inline-formula><mml:math id="M2" display="inline"><mml:mo>∼</mml:mo></mml:math></inline-formula> 1 and <inline-formula><mml:math id="M3" display="inline"><mml:mo>∼</mml:mo></mml:math></inline-formula> 60 %, with no universal predictive algorithm among
countries. A regional (<inline-formula><mml:math id="M4" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> <inline-formula><mml:math id="M5" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 11 268 SOC estimates) ensemble of these
five algorithms was able to explain <inline-formula><mml:math id="M6" display="inline"><mml:mo>∼</mml:mo></mml:math></inline-formula> 39 % of SOC variability from
repeated 5-fold cross-validation. We report a combined SOC stock of
77.8 <inline-formula><mml:math id="M7" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 43.6 Pg (uncertainty represented by the full conditional
response of independent model residuals) across Latin America. SOC stocks
were higher in tropical forests (30 <inline-formula><mml:math id="M8" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 16.5 Pg) and croplands
(13 <inline-formula><mml:math id="M9" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 8.1 Pg). Country-specific and regional ensembles revealed
spatial discrepancies across geopolitical borders, higher elevations, and
coastal plains, but provided similar regional stocks (77.8 <inline-formula><mml:math id="M10" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 42.2 and
76.8 <inline-formula><mml:math id="M11" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 45.1 Pg, respectively). These results are conservative
compared to global estimates (e.g., SoilGrids250m 185.8 Pg, the Harmonized
World Soil Database 138.4 Pg, or the GSOCmap-GSP 99.7 Pg). Countries with
large area (i.e., Brazil, Bolivia, Mexico, Peru) and large spatial SOC
heterogeneity had lower SOC stocks per unit area and larger uncertainty in
their predictions. We highlight that expert opinion is needed to set boundary
prediction limits to avoid unrealistically high modeling estimates. For
maximizing explained variance while minimizing prediction bias, the selection
of predictive algorithms for SOC mapping should consider density of available
data and variability of country-specific environmental gradients. This study
highlights the large degree of spatial uncertainty in SOC estimates across
Latin America. We provide a framework for improving country-specific mapping
efforts and reducing current discrepancy of global, regional, and
country-specific SOC estimates.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<?pagebreak page174?><sec id="Ch1.S1" sec-type="intro">
  <title>Introduction</title>
      <p id="d1e694">Soils store around 1500 <inline-formula><mml:math id="M12" display="inline"><mml:mi mathvariant="normal">Pg</mml:mi></mml:math></inline-formula> of carbon and represent the largest
terrestrial carbon pool <xref ref-type="bibr" rid="bib1.bibx34" id="paren.1"/>; thus, it is critical to
accurately quantify the variability of soil organic carbon (SOC) from
local to global scales. During the fourth session of the Global Soil Partnership
(GSP) Plenary Assembly held in May 2016 in Rome, it was agreed to develop a
Global Soil Organic Carbon Map (GSOCmap) <xref ref-type="bibr" rid="bib1.bibx17" id="paren.2"/>. The overarching
goal is that a Global SOC Map of the Global Soil Partnership (GSOCmap-GSP)
will be developed using a distributed approach relying on country-specific
SOC maps. Country-specific maps represent a valuable source of information to
explain the high discrepancy of current global SOC estimates such as the
SoilGrids250m system and the Harmonized World Soil Database <xref ref-type="bibr" rid="bib1.bibx64" id="paren.3"/>.
The Food and Agriculture Organization (FAO) recently compiled how different
statistical methods (e.g., regression kriging and machine learning) could be
used to generate country-specific SOC maps and calculate uncertainty
<xref ref-type="bibr" rid="bib1.bibx72" id="paren.4"/>. All these approaches consider the reference framework of
the  Soils, Climate, Organisms, Parent material, Age and (<inline-formula><mml:math id="M13" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula>) space or
spatial position (SCORPAN) model for digital soil mapping (DSM) <xref ref-type="bibr" rid="bib1.bibx43" id="paren.5"/>. In the
SCORPAN reference framework, a soil attribute (e.g., SOC) can be predicted as
a function of the soil-forming environment, in correspondence with
soil-forming factors from the Dokuchaev hypothesis and Jenny's soil-forming
equation based on climate, organisms, relief, parent material, and elapsed
time of soil formation <xref ref-type="bibr" rid="bib1.bibx19" id="paren.6"/>. The SCORPAN
reference framework is an empirical approach that can be expressed as in
Eq. (1):

              <disp-formula specific-use="align" content-type="numbered"><mml:math id="M14" display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:msub><mml:mi mathvariant="normal">a</mml:mi><mml:mrow><mml:mo>[</mml:mo><mml:mi>x</mml:mi><mml:mo>;</mml:mo><mml:mi>y</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi>t</mml:mi><mml:mo>]</mml:mo></mml:mrow></mml:msub></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>f</mml:mi></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mo mathsize="1.1em">(</mml:mo><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mo>[</mml:mo><mml:mi>x</mml:mi><mml:mo>;</mml:mo><mml:mi>y</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi>t</mml:mi><mml:mo>]</mml:mo></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mo>[</mml:mo><mml:mi>x</mml:mi><mml:mo>;</mml:mo><mml:mi>y</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi>t</mml:mi><mml:mo>]</mml:mo></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>O</mml:mi><mml:mrow><mml:mo>[</mml:mo><mml:mi>x</mml:mi><mml:mo>;</mml:mo><mml:mi>y</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi>t</mml:mi><mml:mo>]</mml:mo></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mo>[</mml:mo><mml:mi>x</mml:mi><mml:mo>;</mml:mo><mml:mi>y</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi>t</mml:mi><mml:mo>]</mml:mo></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E1"><mml:mtd/><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mo>[</mml:mo><mml:mi>x</mml:mi><mml:mo>;</mml:mo><mml:mi>y</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi>t</mml:mi><mml:mo>]</mml:mo></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mo>[</mml:mo><mml:mi>x</mml:mi><mml:mo>;</mml:mo><mml:mi>y</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi>t</mml:mi><mml:mo>]</mml:mo></mml:mrow></mml:msub><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          where <inline-formula><mml:math id="M15" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi mathvariant="normal">a</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the soil attribute of interest at a specific location
<inline-formula><mml:math id="M16" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> (represented by the spatial coordinates of field observations <inline-formula><mml:math id="M17" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>; <inline-formula><mml:math id="M18" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula>)
and at a specific period of time (<inline-formula><mml:math id="M19" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>); <inline-formula><mml:math id="M20" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> is the soil or other soil
properties that are correlated with the soil attribute of interest (<inline-formula><mml:math id="M21" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi mathvariant="normal">a</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>); <inline-formula><mml:math id="M22" display="inline"><mml:mi>C</mml:mi></mml:math></inline-formula> is the climate or climatic properties of the
environment; <inline-formula><mml:math id="M23" display="inline"><mml:mi>O</mml:mi></mml:math></inline-formula> is the organisms, vegetation, fauna, or human activity; <inline-formula><mml:math id="M24" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>
is topography or landscape attributes; <inline-formula><mml:math id="M25" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula> is parent material or lithology;
and <inline-formula><mml:math id="M26" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> is the substrate age or the time factor. To generate predictions of
<inline-formula><mml:math id="M27" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi mathvariant="normal">a</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> across places where no soil data are available, <inline-formula><mml:math id="M28" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> should be
explicit for the information layers representing the soil-forming factors.
These predictions will be representative of a specific period of time (<inline-formula><mml:math id="M29" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>)
when soil available data were collected. Therefore, the prediction factors
ideally should represent the conditions of the soil-forming environment for
the same period<?pagebreak page175?> of time (as much as possible) when soil available data were
collected. In Eq. (1), the left side is usually represented by the available
geospatial soil observational data (e.g., from legacy soil profile
collections) and the right side of the equation is represented by the soil
prediction factors. These prediction factors are normally derived from four
main sources of information: (a) thematic maps (i.e., soil type, rock type,
land use type); (b) remote sensing (i.e., active and passive sensors);
(c) climate surfaces and meteorological data; and (d) digital terrain
analysis or geomorphometry. The SCORPAN reference framework is widely used,
but one critical challenge is to quantify the relative importance of the
soil-forming factors (i.e., prediction factors) that could explain the
underlying soil processes controlling the spatial variability of a specific
soil attribute (i.e., SOC).</p>
      <p id="d1e1013">Arguably, there are two approaches for statistical modeling
<xref ref-type="bibr" rid="bib1.bibx11" id="paren.7"/> that influence the predictions of the spatial variability
of SOC. One assumes that the variability of observations can be reproduced by
a given stochastic data model (e.g., with hypotheses about the spatial
structure of the variable). The other approach uses algorithms and treats as
unknown the mechanisms generating the structure of values in available
datasets (e.g., with hypothesis about the statistical distribution and
moments of the variable). For SOC modeling, the accuracies of global models
compared with country-specific estimates have not been systematically
evaluated on detail. While globally available SOC predictions rely on large
and complex multivariate spaces to represent the soil-forming environment,
local (i.e., more simple) models may be useful for validation purposes and
required to measure the bias of global SOC estimates, specifically, at
particular sites/countries (well represented by available data), where SOC drivers may be
easier to identify due to a smaller range of SOC variance. In addition, the
assumptions of global models compared with local efforts may be different,
and local datasets could complement global information sources. Because
different mapping approaches use available information (i.e., training data
and predictors) in different ways, comparing several approaches and methods
is useful to quantify the relative importance of prediction factors across
data configurations and distributional properties. We argue that a systematic
analysis of predictive algorithms and consequently selection of predictors
(by each one of the algorithms) could provide insights about the underlying
factors that control the spatial variability of SOC.</p>
      <p id="d1e1019">The last decade has seen an increasing diversity of approaches for DSM. Data
mining techniques have been successfully used to model and predict the
spatial variability of soil properties <xref ref-type="bibr" rid="bib1.bibx57 bib1.bibx29 bib1.bibx60" id="paren.8"/> and generate site-specific and country-specific SOC maps
<xref ref-type="bibr" rid="bib1.bibx68 bib1.bibx1" id="paren.9"/>. The combination of regression
modeling approaches with geostatistics of independent model residuals (i.e.,
regression kriging) is a combined strategy that has been widely used to map
SOC <xref ref-type="bibr" rid="bib1.bibx27 bib1.bibx47 bib1.bibx41 bib1.bibx38 bib1.bibx54 bib1.bibx1 bib1.bibx71 bib1.bibx51 bib1.bibx49" id="paren.10"/>. Machine learning
algorithms such as random forests or support vector machines have also been
used to increase statistical accuracy of soil carbon models
<xref ref-type="bibr" rid="bib1.bibx42 bib1.bibx23 bib1.bibx29" id="paren.11"/> including applications for SOC
mapping <xref ref-type="bibr" rid="bib1.bibx20 bib1.bibx62 bib1.bibx70 bib1.bibx29 bib1.bibx15 bib1.bibx39 bib1.bibx68" id="paren.12"/>. Machine
learning methods do not necessarily allow to extract information about the
main effects of prediction factors in the response variable (e.g., SOC);
consequently, a variable selection strategy is always useful to increase the
interpretability of machine learning algorithms. With this diversity of
approaches, one constant question is if there is a method that systematically
improves the prediction capacity of the others aiming to predict SOC across
large geographic areas (e.g., Latin America). We postulate that probably
there is no universal method (i.e., silver bullet) for DSM, but both global
and country-specific efforts are needed to test a variety of predictive
algorithms including variable and parameter selection strategies for
maximizing explained variance while minimizing prediction bias.</p>
      <p id="d1e1037">To minimize bias in SOC predictions, it is required to build baseline
reference estimates to quantify SOC stocks and contribute to better
parameterization for projections of SOC under future soil weathering
conditions and land degradation scenarios. Therefore, SOC estimates based on
statistical predictions should be ideally based on all available information
for specific countries or regions of interest, from both national and global
information sources. However, the availability of public SOC information is
limited across large areas of Latin America and large discrepancies exist in
current global SOC estimates <xref ref-type="bibr" rid="bib1.bibx64" id="paren.13"/>. Thus, there is a pressing need
to validate the accuracy of global SOC estimates, improve interoperability
(<xref ref-type="bibr" rid="bib1.bibx66" id="altparen.14"/>) and contribute to the capacity of countries to meet
the GlobalSoilMap specifications <xref ref-type="bibr" rid="bib1.bibx4" id="paren.15"/> to inform policy
decisions around climate change mitigation strategies.</p>
      <p id="d1e1050">This study focuses on Latin America, where site- or region-specific modeling
efforts report high explained variance when mapping SOC <xref ref-type="bibr" rid="bib1.bibx56" id="paren.16"/>.
Accurate SOC maps are required to identify areas with the potential for soil
carbon sequestration, and distinguish them from areas with high SOC. However,
site-specific efforts to map SOC across Latin America highlight the challenge
of predicting pedologically sound soil maps due to the complexity of SOC
spatial variability <xref ref-type="bibr" rid="bib1.bibx2" id="paren.17"/>, including the inconsistencies of
using simple linear approaches to explain soil and depth interrelationships
<xref ref-type="bibr" rid="bib1.bibx3" id="paren.18"/>. Site-specific SOC mapping efforts across Latin America
also suggest that variable selection and the spatial detail of SOC prediction
factors also contribute to discrepancies of SOC predictions
<xref ref-type="bibr" rid="bib1.bibx58" id="paren.19"/>. To increase the accuracy<?pagebreak page176?> of SOC predictions, the
use of high-performance computing through open-source platforms (i.e., Google
Earth) represents a valuable resource to make and continuously update (as new
and better data become available) country-specific SOC maps
<xref ref-type="bibr" rid="bib1.bibx53" id="paren.20"/>. The constant challenge is how to increase SOC
prediction accuracy while also reducing the uncertainty and granularity of
SOC grids.</p>
      <p id="d1e1068">The overarching goal of this study is to compare different predictive
algorithms across 19 data/country scenarios with publicly available
information to support the development of country-specific SOC maps to be
included in the GSOCmap-GSP. Currently, SOC information across Latin America
has been derived from global models such as the SoilGrids system or the
Harmonized World Soil Database <xref ref-type="bibr" rid="bib1.bibx29 bib1.bibx36" id="paren.21"/>, which lack
quantification of uncertainty and where large areas remain parameterized with
limited country-specific information. This challenge is not unique for Latin
America as many regions around the world (e.g., Africa, Siberia) have limited
SOC information to parameterize models to estimate the SOC pool. To inform
future SOC mapping efforts, this study addresses two specific questions:
(a) which environmental variables (derived from publicly available
information) have the highest correlations with country-specific SOC
information, and (b) which method (i.e., predictive algorithm)
is best to represent SOC across Latin America and within each country. We assumed
that methods could inform each other as they may explain different aspects of
SOC variability. The ultimate aim of this study is to empower capacities for
digital SOC mapping across Latin America and to contribute to the
discussion about the importance of integrating country-specific information
for representing and predicting soil-related variables (e.g., SOC) to improve
regional-to-global SOC predictions.</p>
</sec>
<sec id="Ch1.S2">
  <title>Methods</title>
      <p id="d1e1080">We based our methodological approach on public sources of information and
methods implemented in open-source platforms for statistical computing. Thus,
our framework for modeling SOC stocks (Fig. <xref ref-type="fig" rid="Ch1.F1"/>) could be
reproduced across the world for comparative purposes between country-specific
and global estimates.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1" specific-use="star"><caption><p id="d1e1087">Flow diagram of the main
methodological steps that we performed in order to generate country-specific
and regional SOC predictions. The World Soil Information Service (WoSIS)
dataset was harmonized with the <uri>http://worldgrids.org</uri> (last access:
20 February 2018) environmental data using 5 <inline-formula><mml:math id="M30" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 5 km grids. SOC
stocks were calculated at points and correlated predictors identified. Five
methods were parameterized and we created an ensemble of using a generalized
linear approach. Accuracy of models and the ensembles was assessed with
repeated cross-validation. Country-specific and regional (Latin America)
ensembles were compared with global models. KK is kernel-weighted nearest
neighbors, SVM is support vector machines, RF is random forests, PL is
partial least squares regression, and RK is regression kriging.</p></caption>
        <?xmltex \igopts{width=441.017717pt}?><graphic xlink:href="https://soil.copernicus.org/articles/4/173/2018/soil-4-173-2018-f01.png"/>

      </fig>

<sec id="Ch1.S2.SS1">
  <title>SOC observations</title>
      <p id="d1e1111">Soil organic carbon information was extracted from the World Soil Information
Service (WoSIS) soil profile database. This dataset represents a great
harmonization effort in which a large number of national legacy datasets have
been compiled. It includes local-to-national soil profile collections with a
sampling strategy generally based on morphological soil attributes
<xref ref-type="bibr" rid="bib1.bibx5" id="paren.22"/>. The goal of the GSOCmap-GSP is to produce global
information for the first 30 cm; thus, we generated synthetic horizons for
this depth using a mass-preserving spline approach <xref ref-type="bibr" rid="bib1.bibx6" id="paren.23"/>. We
applied a pedotransfer function based on organic matter (OM) if the bulk
density (BLD) information was missing: BLD is <inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0.6268</mml:mn><mml:mo>+</mml:mo><mml:mn mathvariant="normal">0.0361</mml:mn><mml:mo>×</mml:mo><mml:mi mathvariant="normal">OM</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> <xref ref-type="bibr" rid="bib1.bibx72" id="paren.24"/>. We decided to use this equation because it
showed less extreme values than other available pedotransfer functions during
preliminary discussion and training exercises (data not shown). Another
reason is that there is not a single pedotransfer function applicable to all
conditions across Latin America. This equation is representative for soils
with organic matter content between 0.17 and 13.5 % <xref ref-type="bibr" rid="bib1.bibx16" id="paren.25"/>. For
coarse fragments (CRFVOL), a value of 0 % was used for missing
information prior to the mass-preservative spline modeling. SOC estimates (0
to 30 cm) were derived following a standardized SOC calculation method
<xref ref-type="bibr" rid="bib1.bibx50" id="paren.26"/> (Eq. 2):
            <disp-formula id="Ch1.E2" content-type="numbered"><mml:math id="M32" display="block"><mml:mrow><mml:msub><mml:mi mathvariant="normal">SOC</mml:mi><mml:mi mathvariant="normal">stock</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mi mathvariant="normal">ORCDR</mml:mi><mml:mn mathvariant="normal">1000</mml:mn></mml:mfrac></mml:mstyle><mml:mo>×</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mi>H</mml:mi><mml:mn mathvariant="normal">100</mml:mn></mml:mfrac></mml:mstyle><mml:mo>×</mml:mo><mml:mi mathvariant="normal">BLD</mml:mi><mml:mo>×</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>(</mml:mo><mml:mn mathvariant="normal">100</mml:mn><mml:mo>-</mml:mo><mml:mi mathvariant="normal">CRFVOL</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mn mathvariant="normal">100</mml:mn></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where ORCDR is SOC density (<inline-formula><mml:math id="M33" display="inline"><mml:mrow><mml:mi mathvariant="normal">g</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">kg</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>) and <inline-formula><mml:math id="M34" display="inline"><mml:mi>H</mml:mi></mml:math></inline-formula> is soil depth
(30 <inline-formula><mml:math id="M35" display="inline"><mml:mi mathvariant="normal">cm</mml:mi></mml:math></inline-formula>).</p>
      <p id="d1e1235">Because of the limitations and uncertainty in the available BD and CRFVOL
data, we also included an error approximation of SOC estimates. This error
was derived using Global Soil Information Facilities (GSIF;
<xref ref-type="bibr" rid="bib1.bibx26" id="altparen.27"/>) as explained in the next section.</p>
</sec>
<sec id="Ch1.S2.SS2">
  <title>SOC error estimates</title>
      <p id="d1e1247">The GSIF approach for estimating SOC (function OCSKGM) includes an
approximate error which we used to quantify the reliability of SOC estimates
<xref ref-type="bibr" rid="bib1.bibx29" id="paren.28"/>. This error was approximated using the Taylor series
method, by a truncated Taylor series centered by the means explained
previously <xref ref-type="bibr" rid="bib1.bibx32" id="paren.29"/>. We mapped the error trend of SOC estimates
by interpolating the values on a per country basis using the generic
framework for predictive modeling based on machine learning and buffer
(geographical) distances <xref ref-type="bibr" rid="bib1.bibx30" id="paren.30"/>. We followed
this method to provide a spatial explicit measure of the SOC estimation
error. We used this method because it can be implemented without prediction
factors (e.g., only buffer distances) and because it is practically free of
assumptions but considers the geographical proximity to and composition of
the sampling location points as explained by its developers
<xref ref-type="bibr" rid="bib1.bibx30" id="paren.31"/>. SOC error estimates represent a
component of uncertainty of the overall quality of country-specific input
data.</p>
</sec>
<sec id="Ch1.S2.SS3">
  <title>SOC training data and exploratory analysis</title>
      <p id="d1e1269">Each country-specific SOC dataset was transformed to its natural logarithm to
reduce the right-skewed distribution of SOC values and because exploratory
analysis showed that this transformation can improve the prediction capacity
of further modeling methods. To analyze the statistical distribution of SOC
values, a probability distribution function was<?pagebreak page177?> plotted and a Shapiro–Wilk
test of normality was conducted on each dataset. The units of the SOC
estimates are <inline-formula><mml:math id="M36" display="inline"><mml:mrow><mml:mi mathvariant="normal">kg</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>. Our global (Latin America) dataset of
11 268 SOC estimates was divided using a simple bootstrapping technique
<xref ref-type="bibr" rid="bib1.bibx37" id="paren.32"/> and 25 % of data were used for independent validation
purposes, and the remaining 75 % of data for training prediction models.
We coupled this information with a public source of prediction factors; see
Sect. 2.4.</p>
</sec>
<sec id="Ch1.S2.SS4">
  <title>Soils prediction factors</title>
      <p id="d1e1298">We used environmental information from WorldGrids (worldgrids.org), which is
an initiative of ISRIC-World Soil Information. We downloaded and masked 118
environmental layers (i.e., prediction factors) for each country to
quantitatively represent the soil-forming environment
(<uri>http://worldgrids.org/doku.php/wiki:layers</uri>, last access:
20 February 2018). The prediction factors were harmonized into a
1 <inline-formula><mml:math id="M37" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 1 km global grid by the WorldGrids project from three main
information sources: remote sensing, climate surfaces, and digital terrain
analysis. Additional terrain parameters (e.g., terrain slope, aspect,
catchment area, channel network base level, terrain curvature, topographic
wetness index, and length–slope factor) from elevation data were calculated
in the System for Automated Geoscientific Analyses geographic information
system (SAGA GIS) for each country following the standard implementation for
basic terrain parameters <xref ref-type="bibr" rid="bib1.bibx13" id="paren.33"/>. We resampled the prediction
factors into a 5 <inline-formula><mml:math id="M38" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 5 km pixel size grid to reduce the computational
demand required to make predictions and facilitate the reproducibility of
this DSM framework without the need for high-performance computing.</p>
</sec>
<sec id="Ch1.S2.SS5">
  <title>Prediction of SOC</title>
      <p id="d1e1327">We made predictions on a country-specific and on a regional (Latin American)
basis. We based our prediction framework on the following six steps:
<list list-type="bullet"><list-item>
      <p id="d1e1332">First, the relationship between SOC and prediction factors was explored
using simple correlation analysis.</p></list-item><list-item>
      <?pagebreak page178?><p id="d1e1336">Second, the 10 prediction factors with highest correlations with SOC data
were identified for each country and used for further analyses.</p></list-item><list-item>
      <p id="d1e1340">Third, we explored, parameterized, and compared five statistical methods
with different assumptions to model SOC variability across Latin America:
regression kriging (based on a multiple linear regression model (RK) and
partial least squares (PLS) regression, support vector machines (SVMs), random
forests (RF), and kernel-weighted nearest neighbors (KK). A brief explanation
for each modeling approach is provided in Appendix A.</p></list-item><list-item>
      <p id="d1e1344">Fourth, we used a five times repeated 5-fold cross-validation strategy
of the aforementioned models to estimate the RMSE. Then, we used the
caretEnsemble tools for stacking the five predictions <xref ref-type="bibr" rid="bib1.bibx14 bib1.bibx37" id="paren.34"/>. The
caretEnsemble approach uses the RMSE to weight and create ensembles of regression models
under a generalized approach to create a linear blend of predictions.</p></list-item><list-item>
      <p id="d1e1351">Fifth, we calculated independent model residuals (by predicting the
25 % of data not used for model parameterization). For each
5 <inline-formula><mml:math id="M39" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 5 km pixel, we estimated the full conditional response of these
residuals to the SOC prediction factors following the quantile regression
method available within the quantregForest modeling framework <xref ref-type="bibr" rid="bib1.bibx45 bib1.bibx44" id="paren.35"/>. We used this map as a surrogate of model
uncertainty complementary to the approximated error trend of SOC estimates.</p></list-item><list-item>
      <p id="d1e1365">Sixth, we used all Latin American data in the WoSIS system to repeat
the fourth and fifth steps of our modeling framework, generating regional
predictions of SOC and comparing with country-specific results and global SOC
estimates. We also evaluated the prediction capacity of these models.</p></list-item></list></p>
</sec>
<sec id="Ch1.S2.SS6">
  <title>Model evaluation and accuracy</title>
      <p id="d1e1374">First, we selected the optimal parameters for each model/country by the means
of a 10-fold cross-validation strategy following a generic recommendation
<xref ref-type="bibr" rid="bib1.bibx9" id="paren.36"/> (see parameter description in
Appendix A). For each model, the train function of the caret package
<xref ref-type="bibr" rid="bib1.bibx37" id="paren.37"/> included simple resampling techniques for automatic model
parameter selection. Thus, we obtained unbiased residuals for each
model/country that we compared using Taylor diagrams <xref ref-type="bibr" rid="bib1.bibx12" id="paren.38"/>. A Taylor
diagram summarizes multiple aspects of model performance, such as the
agreement and variance between observed and predicted values
<xref ref-type="bibr" rid="bib1.bibx63" id="paren.39"/>. In a Taylor diagram, each model is represented by a point
in the plot describing how well the patterns of observed and modeled values match
each other. Two models have a similar predictive capacity if they overlap
across the intersection of an error vector, a variance ratio, and a
correlation vector.</p>
      <p id="d1e1389">We analyzed the overall ratio (EC<inline-formula><mml:math id="M40" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">r</mml:mi></mml:msub></mml:math></inline-formula>) between model errors (RMSE)
and the correlation between observed and predicted values (corr) for each
model across all countries. We propose this ratio EC<inline-formula><mml:math id="M41" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">r</mml:mi></mml:msub></mml:math></inline-formula> as an
approach to better understand the agreement between the correlation
(calculated by the means of cross-validation) and the RMSE (derived from the
unbiased residuals of cross-validation). Before calculating the
RMSE <inline-formula><mml:math id="M42" display="inline"><mml:mo>/</mml:mo></mml:math></inline-formula> correlation ratio, the RMSE and the correlation between observed
and predicted values were standardized (by its maximum and minimum values) to a
range between 0 and 1 using

                <disp-formula specific-use="align" content-type="numbered"><mml:math id="M43" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E3"><mml:mtd/><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mtext>RMSE</mml:mtext><mml:mi mathvariant="normal">SD</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mtext>RMSE</mml:mtext><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mo>min⁡</mml:mo><mml:mo>(</mml:mo><mml:mtext>RMSE</mml:mtext><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="normal">range</mml:mi><mml:mo>(</mml:mo><mml:mtext>RMSE</mml:mtext><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E4"><mml:mtd/><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mtext>corr</mml:mtext><mml:mi mathvariant="normal">SD</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mtext>corr</mml:mtext><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mo>min⁡</mml:mo><mml:mo>(</mml:mo><mml:mtext>corr</mml:mtext><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="normal">range</mml:mi><mml:mo>(</mml:mo><mml:mtext>corr</mml:mtext><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E5"><mml:mtd/><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mi mathvariant="normal">EC</mml:mi><mml:mi mathvariant="normal">r</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mtext>RMSE</mml:mtext><mml:mi mathvariant="normal">SD</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mtext>corr</mml:mtext><mml:mi mathvariant="normal">SD</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

            where EC<inline-formula><mml:math id="M44" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">r</mml:mi></mml:msub></mml:math></inline-formula> is the proposed ratio between errors and correlation
between observed and predicted values; <inline-formula><mml:math id="M45" display="inline"><mml:mrow><mml:msub><mml:mtext>RMSE</mml:mtext><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the observed RMSE
for the <inline-formula><mml:math id="M46" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th model; <inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:mo>min⁡</mml:mo><mml:mo>(</mml:mo><mml:mtext>RMSE</mml:mtext><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the minimum observed value of
RMSE, and <inline-formula><mml:math id="M48" display="inline"><mml:mrow><mml:mi mathvariant="normal">range</mml:mi><mml:mo>(</mml:mo><mml:mtext>RMSE</mml:mtext><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the difference between the maximum
and minimum observed values of RMSE; <inline-formula><mml:math id="M49" display="inline"><mml:mrow><mml:msub><mml:mtext>corr</mml:mtext><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the observed
correlation for the <inline-formula><mml:math id="M50" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th model; <inline-formula><mml:math id="M51" display="inline"><mml:mrow><mml:mo>min⁡</mml:mo><mml:mo>(</mml:mo><mml:mtext>corr</mml:mtext><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the minimum observed
value of correlation, and <inline-formula><mml:math id="M52" display="inline"><mml:mrow><mml:mi mathvariant="normal">range</mml:mi><mml:mo>(</mml:mo><mml:mtext>corr</mml:mtext><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the difference
between the maximum and minimum observed values of correlation</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2" specific-use="star"><caption><p id="d1e1640">Spatial distribution of available SOC in WoSIS for Latin America.
SOC estimates are calculated for each point using Eq. (2) <bold>(a)</bold>. The
approximated error is based on Taylor series as implemented in the R-GSIF
package, as is explained in <xref ref-type="bibr" rid="bib1.bibx32" id="text.40"/> <bold>(b)</bold>. Thus,
panel <bold>(b)</bold> represents the uncertainty of SOC estimates at each point. The
values of this map could be associated with data limitations and missing
information for BLD and CRFVOL.</p></caption>
          <?xmltex \igopts{width=412.564961pt}?><graphic xlink:href="https://soil.copernicus.org/articles/4/173/2018/soil-4-173-2018-f02.jpg"/>

        </fig>

      <p id="d1e1661">If the value of the EC<inline-formula><mml:math id="M53" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">r</mml:mi></mml:msub></mml:math></inline-formula> was close to 0, then there was a stronger
agreement between high RMSE and low correlation, or low RMSE and high
correlation. If this value deviated from 0 (up to 1 or more), then the RMSE
would tend to be high while the correlation was also high, suggesting that
the method represented the variability of SOC but with high bias.</p>
      <p id="d1e1674">Model accuracy (also represented by the RMSE and <inline-formula><mml:math id="M54" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>) was assessed for the
model ensembles with a more strict (but computationally expensive) 5-fold and
five times repeated cross-validation strategy. This model refitting allowed
more stable accuracy results with the ultimate goal of comparing
country-specific and regional (Latin America) estimates. Repeated 10- and
5-fold cross-validation have been used to compare both machine learning and
geostatistical approaches for mapping soil properties from book examples to
real applications at the global scale <xref ref-type="bibr" rid="bib1.bibx30 bib1.bibx29" id="paren.41"/>. In addition, independent model residuals were also obtained from
the 25 % of data not used for the country-specific and regional ensembles
to estimate a spatially explicit measure of uncertainty (as explained in
step five of our prediction framework).</p>
</sec>
<?pagebreak page179?><sec id="Ch1.S2.SS7">
  <title>SOC stocks</title>
      <p id="d1e1697">First, we analyzed the influence of the maximum allowed prediction limits for
each prediction algorithm. We harmonized the units of our SOC estimates with
global datasets in <inline-formula><mml:math id="M55" display="inline"><mml:mrow><mml:mi mathvariant="normal">Mg</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">ha</mml:mi></mml:mrow></mml:math></inline-formula> (megagrams per hectare at 30 cm depth). The
sensitivity of the total SOC stock to the model prediction limit was tested
by increasing (every 10 <inline-formula><mml:math id="M56" display="inline"><mml:mrow><mml:mi mathvariant="normal">Mg</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">ha</mml:mi></mml:mrow></mml:math></inline-formula>) the maximum prediction limit from
0.5 <inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:mi mathvariant="normal">Mg</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">ha</mml:mi></mml:mrow></mml:math></inline-formula>. until finding a stable rate. Geopolitical limits were
obtained from the Global Administrative areas project
(<uri>https://gadm.org/</uri>, last access: 16 July 2018). Using these country
limits we report our country-specific and Latin American SOC estimates. For
comparative purposes, we also extracted for each country the global SOC
estimates from the SoilGrids system <xref ref-type="bibr" rid="bib1.bibx29" id="paren.42"/>, the Harmonized World
Soil Database <xref ref-type="bibr" rid="bib1.bibx36" id="paren.43"/>, and the GSOCmap-GSP (see
<uri>http://54.229.242.119/apps/GSOCmap.html</uri>, last access: 16 July 2018). We
also report stocks across the land cover classes derived from the Latin
American Network for Monitoring and Studying of Natural Resources, a product
with an estimated accuracy of 84 % <xref ref-type="bibr" rid="bib1.bibx7" id="paren.44"/>. We report the
overall uncertainty of these stocks with the independent model residuals map
and the approximated error trend of the SOC estimates. Some countries with no
data were filled with the average of the surrounding extent of the SOC
predictions. All analyses were performed using the R software <xref ref-type="bibr" rid="bib1.bibx55" id="paren.45"/>.</p><?xmltex \hack{\newpage}?>
</sec>
</sec>
<sec id="Ch1.S3">
  <title>Results</title>
<sec id="Ch1.S3.SS1">
  <title>Descriptive statistics</title>
      <p id="d1e1765">SOC across different countries showed a wide diversity of data scenarios
(Table <xref ref-type="table" rid="Ch1.T1"/>). Costa Rica (with a mean of
11.05 <inline-formula><mml:math id="M58" display="inline"><mml:mrow><mml:mi mathvariant="normal">kg</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>), Chile (with a mean of 9.88 <inline-formula><mml:math id="M59" display="inline"><mml:mrow><mml:mi mathvariant="normal">kg</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>), and
Colombia (with a mean of 8.15 <inline-formula><mml:math id="M60" display="inline"><mml:mrow><mml:mi mathvariant="normal">kg</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>) are the countries with the
highest SOC mean values. Brazil (<inline-formula><mml:math id="M61" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">5616</mml:mn></mml:mrow></mml:math></inline-formula>) and Mexico (<inline-formula><mml:math id="M62" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">4321</mml:mn></mml:mrow></mml:math></inline-formula>) were the
countries with highest data availability. In contrast, Honduras (<inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">11</mml:mn></mml:mrow></mml:math></inline-formula>),
Guatemala (<inline-formula><mml:math id="M64" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula>), and Belize (<inline-formula><mml:math id="M65" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">21</mml:mn></mml:mrow></mml:math></inline-formula>) were the countries with lowest
density of SOC estimated values (Table <xref ref-type="table" rid="Ch1.T1"/>). With the original
(untransformed) dataset, the only countries that showed a normal distribution
after the Shapiro–Wilk test of normality with an <inline-formula><mml:math id="M66" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> of 0.05 were
Belize, Guatemala, Honduras, and Suriname.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T1" specific-use="star"><caption><p id="d1e1885">Descriptive statistics of SOC estimates (in <inline-formula><mml:math id="M67" display="inline"><mml:mrow><mml:mi mathvariant="normal">kg</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>) and
total land area for each analyzed country. <inline-formula><mml:math id="M68" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> is the number of observations.
We provide quantiles, median, mean, and the standard deviation of SOC data.
The columns <inline-formula><mml:math id="M69" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M70" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mi>log⁡</mml:mi></mml:mrow></mml:math></inline-formula> represent the probability values derived from the
Shapiro–Wilk test of normality before <inline-formula><mml:math id="M71" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> and after <inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mi>log⁡</mml:mi></mml:mrow></mml:math></inline-formula> the log
transformation of SOC values. When <inline-formula><mml:math id="M73" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> is larger than <inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mi>log⁡</mml:mi></mml:mrow></mml:math></inline-formula>, the log
transformation of the data did not increase the probability of normality in
the dataset. For comparative purposes, we provide (Fig. S1 in the Supplement)
the probability distribution functions of available data before and after the
log transformations. ARG is Argentina, BLZ is Belize,
BOL is Bolivia, BRA is Brazil, CHL is Chile, COL is Colombia,
CRI is Costa Rica, CUB is Cuba, ECU is Ecuador, GTM is Guatemala,
HND is Honduras, JAM is Jamaica, MEX is Mexico,
NIC is Nicaragua, PAN is Panama, PER is Peru, SUR is Suriname,
SLV is El Salvador, URY is Uruguay, and VEN is Venezuela.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="11">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:colspec colnum="8" colname="col8" align="right"/>
     <oasis:colspec colnum="9" colname="col9" align="right"/>
     <oasis:colspec colnum="10" colname="col10" align="right"/>
     <oasis:colspec colnum="11" colname="col11" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Country</oasis:entry>
         <oasis:entry colname="col2"><italic>n</italic></oasis:entry>
         <oasis:entry colname="col3">Land area (<inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="normal">km</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col4">Min.</oasis:entry>
         <oasis:entry colname="col5">First <inline-formula><mml:math id="M76" display="inline"><mml:mi>Q</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">Med.</oasis:entry>
         <oasis:entry colname="col7">Mean</oasis:entry>
         <oasis:entry colname="col8">Third <inline-formula><mml:math id="M77" display="inline"><mml:mi>Q</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9">Max.</oasis:entry>
         <oasis:entry colname="col10">SD</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M78" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>/</mml:mo><mml:mi>p</mml:mi><mml:mi>log⁡</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">ARG</oasis:entry>
         <oasis:entry colname="col2">231</oasis:entry>
         <oasis:entry colname="col3">2 736 690</oasis:entry>
         <oasis:entry colname="col4">0.34</oasis:entry>
         <oasis:entry colname="col5">1.88</oasis:entry>
         <oasis:entry colname="col6">3.21</oasis:entry>
         <oasis:entry colname="col7">5.65</oasis:entry>
         <oasis:entry colname="col8">5.96</oasis:entry>
         <oasis:entry colname="col9">86.85</oasis:entry>
         <oasis:entry colname="col10">9.33</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M79" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">0.03</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">BLZ</oasis:entry>
         <oasis:entry colname="col2">21</oasis:entry>
         <oasis:entry colname="col3">22 970</oasis:entry>
         <oasis:entry colname="col4">1.84</oasis:entry>
         <oasis:entry colname="col5">4.49</oasis:entry>
         <oasis:entry colname="col6">6.72</oasis:entry>
         <oasis:entry colname="col7">7.71</oasis:entry>
         <oasis:entry colname="col8">9.99</oasis:entry>
         <oasis:entry colname="col9">19.48</oasis:entry>
         <oasis:entry colname="col10">4.32</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M80" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.08</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">0.99</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">BOL</oasis:entry>
         <oasis:entry colname="col2">76</oasis:entry>
         <oasis:entry colname="col3">1 083 301</oasis:entry>
         <oasis:entry colname="col4">0.64</oasis:entry>
         <oasis:entry colname="col5">1.83</oasis:entry>
         <oasis:entry colname="col6">2.56</oasis:entry>
         <oasis:entry colname="col7">2.64</oasis:entry>
         <oasis:entry colname="col8">3.20</oasis:entry>
         <oasis:entry colname="col9">7.65</oasis:entry>
         <oasis:entry colname="col10">1.21</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M81" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">0.08</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">BRA</oasis:entry>
         <oasis:entry colname="col2">5616</oasis:entry>
         <oasis:entry colname="col3">8 358 140</oasis:entry>
         <oasis:entry colname="col4">0.07</oasis:entry>
         <oasis:entry colname="col5">1.99</oasis:entry>
         <oasis:entry colname="col6">2.67</oasis:entry>
         <oasis:entry colname="col7">3.23</oasis:entry>
         <oasis:entry colname="col8">3.34</oasis:entry>
         <oasis:entry colname="col9">573.76</oasis:entry>
         <oasis:entry colname="col10">9.18</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M82" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn><mml:mo>/</mml:mo><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">CHL</oasis:entry>
         <oasis:entry colname="col2">44</oasis:entry>
         <oasis:entry colname="col3">743 812</oasis:entry>
         <oasis:entry colname="col4">0.43</oasis:entry>
         <oasis:entry colname="col5">3.58</oasis:entry>
         <oasis:entry colname="col6">5.19</oasis:entry>
         <oasis:entry colname="col7">9.88</oasis:entry>
         <oasis:entry colname="col8">16.52</oasis:entry>
         <oasis:entry colname="col9">31.87</oasis:entry>
         <oasis:entry colname="col10">8.86</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M83" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">0.01</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">COL</oasis:entry>
         <oasis:entry colname="col2">166</oasis:entry>
         <oasis:entry colname="col3">1 038 700</oasis:entry>
         <oasis:entry colname="col4">0.66</oasis:entry>
         <oasis:entry colname="col5">3.44</oasis:entry>
         <oasis:entry colname="col6">5.78</oasis:entry>
         <oasis:entry colname="col7">8.15</oasis:entry>
         <oasis:entry colname="col8">9.95</oasis:entry>
         <oasis:entry colname="col9">52.62</oasis:entry>
         <oasis:entry colname="col10">7.35</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M84" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">0.96</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">CRI</oasis:entry>
         <oasis:entry colname="col2">43</oasis:entry>
         <oasis:entry colname="col3">51 060</oasis:entry>
         <oasis:entry colname="col4">2.27</oasis:entry>
         <oasis:entry colname="col5">4.07</oasis:entry>
         <oasis:entry colname="col6">7.23</oasis:entry>
         <oasis:entry colname="col7">11.05</oasis:entry>
         <oasis:entry colname="col8">10.85</oasis:entry>
         <oasis:entry colname="col9">82.57</oasis:entry>
         <oasis:entry colname="col10">14.90</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M85" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">CUB</oasis:entry>
         <oasis:entry colname="col2">48</oasis:entry>
         <oasis:entry colname="col3">109 820</oasis:entry>
         <oasis:entry colname="col4">0.36</oasis:entry>
         <oasis:entry colname="col5">2.85</oasis:entry>
         <oasis:entry colname="col6">3.61</oasis:entry>
         <oasis:entry colname="col7">4.32</oasis:entry>
         <oasis:entry colname="col8">5.73</oasis:entry>
         <oasis:entry colname="col9">10.98</oasis:entry>
         <oasis:entry colname="col10">2.23</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M86" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.004</mml:mn><mml:mo>/</mml:mo><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">ECU</oasis:entry>
         <oasis:entry colname="col2">77</oasis:entry>
         <oasis:entry colname="col3">276 841</oasis:entry>
         <oasis:entry colname="col4">0.99</oasis:entry>
         <oasis:entry colname="col5">2.37</oasis:entry>
         <oasis:entry colname="col6">3.65</oasis:entry>
         <oasis:entry colname="col7">5.15</oasis:entry>
         <oasis:entry colname="col8">4.36</oasis:entry>
         <oasis:entry colname="col9">24.36</oasis:entry>
         <oasis:entry colname="col10">5.15</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M87" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn><mml:mo>/</mml:mo><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">GTM</oasis:entry>
         <oasis:entry colname="col2">20</oasis:entry>
         <oasis:entry colname="col3">107 159</oasis:entry>
         <oasis:entry colname="col4">2.60</oasis:entry>
         <oasis:entry colname="col5">5.66</oasis:entry>
         <oasis:entry colname="col6">8.48</oasis:entry>
         <oasis:entry colname="col7">7.73</oasis:entry>
         <oasis:entry colname="col8">9.75</oasis:entry>
         <oasis:entry colname="col9">12.41</oasis:entry>
         <oasis:entry colname="col10">3.11</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M88" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.14</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">0.007</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">HND</oasis:entry>
         <oasis:entry colname="col2">11</oasis:entry>
         <oasis:entry colname="col3">111 890</oasis:entry>
         <oasis:entry colname="col4">2.69</oasis:entry>
         <oasis:entry colname="col5">5.25</oasis:entry>
         <oasis:entry colname="col6">6.48</oasis:entry>
         <oasis:entry colname="col7">6.71</oasis:entry>
         <oasis:entry colname="col8">8.32</oasis:entry>
         <oasis:entry colname="col9">12.38</oasis:entry>
         <oasis:entry colname="col10">2.78</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M89" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.72</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">0.39</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">JAM</oasis:entry>
         <oasis:entry colname="col2">76</oasis:entry>
         <oasis:entry colname="col3">10 831</oasis:entry>
         <oasis:entry colname="col4">1.29</oasis:entry>
         <oasis:entry colname="col5">3.01</oasis:entry>
         <oasis:entry colname="col6">3.99</oasis:entry>
         <oasis:entry colname="col7">4.35</oasis:entry>
         <oasis:entry colname="col8">4.83</oasis:entry>
         <oasis:entry colname="col9">12.90</oasis:entry>
         <oasis:entry colname="col10">1.99</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M90" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">0.72</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">MEX</oasis:entry>
         <oasis:entry colname="col2">4321</oasis:entry>
         <oasis:entry colname="col3">1 943 945</oasis:entry>
         <oasis:entry colname="col4">0.00</oasis:entry>
         <oasis:entry colname="col5">1.73</oasis:entry>
         <oasis:entry colname="col6">2.49</oasis:entry>
         <oasis:entry colname="col7">2.56</oasis:entry>
         <oasis:entry colname="col8">3.25</oasis:entry>
         <oasis:entry colname="col9">35.55</oasis:entry>
         <oasis:entry colname="col10">1.49</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M91" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn><mml:mo>/</mml:mo><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">NIC</oasis:entry>
         <oasis:entry colname="col2">26</oasis:entry>
         <oasis:entry colname="col3">119 990</oasis:entry>
         <oasis:entry colname="col4">2.93</oasis:entry>
         <oasis:entry colname="col5">3.94</oasis:entry>
         <oasis:entry colname="col6">7.31</oasis:entry>
         <oasis:entry colname="col7">7.50</oasis:entry>
         <oasis:entry colname="col8">9.04</oasis:entry>
         <oasis:entry colname="col9">15.91</oasis:entry>
         <oasis:entry colname="col10">3.78</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M92" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.05</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">0.09</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">PAN</oasis:entry>
         <oasis:entry colname="col2">25</oasis:entry>
         <oasis:entry colname="col3">74 177</oasis:entry>
         <oasis:entry colname="col4">3.39</oasis:entry>
         <oasis:entry colname="col5">4.90</oasis:entry>
         <oasis:entry colname="col6">7.53</oasis:entry>
         <oasis:entry colname="col7">7.59</oasis:entry>
         <oasis:entry colname="col8">9.13</oasis:entry>
         <oasis:entry colname="col9">19.89</oasis:entry>
         <oasis:entry colname="col10">3.76</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M93" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.003</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">0.49</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">PER</oasis:entry>
         <oasis:entry colname="col2">145</oasis:entry>
         <oasis:entry colname="col3">1 279 996</oasis:entry>
         <oasis:entry colname="col4">0.19</oasis:entry>
         <oasis:entry colname="col5">1.89</oasis:entry>
         <oasis:entry colname="col6">2.93</oasis:entry>
         <oasis:entry colname="col7">2.92</oasis:entry>
         <oasis:entry colname="col8">3.55</oasis:entry>
         <oasis:entry colname="col9">8.35</oasis:entry>
         <oasis:entry colname="col10">1.42</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M94" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.005</mml:mn><mml:mo>/</mml:mo><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SUR</oasis:entry>
         <oasis:entry colname="col2">27</oasis:entry>
         <oasis:entry colname="col3">156 000</oasis:entry>
         <oasis:entry colname="col4">1.38</oasis:entry>
         <oasis:entry colname="col5">2.60</oasis:entry>
         <oasis:entry colname="col6">3.35</oasis:entry>
         <oasis:entry colname="col7">3.37</oasis:entry>
         <oasis:entry colname="col8">4.07</oasis:entry>
         <oasis:entry colname="col9">6.01</oasis:entry>
         <oasis:entry colname="col10">1.20</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M95" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.69</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">0.51</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">URY</oasis:entry>
         <oasis:entry colname="col2">130</oasis:entry>
         <oasis:entry colname="col3">175 015</oasis:entry>
         <oasis:entry colname="col4">0.82</oasis:entry>
         <oasis:entry colname="col5">2.70</oasis:entry>
         <oasis:entry colname="col6">3.38</oasis:entry>
         <oasis:entry colname="col7">4.34</oasis:entry>
         <oasis:entry colname="col8">3.90</oasis:entry>
         <oasis:entry colname="col9">46.54</oasis:entry>
         <oasis:entry colname="col10">4.67</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M96" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn><mml:mo>/</mml:mo><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">VEN</oasis:entry>
         <oasis:entry colname="col2">164</oasis:entry>
         <oasis:entry colname="col3">882 050</oasis:entry>
         <oasis:entry colname="col4">0.31</oasis:entry>
         <oasis:entry colname="col5">2.58</oasis:entry>
         <oasis:entry colname="col6">4.14</oasis:entry>
         <oasis:entry colname="col7">5.92</oasis:entry>
         <oasis:entry colname="col8">6.57</oasis:entry>
         <oasis:entry colname="col9">44.35</oasis:entry>
         <oasis:entry colname="col10">6.37</oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M97" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">0.11</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
<sec id="Ch1.S3.SS2">
  <title>Spatial distribution and point error estimates</title>
      <p id="d1e3005">There were large areas of Latin America with no available SOC observational
data in the WoSIS system (e.g., the south of Chile, Argentina, or across large
areas of Central America). We found substantial error estimates across large
areas with high density of SOC data but low carbon contents, such as northern
Mexico or the Brazilian semiarid savanna located at the eastern side of that
country (Fig. <xref ref-type="fig" rid="Ch1.F2"/>).</p>
</sec>
<sec id="Ch1.S3.SS3">
  <title>Correlation of SOC and its predictors</title>
      <p id="d1e3016">Best correlated predictors were not the same across countries. We found
higher correlations with the original datasets transformed to their natural
logarithm, as data had a<?pagebreak page180?> right-skewed distribution and did not follow a
normal distribution (i.e., log normal). Highest correlations of available SOC
data and their environmental predictors were associated with
temperature-related variables across Honduras, Costa Rica, Peru, Chile,
Guatemala, and Suriname (the <inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> varied from 0.35 to 0.58). However,
there were a low number of available SOC observations across these countries
in the WoSIS system (between 11 to 34). Similarly, across countries with high
data availability (e.g., Mexico and Brazil), the strongest correlations
between SOC and prediction factors were associated with temperature-related
variables (Table 2). In all cases, the relationship between SOC and
temperature-related variables was negative. In contrast, SOC had a positive
relationship with elevation-derived terrain parameters (<inline-formula><mml:math id="M99" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> varied from
0.43 to 0.59) such as terrain curvature, potential incoming solar radiation,
and slope of terrain.</p>
      <p id="d1e3041">Lower correlations of SOC data with prediction factors were found across
Brazil, Bolivia, Uruguay, Cuba, Panama, Venezuela, and Argentina (e.g.,
<inline-formula><mml:math id="M100" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M101" display="inline"><mml:mo>&lt;</mml:mo></mml:math></inline-formula> 0.2). The correlation analysis was useful to formulate a
working hypothesis about the major drivers of the spatial variability of SOC
across countries based on our DSM conceptual framework (e.g.,
SOC<inline-formula><mml:math id="M102" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">ARG</mml:mi></mml:msub></mml:math></inline-formula> <inline-formula><mml:math id="M103" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> <inline-formula><mml:math id="M104" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>[px4wcl3a <inline-formula><mml:math id="M105" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> px3wcl3a <inline-formula><mml:math id="M106" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> evmmod3a
<inline-formula><mml:math id="M107" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> l07igb3a <inline-formula><mml:math id="M108" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> px2wcl3a <inline-formula><mml:math id="M109" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> …]). For example, the best
correlated predictors with SOC for Argentina were precipitation-related
variables (px4wcl3a, px3wcl3a, px2wcl3a), remote-sensing-based vegetation
indexes (evmmod3a), and a probability-based shrubland map (l07igb3a)
(Table <xref ref-type="table" rid="Ch1.T2"/>) (see sources of these maps in
<uri>http://worldgrids.org/doku.php/wiki:layers</uri>, last access:
20 February 2018).</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T2" specific-use="star"><caption><p id="d1e3130">Best correlated predictors and their frequency across the analyzed
data country scenarios, given available data in the WoSIS system; see
predictor codes in <uri>http://worldgrids.org/doku.php/wiki:layers</uri> (last
access: 20 February 2018). ARG is Argentina, BLZ is Belize, BOL is Bolivia,
BRA is Brazil, CHL is Chile, COL is Colombia, CRI is Costa Rica, CUB is Cuba,
DOM is Dominican Republic, ECU is Ecuador, GTM is Guatemala, HND is Honduras,
JAM is Jamaica, MEX is Mexico, NIC is Nicaragua, PAN is Panama, PER is Peru,
SUR is Suriname, SLV is El Salvador, URY is Uruguay, and VEN is Venezuela.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Var</oasis:entry>
         <oasis:entry colname="col2">Factor</oasis:entry>
         <oasis:entry colname="col3">Subfactor</oasis:entry>
         <oasis:entry colname="col4">Freq.</oasis:entry>
         <oasis:entry colname="col5">Country</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">gachws3a</oasis:entry>
         <oasis:entry colname="col2">Soil</oasis:entry>
         <oasis:entry colname="col3">Soil type</oasis:entry>
         <oasis:entry colname="col4">2</oasis:entry>
         <oasis:entry colname="col5">CUB, SUR</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">garhws3a</oasis:entry>
         <oasis:entry colname="col2">Soil</oasis:entry>
         <oasis:entry colname="col3">Soil type</oasis:entry>
         <oasis:entry colname="col4">2</oasis:entry>
         <oasis:entry colname="col5">PER, URY</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">ghshws3a</oasis:entry>
         <oasis:entry colname="col2">Soil</oasis:entry>
         <oasis:entry colname="col3">Soil type</oasis:entry>
         <oasis:entry colname="col4">2</oasis:entry>
         <oasis:entry colname="col5">BLZ, URY</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">gphhws3a</oasis:entry>
         <oasis:entry colname="col2">Soil</oasis:entry>
         <oasis:entry colname="col3">Soil type</oasis:entry>
         <oasis:entry colname="col4">2</oasis:entry>
         <oasis:entry colname="col5">CUB, JAM</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">gplhws3a</oasis:entry>
         <oasis:entry colname="col2">Soil</oasis:entry>
         <oasis:entry colname="col3">Soil type</oasis:entry>
         <oasis:entry colname="col4">2</oasis:entry>
         <oasis:entry colname="col5">BLZ, BOL</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">gvrhws3a</oasis:entry>
         <oasis:entry colname="col2">Soil</oasis:entry>
         <oasis:entry colname="col3">Soil type</oasis:entry>
         <oasis:entry colname="col4">2</oasis:entry>
         <oasis:entry colname="col5">JAM, URY</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">tdmmod3a</oasis:entry>
         <oasis:entry colname="col2">Climate</oasis:entry>
         <oasis:entry colname="col3">Temperature</oasis:entry>
         <oasis:entry colname="col4">11</oasis:entry>
         <oasis:entry colname="col5">ARG, BOL, BRA, CHL, COL, CRI, CUB, ECU, MEX, PER, VEN</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">tx1mod3a</oasis:entry>
         <oasis:entry colname="col2">Climate</oasis:entry>
         <oasis:entry colname="col3">Temperature</oasis:entry>
         <oasis:entry colname="col4">10</oasis:entry>
         <oasis:entry colname="col5">ARG, BOL, BRA, COL, CUB, ECU, JAM, NIC, PER, URY</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">tx4mod3a</oasis:entry>
         <oasis:entry colname="col2">Climate</oasis:entry>
         <oasis:entry colname="col3">Temperature</oasis:entry>
         <oasis:entry colname="col4">10</oasis:entry>
         <oasis:entry colname="col5">BRA, CHL, CRI, CUB, ECU, GTM, JAM, MEX, PER, VEN</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">tx5mod3a</oasis:entry>
         <oasis:entry colname="col2">Climate</oasis:entry>
         <oasis:entry colname="col3">Temperature</oasis:entry>
         <oasis:entry colname="col4">9</oasis:entry>
         <oasis:entry colname="col5">BOL, BRA, CHL, CUB, ECU, JAM, MEX, PER, VEN</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">tx6mod3a</oasis:entry>
         <oasis:entry colname="col2">Climate</oasis:entry>
         <oasis:entry colname="col3">Temperature</oasis:entry>
         <oasis:entry colname="col4">9</oasis:entry>
         <oasis:entry colname="col5">ARG, BOL, BRA, CHL, COL, CRI, ECU, MEX, VEN</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">tnhmod3a</oasis:entry>
         <oasis:entry colname="col2">Climate</oasis:entry>
         <oasis:entry colname="col3">Temperature</oasis:entry>
         <oasis:entry colname="col4">8</oasis:entry>
         <oasis:entry colname="col5">BLZ, COL, CRI, GTM, HND, JAM, PAN, VEN</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">tnmmod3a</oasis:entry>
         <oasis:entry colname="col2">Climate</oasis:entry>
         <oasis:entry colname="col3">Temperature</oasis:entry>
         <oasis:entry colname="col4">8</oasis:entry>
         <oasis:entry colname="col5">BLZ, COL, CRI, GTM, HND, PAN, URY, VEN</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">tx3mod3a</oasis:entry>
         <oasis:entry colname="col2">Climate</oasis:entry>
         <oasis:entry colname="col3">Temperature</oasis:entry>
         <oasis:entry colname="col4">7</oasis:entry>
         <oasis:entry colname="col5">BRA, CHL, CUB, ECU, PAN, PER, VEN</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">tdhmod3a</oasis:entry>
         <oasis:entry colname="col2">Climate</oasis:entry>
         <oasis:entry colname="col3">Temperature</oasis:entry>
         <oasis:entry colname="col4">6</oasis:entry>
         <oasis:entry colname="col5">ARG, CUB, ECU, JAM, MEX, URY</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">tdlmod3a</oasis:entry>
         <oasis:entry colname="col2">Climate</oasis:entry>
         <oasis:entry colname="col3">Temperature</oasis:entry>
         <oasis:entry colname="col4">6</oasis:entry>
         <oasis:entry colname="col5">BRA, CHL, COL, ECU, GTM, JAM</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">tnsmod3a</oasis:entry>
         <oasis:entry colname="col2">Climate</oasis:entry>
         <oasis:entry colname="col3">Temperature</oasis:entry>
         <oasis:entry colname="col4">5</oasis:entry>
         <oasis:entry colname="col5">ARG, MEX, NIC, PAN, SUR</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">tx2mod3a</oasis:entry>
         <oasis:entry colname="col2">Climate</oasis:entry>
         <oasis:entry colname="col3">Temperature</oasis:entry>
         <oasis:entry colname="col4">4</oasis:entry>
         <oasis:entry colname="col5">ARG, ECU, PER, URY</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">tdsmod3a</oasis:entry>
         <oasis:entry colname="col2">Climate</oasis:entry>
         <oasis:entry colname="col3">Temperature</oasis:entry>
         <oasis:entry colname="col4">3</oasis:entry>
         <oasis:entry colname="col5">MEX, PAN, SUR</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">tnlmod3a</oasis:entry>
         <oasis:entry colname="col2">Climate</oasis:entry>
         <oasis:entry colname="col3">Temperature</oasis:entry>
         <oasis:entry colname="col4">3</oasis:entry>
         <oasis:entry colname="col5">BLZ, COL, GTM</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">px2wcl3a</oasis:entry>
         <oasis:entry colname="col2">Climate</oasis:entry>
         <oasis:entry colname="col3">Precipitation</oasis:entry>
         <oasis:entry colname="col4">2</oasis:entry>
         <oasis:entry colname="col5">BOL, PAN</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">px3wcl3a</oasis:entry>
         <oasis:entry colname="col2">Climate</oasis:entry>
         <oasis:entry colname="col3">Precipitation</oasis:entry>
         <oasis:entry colname="col4">2</oasis:entry>
         <oasis:entry colname="col5">CHL, MEX</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">px4wcl3a</oasis:entry>
         <oasis:entry colname="col2">Climate</oasis:entry>
         <oasis:entry colname="col3">Precipitation</oasis:entry>
         <oasis:entry colname="col4">2</oasis:entry>
         <oasis:entry colname="col5">BRA, CHL</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">etmnts3a</oasis:entry>
         <oasis:entry colname="col2">Climate</oasis:entry>
         <oasis:entry colname="col3">ET</oasis:entry>
         <oasis:entry colname="col4">2</oasis:entry>
         <oasis:entry colname="col5">ARG, MEX</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">evmmod3a</oasis:entry>
         <oasis:entry colname="col2">Organism</oasis:entry>
         <oasis:entry colname="col3">Vegetation</oasis:entry>
         <oasis:entry colname="col4">5</oasis:entry>
         <oasis:entry colname="col5">ARG, ECU, HND, MEX, VEN</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">l07igb3a</oasis:entry>
         <oasis:entry colname="col2">Organism</oasis:entry>
         <oasis:entry colname="col3">Vegetation</oasis:entry>
         <oasis:entry colname="col4">2</oasis:entry>
         <oasis:entry colname="col5">ARG, CHL</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">DEMSRE3a</oasis:entry>
         <oasis:entry colname="col2">Topography</oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4">5</oasis:entry>
         <oasis:entry colname="col5">COL, CRI, GTM, HND, SUR</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">twisre3a</oasis:entry>
         <oasis:entry colname="col2">Topography</oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4">5</oasis:entry>
         <oasis:entry colname="col5">BRA, JAM, NIC, PAN, SUR</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">ChannNetworkBLevel</oasis:entry>
         <oasis:entry colname="col2">Topography</oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4">4</oasis:entry>
         <oasis:entry colname="col5">COL, HND, PAN, SUR</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">l3pobi3b</oasis:entry>
         <oasis:entry colname="col2">Topography</oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4">4</oasis:entry>
         <oasis:entry colname="col5">COL, CRI, PAN, VEN</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">inssre3a</oasis:entry>
         <oasis:entry colname="col2">Topography</oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4">3</oasis:entry>
         <oasis:entry colname="col5">BLZ, HND, SUR</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">opisre3a</oasis:entry>
         <oasis:entry colname="col2">Topography</oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4">3</oasis:entry>
         <oasis:entry colname="col5">CRI, NIC, SUR</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SLPSRT3a</oasis:entry>
         <oasis:entry colname="col2">Topography</oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4">3</oasis:entry>
         <oasis:entry colname="col5">CRI, NIC, SUR</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">AnalyticalHillshading</oasis:entry>
         <oasis:entry colname="col2">Topography</oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4">2</oasis:entry>
         <oasis:entry colname="col5">BLZ, CUB</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Aspect</oasis:entry>
         <oasis:entry colname="col2">Topography</oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4">2</oasis:entry>
         <oasis:entry colname="col5">BLZ, BOL</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">CovergenceIndex</oasis:entry>
         <oasis:entry colname="col2">Topography</oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4">2</oasis:entry>
         <oasis:entry colname="col5">BOL, HND</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">inmsre3a</oasis:entry>
         <oasis:entry colname="col2">Topography</oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4">2</oasis:entry>
         <oasis:entry colname="col5">CRI, GTM</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">ValleyDepth</oasis:entry>
         <oasis:entry colname="col2">Topography</oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4">2</oasis:entry>
         <oasis:entry colname="col5">BLZ, JAM</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">geaisg3a</oasis:entry>
         <oasis:entry colname="col2">Age</oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4">3</oasis:entry>
         <oasis:entry colname="col5">CHL, NIC, SUR</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
<sec id="Ch1.S3.SS4">
  <title>SOC-related properties</title>
      <p id="d1e3877">Correlations between SOC density (ORCDR) and prediction factors were higher
with maximum and mean nighttime temperature, where Costa Rica and Chile had
the highest correlations (<inline-formula><mml:math id="M110" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> varied from 0.61 to 0.71). The best
correlated variables with BLD were terrain parameters: relative slope
position, vertical distance to channel network, flow accumulation areas, and
potential incoming solar radiation. These correlations were stronger across
Guatemala, Belize, and Panama (<inline-formula><mml:math id="M111" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> varied from 0.52 to 0.67). We found
that terrain slope and the standard deviation of temperature were the
variables with highest correlations with CRFVOL, where Nicaragua, Honduras,
and Argentina had the highest correlations (<inline-formula><mml:math id="M112" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> varied from 0.40 to
0.55). We did not find a dominant algorithm to predict ORCDR, BLD, and CRFVOL.
Slightly higher correlations between observed and predicted values were
achieved with RF, but in most cases different methods showed<?pagebreak page181?> similar
prediction capacity. The highest prediction error was found with RK for
CRFVOL, but for all other output variables all prediction algorithms had a
similar range of errors (Fig. <xref ref-type="fig" rid="Ch1.F3"/>). The PLS and SVM had the
lowest variance for prediction of each one of the four soil properties. The
<inline-formula><mml:math id="M113" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values for predicting the combined SOC-related properties (i.e.,
ORCDR, CRFVOL, and BLD) for each prediction algorithm were RK (<inline-formula><mml:math id="M114" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> 0.67
to 0.76), RF (<inline-formula><mml:math id="M115" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> 0.56 to 0.74), SVM (<inline-formula><mml:math id="M116" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> 0.32 to 0.71), PL (<inline-formula><mml:math id="M117" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>
0.46 to 0.69), and KK (<inline-formula><mml:math id="M118" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> 0.19 to 0.64). Across countries with lower data
availability and sparse distribution, SVM and RK algorithms resulted in lower
model performance.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3" specific-use="star"><caption><p id="d1e3984">Taylor diagrams showing the performance of the five models evaluated.
SOC stock <bold>(a)</bold>, ORCDR <bold>(b)</bold>, BLD <bold>(c)</bold>, and
CRFVOL <bold>(d)</bold>. This analysis is based on all available data across
Latin America. Although RF tends to generate higher correlation, it also shows
high variance in predictions. The points are close to each other and the
differences in accuracy between them generally fall within the same
intersection of error, variance, and correlation, suggesting a similar
prediction capacity by the implemented approaches.</p></caption>
          <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://soil.copernicus.org/articles/4/173/2018/soil-4-173-2018-f03.png"/>

        </fig>

</sec>
<?pagebreak page182?><sec id="Ch1.S3.SS5">
  <title>Country-specific SOC predictions</title>
      <p id="d1e4012">We did not find a dominant algorithm to predict SOC on a country-specific
basis (Fig. <xref ref-type="fig" rid="Ch1.F4"/>). Overall, machine learning
prediction algorithms generated similar results. Higher agreement of machine
learning prediction algorithms was found in small countries where
environmental conditions and land cover/use characteristics tend to be more
homogeneous (e.g., Jamaica, Suriname). RK showed higher discrepancies in
countries where data distribution was sparse (e.g., Suriname, Chile,
Guatemala) but effective across countries with higher and/or
well-distributed data availability (e.g., Mexico, Brazil). Machine learning SOC
predictions were conservative compared with RK (RK generated the higher
density of extreme and unreliable SOC values). PL had comparable results with
machine learning algorithms (i.e., KK, SVM, RF). From the cross-validation
strategy, higher <inline-formula><mml:math id="M119" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values between observed and predicted data were
found for Costa Rica (0.58; <inline-formula><mml:math id="M120" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">21</mml:mn></mml:mrow></mml:math></inline-formula>) using SVM, while the lowest error was
found in Suriname (0.36 <inline-formula><mml:math id="M121" display="inline"><mml:mrow><mml:mi mathvariant="normal">kg</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>; <inline-formula><mml:math id="M122" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">37</mml:mn></mml:mrow></mml:math></inline-formula>) using PL. In contrast,
algorithms had lower prediction capacity for countries with large areas
(e.g., Brazil, Mexico) despite the large data availability.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4" specific-use="star"><caption><p id="d1e4071">Taylor diagrams showing the performance of the five models evaluated
for country-specific SOC estimates across Latin America. The position of each
point/method varies from each dataset to another, suggesting that the
predictive capacity changes when data characteristics are different.</p></caption>
          <?xmltex \igopts{width=497.923228pt}?><graphic xlink:href="https://soil.copernicus.org/articles/4/173/2018/soil-4-173-2018-f04.png"/>

        </fig>

      <p id="d1e4080">The simple correlation (main effect) between the <inline-formula><mml:math id="M123" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> and RMSE for RF, PL,
KK, and RK was positive (0.18, 0.35, 0.32, and 0.1, respectively). In contrast,
this correlation was stronger for SVM (but negative; <inline-formula><mml:math id="M124" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.65) where
increasing the explained variance resulted in a lower error. Thus, we found a
low level of agreement between these two information criteria (<inline-formula><mml:math id="M125" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> and
RMSE) commonly used in DSM to assess performance of prediction algorithms.</p>
      <?pagebreak page183?><p id="d1e4112">Agreement between the RMSE and <inline-formula><mml:math id="M126" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> was found only in 12 of the 19
countries, resulting in country-specific “recommended” prediction
algorithms. Here, we list the prediction algorithms that generated the best
correlation and the best RMSE for each country: ARG (RK, RK), BLZ (RF, RK),
BOL (SVM, KK), BRA (RF, RF), CHL (PL, PL), COL (RF, RF), CRI (SVM, SVM), CUB
(PL, PL), ECU (RK, RK), GTM (KK, RF), HND (SVM, KK), JAM (RF, RF), MEX (RK ,
RK), NIC (RF, RF), PAN (PL, KK), PER (KK, KK), SUR (SVM, PL), URY (RF, RK),
and VEN (RK, RK) (see country codes in Table 1). Brazil and Mexico had the
highest number of observations (nearly 80 % of the total) and the same
method yielded the highest <inline-formula><mml:math id="M127" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> and the lowest RMSE. We clarify that the best
within-country method was not the same for every country. The higher
EC<inline-formula><mml:math id="M128" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">r</mml:mi></mml:msub></mml:math></inline-formula> was found with PL (0.96), followed by RF (0.54) and KK (0.43),
informing that these predictive algorithms did not minimize prediction bias
while increasing the explained variance. SVM (with 0.008) and RK (with 0.003)
had the lowest EC<inline-formula><mml:math id="M129" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">r</mml:mi></mml:msub></mml:math></inline-formula>, as they maximize the explained variance while
minimizing prediction bias.</p>
</sec>
<sec id="Ch1.S3.SS6">
  <title>Model ensembles and SOC maps</title>
      <p id="d1e4161">High discrepancy was found among country-specific SOC predictions and between
country-specific and regional SOC predictions. Although both maps predict SOC
following a similar general pattern, the country-specific ensemble showed a
higher density of unrealistic patterns across Guatemala, Venezuela, northern
Brazil, and the surroundings of Uruguay (Fig. <xref ref-type="fig" rid="Ch1.F5"/>a). These
areas correspond to areas where we report both higher SOC calculation errors
and model uncertainty (Fig. <xref ref-type="fig" rid="Ch1.F6"/>).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5" specific-use="star"><caption><p id="d1e4170">Country-specific <bold>(a)</bold> and regional (Latin
America) <bold>(b)</bold> predictions of SOC based on a linear ensemble of
methods. We present the units as <inline-formula><mml:math id="M130" display="inline"><mml:mrow><mml:mi mathvariant="normal">Mg</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">ha</mml:mi></mml:mrow></mml:math></inline-formula> for visualization purposes.
These units were used to reduce the digits of the value range and highlight
larger differences between SOC maps.</p></caption>
          <?xmltex \igopts{width=327.206693pt}?><graphic xlink:href="https://soil.copernicus.org/articles/4/173/2018/soil-4-173-2018-f05.png"/>

        </fig>

      <p id="d1e4196">Compared with the country-specific ensemble, the regional model showed
spatial differences predicting higher SOC across the highlands of the Southern
Andes and boundaries of the Amazon Basin (Fig. <xref ref-type="fig" rid="Ch1.F5"/>b). As
expected, the country-specific model showed spatial artifacts associated with
country geopolitical borders. Based on the repeated 5-fold cross-validation,
we report a <inline-formula><mml:math id="M131" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.39</mml:mn></mml:mrow></mml:math></inline-formula> for the regional model and <inline-formula><mml:math id="M132" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values for the
country-specific approach that vary from 0.01 to 0.55.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F6" specific-use="star"><caption><p id="d1e4230">The full conditional response of residuals to the prediction factors
on a country-specific basis <bold>(a)</bold>. The full conditional response of
residuals to the SOC prediction factors in the regional (Latin America)
model <bold>(b)</bold>. The trend of the approximated error of SOC estimates is
derived from buffer distances and the random forest spatial
framework <bold>(c)</bold>.</p></caption>
          <?xmltex \igopts{width=483.69685pt}?><graphic xlink:href="https://soil.copernicus.org/articles/4/173/2018/soil-4-173-2018-f06.png"/>

        </fig>

      <?pagebreak page184?><p id="d1e4248">High uncertainty in our modeling framework was found across tropical, arid,
and semiarid regions of Latin America (Fig. <xref ref-type="fig" rid="Ch1.F6"/>a, b).
Residual uncertainty from independent validation in the country-specific
ensemble showed higher errors across geopolitical borders (in Chile,
Argentina, Colombia, Ecuador, Venezuela, and the Brazilian savanna), while
the residual uncertainty map from the regional model had higher uncertainty
across ecologically meaningful transitions, with no evident effect of
geopolitical borders. The trend of the mean approximated error suggests high
uncertainty in the SOC calculation method (Fig. <xref ref-type="fig" rid="Ch1.F6"/>c). We used
this map just to visualize the general trend of error estimates based only on
geographical buffer distances.</p>
      <p id="d1e4255">Primarily, the Pacific coastal plains, the delta of the Amazon river, some
closed watersheds and wetlands across Mexico, and some sparse points across
Central America showed the higher discrepancies. Mexico and Brazil, with
higher density of SOC data, were the countries with less discrepancy between
country and global models (Fig. <xref ref-type="fig" rid="Ch1.F7"/>a). We report that the
geographical areas where country-specific models tend to predict higher SOC
values than the regional ensemble (Fig. <xref ref-type="fig" rid="Ch1.F7"/>b). However, we report a
similar SOC stock from both modeling approaches (country-specific and global)
as we explain in Sect. 3.7.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F7"><caption><p id="d1e4264">The absolute distance (<inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:mi mathvariant="normal">Mg</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">ha</mml:mi></mml:mrow></mml:math></inline-formula>) between the country-specific
and the regional ensemble <bold>(a)</bold>. The areas in white are areas where
the country-specific modeling is predicting higher SOC than the regional
estimate (i.e., country-specific is greater than regional) <bold>(b)</bold>. </p></caption>
          <?xmltex \igopts{width=207.705118pt}?><graphic xlink:href="https://soil.copernicus.org/articles/4/173/2018/soil-4-173-2018-f07.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS7">
  <title>SOC stocks and model uncertainties</title>
      <p id="d1e4296">For comparative purposes with previous reports (i.e., the SoilGrids system
and the Harmonized World Soil Database), we harmonized the units of our maps
to <inline-formula><mml:math id="M134" display="inline"><mml:mrow><mml:mi mathvariant="normal">Mg</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">ha</mml:mi></mml:mrow></mml:math></inline-formula>, which was also useful for visualization purposes. For our
models, the uncertainty of the maximum prediction limit was estimated to be
<inline-formula><mml:math id="M135" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> Pg, which was the variance of the SOC stock by increasing the
prediction limit from 1 to 700 <inline-formula><mml:math id="M136" display="inline"><mml:mrow><mml:mi mathvariant="normal">Mg</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">ha</mml:mi></mml:mrow></mml:math></inline-formula> (Fig. <xref ref-type="fig" rid="Ch1.F8"/>).<?pagebreak page185?> This
relationship showed a stable (close to 0) trend after 200 <inline-formula><mml:math id="M137" display="inline"><mml:mrow><mml:mi mathvariant="normal">Mg</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">ha</mml:mi></mml:mrow></mml:math></inline-formula>. A
larger density of extreme values was found with the regional model, and we
calculated a maximum possible SOC stock of 83.62 Pg with this model.</p>
      <p id="d1e4344">Despite the spatial differences reported for the country-specific and
regional ensembles, we report a similar stock between both approaches
(77.8 <inline-formula><mml:math id="M138" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 42.2 and 76.8 <inline-formula><mml:math id="M139" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 45.1 Pg, respectively). We found that
the global ensemble yields a slightly higher uncertainty. Our country-specific
ensembles suggested that countries with highest SOC stocks were Brazil,
Argentina, Colombia, Mexico, Peru, and Venezuela
(Table <xref ref-type="table" rid="Ch1.T3"/>).</p>
      <p id="d1e4363">Consistently, all models showed that tropical broadleaf evergreen forests,
croplands, and temperate shrublands were the land cover classes that had
higher SOC across all SOC available estimates (Table <xref ref-type="table" rid="Ch1.T4"/>).
However, using only the dataset contained in the WoSIS system, we predict
nearly the half of SOC compared with previously reported SOC estimates such
as the SoilGrids system (Table <xref ref-type="table" rid="Ch1.T3"/>).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F8"><caption><p id="d1e4372">Relationship between the SOC stock and the prediction limit. The
average breakdown points of this relationship are shown in the vertical line
at the right of the plot.</p></caption>
          <?xmltex \igopts{width=170.716535pt}?><graphic xlink:href="https://soil.copernicus.org/articles/4/173/2018/soil-4-173-2018-f08.png"/>

        </fig>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T3" specific-use="star"><caption><p id="d1e4385">SOC stocks (Pg) at the contextual resolution of 5 <inline-formula><mml:math id="M140" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 5 km
grids. The terms used are defined as follows: ens is country-specific,
regional is Latin America ensemble, sg is the SoilGrids system, GSOCmap-GSP
is country-specific 1 km, and hw is the Harmonized World Soil Database.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="7">
     <oasis:colspec colnum="1" colname="col1" align="right"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Country</oasis:entry>
         <oasis:entry colname="col3">ens</oasis:entry>
         <oasis:entry colname="col4">regional</oasis:entry>
         <oasis:entry colname="col5">sg</oasis:entry>
         <oasis:entry colname="col6">GSOCmap-GSP</oasis:entry>
         <oasis:entry colname="col7">hw</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">1</oasis:entry>
         <oasis:entry colname="col2">Argentina</oasis:entry>
         <oasis:entry colname="col3">13.19</oasis:entry>
         <oasis:entry colname="col4">12.77</oasis:entry>
         <oasis:entry colname="col5">24.45</oasis:entry>
         <oasis:entry colname="col6">18.00</oasis:entry>
         <oasis:entry colname="col7">18.13</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2</oasis:entry>
         <oasis:entry colname="col2">Belize</oasis:entry>
         <oasis:entry colname="col3">0.24</oasis:entry>
         <oasis:entry colname="col4">0.12</oasis:entry>
         <oasis:entry colname="col5">0.28</oasis:entry>
         <oasis:entry colname="col6">0.28</oasis:entry>
         <oasis:entry colname="col7">0.19</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">3</oasis:entry>
         <oasis:entry colname="col2">Bolivia</oasis:entry>
         <oasis:entry colname="col3">3.29</oasis:entry>
         <oasis:entry colname="col4">3.39</oasis:entry>
         <oasis:entry colname="col5">8.39</oasis:entry>
         <oasis:entry colname="col6">6.99</oasis:entry>
         <oasis:entry colname="col7">5.96</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">4</oasis:entry>
         <oasis:entry colname="col2">Brazil</oasis:entry>
         <oasis:entry colname="col3">26.82</oasis:entry>
         <oasis:entry colname="col4">27.16</oasis:entry>
         <oasis:entry colname="col5">68.45</oasis:entry>
         <oasis:entry colname="col6">42.79</oasis:entry>
         <oasis:entry colname="col7">47.20</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">5</oasis:entry>
         <oasis:entry colname="col2">Chile</oasis:entry>
         <oasis:entry colname="col3">6.31</oasis:entry>
         <oasis:entry colname="col4">7.20</oasis:entry>
         <oasis:entry colname="col5">15.15</oasis:entry>
         <oasis:entry colname="col6">1.93</oasis:entry>
         <oasis:entry colname="col7">8.28</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">6</oasis:entry>
         <oasis:entry colname="col2">Colombia</oasis:entry>
         <oasis:entry colname="col3">7.01</oasis:entry>
         <oasis:entry colname="col4">5.96</oasis:entry>
         <oasis:entry colname="col5">15.50</oasis:entry>
         <oasis:entry colname="col6">5.12</oasis:entry>
         <oasis:entry colname="col7">14.99</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">7</oasis:entry>
         <oasis:entry colname="col2">Costa Rica</oasis:entry>
         <oasis:entry colname="col3">0.56</oasis:entry>
         <oasis:entry colname="col4">0.34</oasis:entry>
         <oasis:entry colname="col5">0.83</oasis:entry>
         <oasis:entry colname="col6">0.83</oasis:entry>
         <oasis:entry colname="col7">0.71</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">8</oasis:entry>
         <oasis:entry colname="col2">Cuba</oasis:entry>
         <oasis:entry colname="col3">0.52</oasis:entry>
         <oasis:entry colname="col4">0.51</oasis:entry>
         <oasis:entry colname="col5">1.48</oasis:entry>
         <oasis:entry colname="col6">0.82</oasis:entry>
         <oasis:entry colname="col7">0.64</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">9</oasis:entry>
         <oasis:entry colname="col2">Ecuador</oasis:entry>
         <oasis:entry colname="col3">1.31</oasis:entry>
         <oasis:entry colname="col4">1.36</oasis:entry>
         <oasis:entry colname="col5">4.04</oasis:entry>
         <oasis:entry colname="col6">1.57</oasis:entry>
         <oasis:entry colname="col7">2.63</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">10</oasis:entry>
         <oasis:entry colname="col2">Guatemala</oasis:entry>
         <oasis:entry colname="col3">1.02</oasis:entry>
         <oasis:entry colname="col4">0.57</oasis:entry>
         <oasis:entry colname="col5">1.27</oasis:entry>
         <oasis:entry colname="col6">1.27</oasis:entry>
         <oasis:entry colname="col7">0.99</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">11</oasis:entry>
         <oasis:entry colname="col2">Jamaica</oasis:entry>
         <oasis:entry colname="col3">0.05</oasis:entry>
         <oasis:entry colname="col4">0.05</oasis:entry>
         <oasis:entry colname="col5">0.14</oasis:entry>
         <oasis:entry colname="col6">0.07</oasis:entry>
         <oasis:entry colname="col7">0.07</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">12</oasis:entry>
         <oasis:entry colname="col2">Mexico</oasis:entry>
         <oasis:entry colname="col3">5.98</oasis:entry>
         <oasis:entry colname="col4">6.12</oasis:entry>
         <oasis:entry colname="col5">14.43</oasis:entry>
         <oasis:entry colname="col6">9.04</oasis:entry>
         <oasis:entry colname="col7">17.59</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">13</oasis:entry>
         <oasis:entry colname="col2">Nicaragua</oasis:entry>
         <oasis:entry colname="col3">0.74</oasis:entry>
         <oasis:entry colname="col4">0.62</oasis:entry>
         <oasis:entry colname="col5">1.42</oasis:entry>
         <oasis:entry colname="col6">0.71</oasis:entry>
         <oasis:entry colname="col7">0.92</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">14</oasis:entry>
         <oasis:entry colname="col2">Panama</oasis:entry>
         <oasis:entry colname="col3">0.56</oasis:entry>
         <oasis:entry colname="col4">0.43</oasis:entry>
         <oasis:entry colname="col5">1.10</oasis:entry>
         <oasis:entry colname="col6">0.33</oasis:entry>
         <oasis:entry colname="col7">0.69</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">15</oasis:entry>
         <oasis:entry colname="col2">Peru</oasis:entry>
         <oasis:entry colname="col3">4.38</oasis:entry>
         <oasis:entry colname="col4">5.13</oasis:entry>
         <oasis:entry colname="col5">17.08</oasis:entry>
         <oasis:entry colname="col6">3.14</oasis:entry>
         <oasis:entry colname="col7">10.51</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">16</oasis:entry>
         <oasis:entry colname="col2">Suriname</oasis:entry>
         <oasis:entry colname="col3">0.56</oasis:entry>
         <oasis:entry colname="col4">0.51</oasis:entry>
         <oasis:entry colname="col5">1.20</oasis:entry>
         <oasis:entry colname="col6">0.45</oasis:entry>
         <oasis:entry colname="col7">1.33</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">17</oasis:entry>
         <oasis:entry colname="col2">Uruguay</oasis:entry>
         <oasis:entry colname="col3">0.92</oasis:entry>
         <oasis:entry colname="col4">0.88</oasis:entry>
         <oasis:entry colname="col5">1.99</oasis:entry>
         <oasis:entry colname="col6">0.84</oasis:entry>
         <oasis:entry colname="col7">2.27</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">18</oasis:entry>
         <oasis:entry colname="col2">Venezuela</oasis:entry>
         <oasis:entry colname="col3">4.71</oasis:entry>
         <oasis:entry colname="col4">3.77</oasis:entry>
         <oasis:entry colname="col5">9.39</oasis:entry>
         <oasis:entry colname="col6">5.28</oasis:entry>
         <oasis:entry colname="col7">5.64</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d1e4898">The model variance of predicted SOC reached values over 300 % for
countries such as Mexico and Bolivia. In contrast, countries with higher SOC
per unit area and relatively low prediction variances were Panama, Guatemala,
Costa Rica, Nicaragua, and Belize. Overall, we found a median model
prediction variance of 53 % across countries in Latin America. Areas with
high uncertainty and model variance were across northern Mexico, Central
America, limits between Colombia and Brazil, and the border between Chile and
Argentina.</p><?xmltex \hack{\newpage}?>
</sec>
</sec>
<sec id="Ch1.S4">
  <title>Discussion</title>
      <p id="d1e4909">We developed a DSM framework to characterize the spatial variability of SOC
across Latin America. Our results suggest that a multi-model approach was
suitable to better understand modeling bias and uncertainty of SOC maps. We
argue that uncertainty on SOC mapping can be associated with (a) the
complexity of the property of interest (i.e., SOC), (b) the environmental
heterogeneity within the area/country of interest, and (c) the
characteristics of available data (e.g., data density, data quality, and data
representativeness) to meet model-specific assumptions. Thus, when legacy
soil profile collections that were collected for different purposes along
long periods of time (i.e., decades), a multi-model approach (i.e., ensemble)
would be convenient to maximize the predictive capacity considering the
available information.</p>
      <p id="d1e4912">To maximize accuracy of our models, we used a generalized linear approach to
combine single predictions, and at the continental scale we were able to
explain 39 % of SOC variance using only information contained in the
WoSIS system for Latin America. This result was within the range of the
prediction capacity of country-specific models. Besides the low density of
observation points, the performance could be partially affected by the
generalization from the 1 : 1 scale of a soil profile (or field SOC
observation) to a 5 <inline-formula><mml:math id="M141" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 5 km grid, representing an additional source
of uncertainty. Higher discrepancy between country-specific and global
efforts was evident across Brazil, the largest country, where our models tend
to predict nearly half of SOC compared to previous efforts (e.g., the GSOCmap-GSP,
the SoilGrids system, and the Harmonized World Soil Database). The SoilGrids
system tends to predict the highest values, while our country-specific
ensemble predicts the lowest. The GSOCmap-GSP and our ensembles predicted <inline-formula><mml:math id="M142" display="inline"><mml:mo>&lt;</mml:mo></mml:math></inline-formula> 100 Pg
of SOC across the analyzed countries, while all other products suggest higher
stocks (see Tables <xref ref-type="table" rid="Ch1.T3"/> and <xref ref-type="table" rid="Ch1.T4"/>).</p>
      <?pagebreak page186?><p id="d1e4933">Another source of discrepancy can be associated with the lack of available
data to represent the SOC stock at the depth of interest (i.e., <inline-formula><mml:math id="M143" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>30 cm of
mineral soil). The predictive performance of the mass-preservative spline to
continuously represent the SOC and depth relationships in some cases could be
strongly influenced by the lack of observations across highly variable soil
profiles. Some examples include SOC-rich agricultural soil profiles
constantly transformed for food production purposes, or a volcanic setting.
These high levels of missing data lead the trend map of approximated error
(Fig. <xref ref-type="fig" rid="Ch1.F6"/>), which provides an idea of the uncertainty in the
SOC estimates.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T4" specific-use="star"><caption><p id="d1e4948">SOC stocks at the contextual resolution of 5 <inline-formula><mml:math id="M144" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 5 km across
land cover classes of Latin America for the 18 analyzed countries. The terms
used are defined as follows: ens is country-specific, regional is Latin
America ensemble, sg is the SoilGrids system, GSOCmap-GSP is country-specific
1 km, and hw is the Harmonized World Soil Database. These are the land cover
classes described in Blanco et al. (2013). This land cover product was
generated using 500 m grids and has 84 % of accuracy.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="7">
     <oasis:colspec colnum="1" colname="col1" align="right"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Land cover</oasis:entry>
         <oasis:entry colname="col3">ens</oasis:entry>
         <oasis:entry colname="col4">GSOCmap-GSP</oasis:entry>
         <oasis:entry colname="col5">hw</oasis:entry>
         <oasis:entry colname="col6">sg</oasis:entry>
         <oasis:entry colname="col7">regional</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">1</oasis:entry>
         <oasis:entry colname="col2">Tropical broadleaf evergreen forest</oasis:entry>
         <oasis:entry colname="col3">30.39</oasis:entry>
         <oasis:entry colname="col4">40.30</oasis:entry>
         <oasis:entry colname="col5">59.15</oasis:entry>
         <oasis:entry colname="col6">80.44</oasis:entry>
         <oasis:entry colname="col7">29.73</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2</oasis:entry>
         <oasis:entry colname="col2">Tropical broadleaf deciduous forest</oasis:entry>
         <oasis:entry colname="col3">0.43</oasis:entry>
         <oasis:entry colname="col4">0.65</oasis:entry>
         <oasis:entry colname="col5">1.00</oasis:entry>
         <oasis:entry colname="col6">1.09</oasis:entry>
         <oasis:entry colname="col7">0.42</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">3</oasis:entry>
         <oasis:entry colname="col2">Subtropical broadleaf evergreen forest</oasis:entry>
         <oasis:entry colname="col3">2.38</oasis:entry>
         <oasis:entry colname="col4">3.91</oasis:entry>
         <oasis:entry colname="col5">4.51</oasis:entry>
         <oasis:entry colname="col6">6.57</oasis:entry>
         <oasis:entry colname="col7">2.25</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">4</oasis:entry>
         <oasis:entry colname="col2">Subtropical broadleaf deciduous forest</oasis:entry>
         <oasis:entry colname="col3">1.42</oasis:entry>
         <oasis:entry colname="col4">2.04</oasis:entry>
         <oasis:entry colname="col5">1.87</oasis:entry>
         <oasis:entry colname="col6">2.55</oasis:entry>
         <oasis:entry colname="col7">1.07</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">5</oasis:entry>
         <oasis:entry colname="col2">Temperate broadleaf evergreen forest</oasis:entry>
         <oasis:entry colname="col3">3.32</oasis:entry>
         <oasis:entry colname="col4">1.26</oasis:entry>
         <oasis:entry colname="col5">4.97</oasis:entry>
         <oasis:entry colname="col6">6.91</oasis:entry>
         <oasis:entry colname="col7">3.56</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">6</oasis:entry>
         <oasis:entry colname="col2">Temperate broadleaf deciduous forest</oasis:entry>
         <oasis:entry colname="col3">0.48</oasis:entry>
         <oasis:entry colname="col4">0.52</oasis:entry>
         <oasis:entry colname="col5">1.02</oasis:entry>
         <oasis:entry colname="col6">1.21</oasis:entry>
         <oasis:entry colname="col7">0.63</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">7</oasis:entry>
         <oasis:entry colname="col2">Subtropical needleleaf forest</oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
         <oasis:entry colname="col4">0.01</oasis:entry>
         <oasis:entry colname="col5">0.00</oasis:entry>
         <oasis:entry colname="col6">0.01</oasis:entry>
         <oasis:entry colname="col7">0.00</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">8</oasis:entry>
         <oasis:entry colname="col2">Temperate needleleaf forest</oasis:entry>
         <oasis:entry colname="col3">0.23</oasis:entry>
         <oasis:entry colname="col4">0.36</oasis:entry>
         <oasis:entry colname="col5">0.45</oasis:entry>
         <oasis:entry colname="col6">0.54</oasis:entry>
         <oasis:entry colname="col7">0.24</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">9</oasis:entry>
         <oasis:entry colname="col2">Mixed forest</oasis:entry>
         <oasis:entry colname="col3">0.67</oasis:entry>
         <oasis:entry colname="col4">1.08</oasis:entry>
         <oasis:entry colname="col5">1.34</oasis:entry>
         <oasis:entry colname="col6">1.66</oasis:entry>
         <oasis:entry colname="col7">0.66</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">10</oasis:entry>
         <oasis:entry colname="col2">Tropical shrubland</oasis:entry>
         <oasis:entry colname="col3">4.25</oasis:entry>
         <oasis:entry colname="col4">6.58</oasis:entry>
         <oasis:entry colname="col5">6.98</oasis:entry>
         <oasis:entry colname="col6">10.30</oasis:entry>
         <oasis:entry colname="col7">4.18</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">11</oasis:entry>
         <oasis:entry colname="col2">Subtropical shrubland</oasis:entry>
         <oasis:entry colname="col3">3.17</oasis:entry>
         <oasis:entry colname="col4">4.18</oasis:entry>
         <oasis:entry colname="col5">6.62</oasis:entry>
         <oasis:entry colname="col6">6.33</oasis:entry>
         <oasis:entry colname="col7">2.90</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">12</oasis:entry>
         <oasis:entry colname="col2">Temperate shrubland</oasis:entry>
         <oasis:entry colname="col3">4.56</oasis:entry>
         <oasis:entry colname="col4">5.08</oasis:entry>
         <oasis:entry colname="col5">7.33</oasis:entry>
         <oasis:entry colname="col6">9.97</oasis:entry>
         <oasis:entry colname="col7">5.32</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">13</oasis:entry>
         <oasis:entry colname="col2">Tropical grassland</oasis:entry>
         <oasis:entry colname="col3">3.01</oasis:entry>
         <oasis:entry colname="col4">2.48</oasis:entry>
         <oasis:entry colname="col5">3.56</oasis:entry>
         <oasis:entry colname="col6">5.46</oasis:entry>
         <oasis:entry colname="col7">2.45</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">14</oasis:entry>
         <oasis:entry colname="col2">Subtropical grassland</oasis:entry>
         <oasis:entry colname="col3">1.15</oasis:entry>
         <oasis:entry colname="col4">1.35</oasis:entry>
         <oasis:entry colname="col5">2.28</oasis:entry>
         <oasis:entry colname="col6">2.58</oasis:entry>
         <oasis:entry colname="col7">1.12</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">15</oasis:entry>
         <oasis:entry colname="col2">Temperate grassland</oasis:entry>
         <oasis:entry colname="col3">2.75</oasis:entry>
         <oasis:entry colname="col4">3.31</oasis:entry>
         <oasis:entry colname="col5">4.86</oasis:entry>
         <oasis:entry colname="col6">5.92</oasis:entry>
         <oasis:entry colname="col7">3.04</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">16</oasis:entry>
         <oasis:entry colname="col2">Inland water bodies</oasis:entry>
         <oasis:entry colname="col3">1.21</oasis:entry>
         <oasis:entry colname="col4">1.37</oasis:entry>
         <oasis:entry colname="col5">2.07</oasis:entry>
         <oasis:entry colname="col6">3.45</oasis:entry>
         <oasis:entry colname="col7">1.21</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">17</oasis:entry>
         <oasis:entry colname="col2">Urban area</oasis:entry>
         <oasis:entry colname="col3">0.24</oasis:entry>
         <oasis:entry colname="col4">0.31</oasis:entry>
         <oasis:entry colname="col5">0.45</oasis:entry>
         <oasis:entry colname="col6">0.55</oasis:entry>
         <oasis:entry colname="col7">0.22</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">18</oasis:entry>
         <oasis:entry colname="col2">Permanent ice and snow</oasis:entry>
         <oasis:entry colname="col3">0.14</oasis:entry>
         <oasis:entry colname="col4">0.08</oasis:entry>
         <oasis:entry colname="col5">0.14</oasis:entry>
         <oasis:entry colname="col6">0.38</oasis:entry>
         <oasis:entry colname="col7">0.17</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">19</oasis:entry>
         <oasis:entry colname="col2">Barren land</oasis:entry>
         <oasis:entry colname="col3">1.74</oasis:entry>
         <oasis:entry colname="col4">2.38</oasis:entry>
         <oasis:entry colname="col5">2.43</oasis:entry>
         <oasis:entry colname="col6">2.95</oasis:entry>
         <oasis:entry colname="col7">1.70</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">20</oasis:entry>
         <oasis:entry colname="col2">Cropland</oasis:entry>
         <oasis:entry colname="col3">12.95</oasis:entry>
         <oasis:entry colname="col4">19.33</oasis:entry>
         <oasis:entry colname="col5">21.89</oasis:entry>
         <oasis:entry colname="col6">27.94</oasis:entry>
         <oasis:entry colname="col7">12.42</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">21</oasis:entry>
         <oasis:entry colname="col2">Wetland</oasis:entry>
         <oasis:entry colname="col3">0.37</oasis:entry>
         <oasis:entry colname="col4">0.56</oasis:entry>
         <oasis:entry colname="col5">0.66</oasis:entry>
         <oasis:entry colname="col6">1.24</oasis:entry>
         <oasis:entry colname="col7">0.35</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">22</oasis:entry>
         <oasis:entry colname="col2">Salt flat</oasis:entry>
         <oasis:entry colname="col3">0.13</oasis:entry>
         <oasis:entry colname="col4">0.17</oasis:entry>
         <oasis:entry colname="col5">0.16</oasis:entry>
         <oasis:entry colname="col6">0.18</oasis:entry>
         <oasis:entry colname="col7">0.10</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">23</oasis:entry>
         <oasis:entry colname="col2">Coastal areas</oasis:entry>
         <oasis:entry colname="col3">1.59</oasis:entry>
         <oasis:entry colname="col4">1.39</oasis:entry>
         <oasis:entry colname="col5">2.23</oasis:entry>
         <oasis:entry colname="col6">4.31</oasis:entry>
         <oasis:entry colname="col7">1.78</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d1e5587">The GSOCmap-GSP, for example, was generated on a country basis, but the
amount of SOC observations used for the countries to generate these maps was
considerable higher than the available data in the WoSIS system
(<inline-formula><mml:math id="M145" display="inline"><mml:mo lspace="0mm">&gt;</mml:mo></mml:math></inline-formula> 1 000 000 points). Both of our models predicted more conservative
results than the GSOCmap-GSP, while at the same time, the GSOCmap-GSP
predicted less SOC than the SoilGrids system and the Harmonized World Soil
Database. Respectively, the SoilGrids system relies on a multivariate space
suitable to represent the global soil-forming environment; however, a model
would assume a similar relation of each covariate with the response across
all land area in the world. The Harmonized World Soil Database may be a
pedologically sound product, but large areas of Latin America have not been
mapped at detailed scales (i.e., larger scales than 1 : 1 million) and this
results in a polygon-based approach relying on wide generalizations.</p>
      <p id="d1e5597">Despite the aforementioned limitations, across Latin America, there is an
increasing availability of relevant SOC information across site- and
country-specific regions <xref ref-type="bibr" rid="bib1.bibx56 bib1.bibx67 bib1.bibx3 bib1.bibx58 bib1.bibx2 bib1.bibx53" id="paren.46"/>, which could serve for
validating and calibrating global SOC estimates. Thus, regional approaches
considering multiple Latin American countries and SOC models could be a
valuable resource to explain discrepancies between site- or country-specific
and global SOC models.</p>
      <p id="d1e5603">Our results incorporate a multi-model perspective for quantifying/evaluating
the spatial variability of SOC. The model with higher predictive capacity in
terms of cross-validated <inline-formula><mml:math id="M146" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> was RF, an ensemble of regression trees based
on bagging. However, this method yields high EC<inline-formula><mml:math id="M147" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">r</mml:mi></mml:msub></mml:math></inline-formula>, and therefore it
tends to capture the trend but with high bias. Taylor diagrams show that RF
in any case yield the lower variance. SVM and RK were methods with higher
agreement between RMSE and corr, and therefore lower EC<inline-formula><mml:math id="M148" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">r</mml:mi></mml:msub></mml:math></inline-formula>. Large
values of EC<inline-formula><mml:math id="M149" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">r</mml:mi></mml:msub></mml:math></inline-formula> represent an accuracy limitation that was evident
for RF, PL, and KK. To overcome these types of modeling biases, previous
studies have suggested that the theory of ensemble learning applied to soil
datasets could increase the accuracy of results <xref ref-type="bibr" rid="bib1.bibx18 bib1.bibx52" id="paren.47"/>. Furthermore, recent studies highlight the applicability of
selective ensembles across a large diversity of model algorithms useful for
digital soil mapping purposes <xref ref-type="bibr" rid="bib1.bibx48" id="paren.48"/>. Thus, our modeling
approach included the combination of multiple predictions by using a linear
stack of models as implemented in the caretEnsemble package of R <xref ref-type="bibr" rid="bib1.bibx14" id="paren.49"/>,
with the ultimate goal of reducing the uncertainty on SOC mapping efforts.</p>
      <?pagebreak page187?><p id="d1e5654">Across Latin America, we did not find a common predictive algorithm for SOC.
These results suggest that country-specific environmental predictors and
available data influence the applicability of different approaches. This
assessment is needed to address the requirements from the GSOCmap-GSP with
the official mandate to generate and update country-specific soil information
by the means of DSM. Thus, we argue that the DSM form of each country should
assess and incorporate country-specific available data and environmental
predictors to select the best prediction algorithm. The FAO SOC mapping
cookbook explores possibilities to derive country-specific SOC maps from a
variety of prediction algorithms <xref ref-type="bibr" rid="bib1.bibx72" id="paren.50"/>, and multiple resources
have described the state of the art of modeling methods focused on DSM of
soil carbon <xref ref-type="bibr" rid="bib1.bibx46 bib1.bibx40" id="paren.51"/> including geostatistics
<xref ref-type="bibr" rid="bib1.bibx25 bib1.bibx26" id="paren.52"/>. Thus, data characteristics (e.g., spatial
structure, representativeness) are specifically important for developing a
DSM framework as legacy soil profile collections, generated with long-term
soil inventory purposes, will determine data availability and spatial
distribution within a country.</p>
      <p id="d1e5666">This country-specific approach to map regional SOC results in artifacts
across geopolitical borders. Therefore, data sharing, model validation, and
calibration experiments among countries are required to better capture the
spatial variability of SOC. The use of a natural-defined prediction domain
(e.g., ecoregional or physiographic map) could reduce the border effects.
However, we understand that geopolitical borders are required for policy
decisions around country-specific needs. We highlight that there is a lack of
publicly available country-specific data that ultimately influence the
performance of both country-specific to regional-to-global SOC estimates.</p>
      <p id="d1e5669">To achieve the highest possible accuracy of country-specific SOC estimates,
the availability of point data sources for SOC modeling and mapping is an
important consideration when selecting an efficient modeling strategy,
especially when dealing with legacy SOC datasets. Our results highlight
important uncertainty levels (<inline-formula><mml:math id="M150" display="inline"><mml:mo lspace="0mm">&gt;</mml:mo></mml:math></inline-formula> 100 %) across large areas of Latin
America (Table <xref ref-type="fig" rid="Ch1.F6"/>). The data contained in WoSIS have a
low-density distribution given the large area and environmental complexity of
several countries analyzed. Thus, larger uncertainty dominates countries with
larger SOC pools probably because available data do not capture the large
spatial heterogeneity of SOC stocks. We highlight that the WoSIS dataset is a
unique and invaluable effort that has proven to generate global SOC
predictions <xref ref-type="bibr" rid="bib1.bibx29 bib1.bibx59" id="paren.53"/>, but there is a global need to
increase information and networking capabilities for SOC <xref ref-type="bibr" rid="bib1.bibx22" id="paren.54"/>.</p>
      <?pagebreak page188?><p id="d1e5688">This study generated predictions of SOC across Latin America but also
provided information about the main relationships driving the spatial
distribution of SOC. Machine learning (i.e., data-driven) models have proven
to be more efficient to model non-linear relationships of SOC
<xref ref-type="bibr" rid="bib1.bibx28" id="paren.55"/>, but our results suggest that linear-based models (e.g.,
RK) could outperform machine learning methods under well-distributed and
representative SOC data scenarios. Similar results were found across
productive landscapes of Brazil <xref ref-type="bibr" rid="bib1.bibx8" id="paren.56"/>. We argue that our
capacity to meet modeling assumptions will determine the most suitable
prediction algorithm or ensemble methods (i.e., stack, blend, bucket of
models). Machine learning models are usually conceived as black boxes and the
influence of non-informative SOC prediction factors on machine learning-based
SOC models has not been evaluated in detail. Therefore, we propose that the
use of simple linear methods (i.e., correlation of available data and their
predictors) can be a useful and parsimonious first step to inform data-driven
approaches and enhance the interpretability of machine learning models to
predict SOC. However, the simple selection of prediction factors based on
simple correlation analysis does not prevent multi-collinearity, in which
hypothesis-driven methods (e.g., RK) may be at risk to fail, but provides
useful information about the main effects of the predictors on SOC. Thus, the
use of machine learning and other statistical models (i.e., PL) is suitable
to overcome the bias associated with the potential statistical redundancy of
our simple variable selection approach based on simple correlation analysis.
Furthermore, our data suggest that country-specific predictor factors are
needed to better parameterize models but also could be useful for
country-specific model interpretation. These results have important
implications because it has been proposed that an extensive set of prediction
factors is required to capture the large variance of the global SOC pool
<xref ref-type="bibr" rid="bib1.bibx29" id="paren.57"/>. Thus, we propose that limited but informative
country-specific prediction factors could be jointly explored to describe the
local biophysical characteristics controlling SOC variability.</p>
      <p id="d1e5700">This study is expected to increase the capacity of Latin American
institutions to provide accurate baseline estimates of SOC with a
country-specific perspective following recommendations of GSOCmap-GSP.
Ultimately, these efforts will enhance the development of new guidelines for
measuring, mapping, reporting, verification, and monitoring SOC stocks <xref ref-type="bibr" rid="bib1.bibx65" id="paren.58"/>.
Accurate country-specific DSM frameworks for SOC are required to facilitate
interoperability and inform environmental policy across developing countries
<xref ref-type="bibr" rid="bib1.bibx66" id="paren.59"/>. Our results highlight that attention is needed to better
understand the influence of model prediction limits (e.g., the full
conditional distribution) for the predicted SOC stocks. Setting an unreliable
(excessive or low) prediction limit can have important effects (under- or
overestimating) on the overall estimated stocks (Fig. <xref ref-type="fig" rid="Ch1.F8"/>).
Therefore, we argue that data science systems for DSM focused on carbon
assessments should be fundamentally based on SOC expert knowledge and
informed by expert-based soil mapping systems.</p>
</sec>
<sec id="Ch1.S5" sec-type="conclusions">
  <title>Conclusions</title>
      <p id="d1e5718">We provided a multi-model comparison approach to map SOC stocks across Latin
America and found that there is no dominant best prediction algorithm
given the available data. The relative performance of the different methods
varies from one place to another as well as the relative correlation of SOC
with the prediction factors given the available data. We compared and combined
hypothesis-driven approaches (e.g., linear geostatistics) and data-driven
algorithms (e.g., machine learning), respectively, to generate interpretable
and predictable models of SOC variability. We argue that models should not be
conceived as competitors, because they have different assumptions (about the
data themselves or about the empirical relationship between the response
variable and its predictors) as different models will capture different
portions of SOC variability. We highlight potential levels of uncertainty in
SOC stocks associated with the maximum allowed prediction limit. Public data
may not be representative across large areas, and we call for all countries to
strengthen digital soil mapping capacity building initiatives, SOC research,
and data sharing. The use of country-specific information and the use of
different modeling approaches will enhance regional SOC mapping efforts and
will provide insights to identify where and why different modeling approaches
generate similar SOC estimates.</p>
</sec>

      
      </body>
    <back><notes notes-type="codeavailability">

      <p id="d1e5725">The codes used for this work are available under the AGPL 3.0 license at
<ext-link xlink:href="https://doi.org/10.5281/zenodo.1304392" ext-link-type="DOI">10.5281/zenodo.1304392</ext-link> <xref ref-type="bibr" rid="bib1.bibx21" id="paren.60"/>.</p>

      <p id="d1e5734">Working codes are also available at
<uri>https://github.com/vargaslab/SoilCarbon_Latin_America</uri> (last access:
16 July 2018).</p>
  </notes><notes notes-type="dataavailability">

      <p id="d1e5743">The soil dataset can be downloaded from WoSIS at
<uri>http://www.isric.org/explore/wosis</uri> (last access: 16 July 2018) and
corresponds to the July 2016 version <xref ref-type="bibr" rid="bib1.bibx5" id="paren.61"/>. Soil covariates are
available at <uri>http://worldgrids.org</uri> (last access: 20 February 2018). A
list of the codes for the SOC prediction factors used here can be found at <uri>https://docs.google.com/spreadsheets/d/1yr09cPDoSVdoahN_fXcNLfgipQcCodRl66WCcj6hJ9A/edit?usp=sharing</uri>
(last access: 16 July 2018).</p>
  </notes><?xmltex \hack{\clearpage}?><app-group>

<?pagebreak page189?><app id="App1.Ch1.S1">
  <title/>
<sec id="App1.Ch1.S1.SS1">
  <title>Brief description of implemented methods</title>
      <p id="d1e5771">RK is a hybrid model with both a deterministic and a stochastic component
<xref ref-type="bibr" rid="bib1.bibx27" id="paren.62"/>. The regression part took the form of a stepwise (backward and
forward) multiple linear regression to avoid statistical redundancy among the
best prediction factors. The residual kriging was ordinary. The variogram
parameters supporting the spatial interpolation were automatically fitted
using the framework proposed by <xref ref-type="bibr" rid="bib1.bibx33" id="text.63"/>. RK was applied only to
countries with 10 or more available observations.</p>
      <p id="d1e5780">PLS is a common method to deal with the presence of highly correlated
predictors. The PLS algorithm integrates the compression and regression steps
and it selects successive orthogonal factors that maximize the covariance
between predictor and response variables <xref ref-type="bibr" rid="bib1.bibx69 bib1.bibx68" id="paren.64"/>. Most of its development and application are in the fields
of chemometrics but it is used in several research areas to effectively solve
regression and classification problems.</p>
      <p id="d1e5786">SVM applies a simple linear method to the data but in a high-dimensional
feature space non-linearly related to the input space
<xref ref-type="bibr" rid="bib1.bibx35" id="paren.65"/>. It creates a hyperplane through <inline-formula><mml:math id="M151" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula>-dimensional
spectral space. Then, SVM separates numerical data based on a kernel function
and parameters (e.g., gamma and cost) that maximize the margin from the
closest point to the hyperplane that divides data with the largest possible
margin, being the support vectors the points which fall within
<xref ref-type="bibr" rid="bib1.bibx31" id="paren.66"/>. Then, linear models are fitted to the support vectors. A
radial general purpose kernel was found optimal after the cross-validation
strategy for parameter selection.</p>
      <p id="d1e5802"><?xmltex \hack{\newpage}?>RF is an ensemble of regression trees based on bagging <xref ref-type="bibr" rid="bib1.bibx10" id="paren.67"/>.
This machine learning algorithm uses a different combination of prediction
factors to train multiple regression trees. Each tree is generated using
different subsets of available data <xref ref-type="bibr" rid="bib1.bibx11" id="paren.68"/>. The number of
prediction factors to use on each tree is known as the mtry parameter. The
final prediction is the weighted average of all individual trees.</p>
      <p id="d1e5813">KK is a pattern recognition technique which is based on the distances to
training examples in the feature space <xref ref-type="bibr" rid="bib1.bibx61" id="paren.69"/>. The observations
within the learning set, which are particularly close to the new observation
(<inline-formula><mml:math id="M152" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M153" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>), should get a higher weight in the decision than such neighbors
that are far away from (<inline-formula><mml:math id="M154" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M155" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>) <xref ref-type="bibr" rid="bib1.bibx24" id="paren.70"/>. The parameter k
determines the number of neighbors from which information will be considered
for prediction, and a kernel function (e.g., triangular, Gaussian among
others) converts distances into weights which will be used for regression
problems. The Gaussian and (in less proportion) the triangular kernels were
the optimal options for all countries.</p><?xmltex \hack{\clearpage}?><supplementary-material position="anchor"><p id="d1e5851">The supplement related to this article is available online at: <inline-supplementary-material xlink:href="https://doi.org/10.5194/soil-4-173-2018-supplement" xlink:title="pdf">https://doi.org/10.5194/soil-4-173-2018-supplement</inline-supplementary-material>.</p></supplementary-material>
</sec>
</app>
  </app-group><notes notes-type="authorcontribution">

      <p id="d1e5863">All coauthors contributed to the planning of the study with support
from the GSP secretariat to develop the GSOCmap. MG, GFO and RV designed the
experiment. MG and GFO performed analyses. MG, ES and GFO prepared datasets.
MG, GFO and RV wrote the manuscript with feedback from all
coauthors.</p>
  </notes><notes notes-type="competinginterests">

      <p id="d1e5869">The authors declare that they have no conflict of
interest.</p>
  </notes><notes notes-type="sistatement">

      <p id="d1e5875">This article is part of the special issue “Regional
perspectives and challenges of soil organic carbon management and monitoring
– a special issue from the Global Symposium on Soil Organic Carbon 2017”.
It is a result of the Global Symposium on Soil Organic Carbon, Rome, Italy,
21–23 March 2017.</p>
  </notes><ack><title>Acknowledgements</title><p id="d1e5881">This work was supported by the Global Soil Partnership, the Central America,
Caribbean and Mexico Soil Partnership, and the South America Soil Partnership
in collaboration with the Department of Plant and Soil Sciences at the
University of Delaware. Mario Guevara acknowledges support from a CONACYT
fellowship. Guillermo Federico Olmedo is supported by the Argentinian
government through the project INTA PNSUELO1134032. Rodrigo Vargas
acknowledges support from NASA (80NSSC18K0173) and USDA
(2014-67003-22070).<?xmltex \hack{\newline}?><?xmltex \hack{\newline}?> Edited by: Peter
Finke<?xmltex \hack{\newline}?> Reviewed by: Tomislav Hengl and one anonymous referee</p></ack><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Adhikari et al.(2014)Adhikari, Hartemink, Minasny, Bou Kheir,
Greve, and Greve</label><mixed-citation>Adhikari, K., Hartemink, A. E., Minasny, B., Bou Kheir, R., Greve, M. B., and
Greve, M. H.: Digital mapping of soil organic carbon contents and stocks in
Denmark, PLoS ONE, 9, e105519, <ext-link xlink:href="https://doi.org/10.1371/journal.pone.0105519" ext-link-type="DOI">10.1371/journal.pone.0105519</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Angelini et al.(2016)Angelini, Heuvelink, Kempen, and
Morrás</label><mixed-citation>Angelini, M. E., Heuvelink, G. B., Kempen, B., and Morrás, H. J.: Mapping the
soils of an Argentine Pampas region using structural equation modelling,
Geoderma, 281, 102–118,
<ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2016.06.031" ext-link-type="DOI">10.1016/j.geoderma.2016.06.031</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Angelini et al.(2017)Angelini, Heuvelink, and Kempen</label><mixed-citation>Angelini, M. E., Heuvelink, G. B. M., and Kempen, B.: Multivariate mapping of
soil with structural equation modelling, Eur. J. Soil Sci.,
68, 575–591, <ext-link xlink:href="https://doi.org/10.1111/ejss.12446" ext-link-type="DOI">10.1111/ejss.12446</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Arrouays et al.(2017)</label><mixed-citation>Arrouays, D., Leenaars, J. G., de Forges, A. C. R., Adhikari, K., Ballabio, C.,
Greve, M., Grundy, M., Guerrero, E., Hempel, J., Hengl, T., Heuvelink, G.,
Batjes, N., Carvalho, E., Hartemink, A., Hewitt, A., Hong, S.-Y.,
Krasilnikov, P., Lagacherie, P., Lelyk, G., Libohova, Z., Lilly, A.,
McBratney, A., McKenzie, N., Vasquez, G. M., Mulder, V. L., Minasny, B.,
Montanarella, L., Odeh, I., Padarian, J., Poggio, L., Roudier, P., Saby, N.,
Savin, I., Searle, R., Solbovoy, V., Thompson, J., Smith, S., Sulaeman, Y.,
Vintila, R., Rossel, R. V., Wilson, P., Zhang, G.-L., Swerts, M., Oorts, K.,
Karklins, A., Feng, L., Navarro, A. R. I., Levin, A., Laktionova, T.,
Dell'Acqua, M., Suvannang, N., Ruam, W., Prasad, J., Patil, N., Husnjak, S.,
Pásztor, L., Okx, J., Hallett, S., Keay, C., Farewell, T., Lilja, H.,
Juilleret, J., Marx, S., Takata, Y., Kazuyuki, Y., Mansuy, N., Panagos, P.,
Liedekerke, M. V., Skalsky, R., Sobocka, J., Kobza, J., Eftekhari, K.,
Alavipanah, S. K., Moussadek, R., Badraoui, M., Silva, M. D., Paterson, G.,
da Conceição Gonçalves, M., Theocharopoulos, S., Yemefack, M., Tedou, S.,
Vrscaj, B., Grob, U., Kozák, J., Boruvka, L., Dobos, E., Taboada, M.,
Moretti, L., and Rodriguez, D.: Soil legacy data rescue via GlobalSoilMap and
other international and national initiatives, GeoResJ, 14, 1–19,
<ext-link xlink:href="https://doi.org/10.1016/j.grj.2017.06.001" ext-link-type="DOI">10.1016/j.grj.2017.06.001</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Batjes et al.(2017)Batjes, Ribeiro, van Oostrum, Leenaars, Hengl, and
de Jesus</label><mixed-citation>Batjes, N. H., Ribeiro, E., van Oostrum, A., Leenaars, J., Hengl, T., and Mendes de Jesus, J.:
WoSIS: providing standardised soil profile data for the world, Earth Syst. Sci. Data, 9, 1–14, <ext-link xlink:href="https://doi.org/10.5194/essd-9-1-2017" ext-link-type="DOI">10.5194/essd-9-1-2017</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Bishop et al.(1999)Bishop, McBratney, and Laslett</label><mixed-citation>Bishop, T., McBratney, A., and Laslett, G.: Modelling soil attribute depth
functions with equal-area quadratic smoothing splines, Geoderma, 91, 27–45, <ext-link xlink:href="https://doi.org/10.1016/S0016-7061(99)00003-8" ext-link-type="DOI">10.1016/S0016-7061(99)00003-8</ext-link>, 1999.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Blanco et al.(2013)</label><mixed-citation>Blanco, P. D., Colditz, R. R., Saldaña, G. L., Hardtke, L. A., Llamas, R. M.,
Mari, N. A., Fischer, A., Caride, C., Aceñolaza, P. G., del Valle, H. F.,
Lillo-Saavedra, M., Coronato, F., Opazo, S. A., Morelli, F., Anaya, J. A.,
Sione, W. F., Zamboni, P., and Arroyo, V. B.: A land cover map of Latin
America and the Caribbean in the framework of the SERENA project, Remote
Sens. Environ., 132, 13–31,
<ext-link xlink:href="https://doi.org/10.1016/j.rse.2012.12.025" ext-link-type="DOI">10.1016/j.rse.2012.12.025</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>Bonfatti et al.(2016)Bonfatti, Hartemink, and Giasson</label><mixed-citation>Bonfatti, B. R., Hartemink, A. E., and Giasson, E.: Comparing Soil C Stocks
from Soil Profile Data Using Four Different Methods, in: Progress in Soil
Science,   315–329, Springer International Publishing,
<ext-link xlink:href="https://doi.org/10.1007/978-3-319-28295-4_20" ext-link-type="DOI">10.1007/978-3-319-28295-4_20</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Borra and Di Ciaccio(2010)</label><mixed-citation>Borra, S. and Di Ciaccio, A.: Measuring the Prediction Error. A Comparison of
Cross-validation, Bootstrap and Covariance Penalty Methods, Comput. Stat.
Data An., 54, 2976–2989, <ext-link xlink:href="https://doi.org/10.1016/j.csda.2010.03.004" ext-link-type="DOI">10.1016/j.csda.2010.03.004</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Breiman(1996)</label><mixed-citation>Breiman, L.: Bagging Predictors, Mach. Learn., 24, 123–140,
<ext-link xlink:href="https://doi.org/10.1023/A:1018054314350" ext-link-type="DOI">10.1023/A:1018054314350</ext-link>,  1996.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Breiman(2001)</label><mixed-citation>Breiman, L.: Random forests, Mach. Learn., 45, 5–32,
<ext-link xlink:href="https://doi.org/10.1023/A:1010933404324" ext-link-type="DOI">10.1023/A:1010933404324</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Carslaw and Ropkins(2012)</label><mixed-citation>Carslaw, D. C. and Ropkins, K.: openair – An R package for air quality data
analysis, Environ. Modell. Softw., 27–28, 52–61,
<ext-link xlink:href="https://doi.org/10.1016/j.envsoft.2011.09.008" ext-link-type="DOI">10.1016/j.envsoft.2011.09.008</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>Conrad et al.(2015)Conrad, Bechtel, Bock, Dietrich, Fischer, Gerlitz,
Wehberg, Wichmann, and Böhner</label><mixed-citation>Conrad, O., Bechtel, B., Bock, M., Dietrich, H., Fischer, E., Gerlitz, L.,
Wehberg, J., Wichmann, V., and Böhner, J.: System for Automated
Geoscientific Analyses (SAGA) v. 2.1.4, Geosci. Model Dev., 8, 1991–2007,
<ext-link xlink:href="https://doi.org/10.5194/gmd-8-1991-2015" ext-link-type="DOI">10.5194/gmd-8-1991-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>Deane-Mayer and Knowles(2016)</label><mixed-citation>Deane-Mayer, Z. A. and Knowles, J. E.: caretEnsemble: Ensembles of Caret
Models, available at: <uri>https://CRAN.R-project.org/package=caretEnsemble</uri> (last access: 16 July 2018), r
package version 2.0.0, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Delgado-Baquerizo et al.(2017)Delgado-Baquerizo, Eldridge, Maestre,
Karunaratne, Trivedi, Reich, and Singh</label><mixed-citation>Delgado-Baquerizo, M., Eldridge, D. J., Maestre, F. T., Karunaratne, S. B.,
Trivedi, P., Reich, P. B., and Singh, B. K.: Climate legacies drive global
soil carbon stocks in terrestrial ecosystems, Sci. Adv., 3,
e1602008, <ext-link xlink:href="https://doi.org/10.1126/sciadv.1602008" ext-link-type="DOI">10.1126/sciadv.1602008</ext-link>, 2017.</mixed-citation></ref>
      <?pagebreak page191?><ref id="bib1.bibx16"><label>Drew(1973)</label><mixed-citation>Drew, L. A.: Bulk Density Estimation Based on Organic Matter Content of Some
Minnesota
Soils, St. Paul, Minn., School of Forestry, University of Minnesota, Digital
Conservancy, available at: <uri>http://hdl.handle.net/11299/58293</uri> (last access: 16 July 2018), 1973.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>FAO(2017)</label><mixed-citation>FAO: Fifth Meeting of the Global Soil Partnership Plenary Assembly, available
at: <uri>http://www.fao.org/3/a-bs973e.pdf</uri> (last access: 16 July 2018),
2017.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Finke(2012)</label><mixed-citation>Finke, P. A.: On digital soil assessment with models and the Pedometrics
agenda, Geoderma, 171–172, 3–15,
<ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2011.01.001" ext-link-type="DOI">10.1016/j.geoderma.2011.01.001</ext-link>,   2012.</mixed-citation></ref>
      <ref id="bib1.bibx19"><label>Florinsky(2012)</label><mixed-citation>Florinsky, I. V.: The Dokuchaev hypothesis as a basis for predictive digital
soil mapping (on the 125th anniversary of its publication), Eurasian Soil
Sci., 45, 445–451, <ext-link xlink:href="https://doi.org/10.1134/S1064229312040047" ext-link-type="DOI">10.1134/S1064229312040047</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>Grimm et al.(2008)Grimm, Behrens, Märker, and
Elsenbeer</label><mixed-citation>Grimm, R., Behrens, T., Märker, M., and Elsenbeer, H.: Soil organic
carbon concentrations and stocks on Barro Colorado Island – Digital soil
mapping using Random Forests analysis, Geoderma, 146, 102–113,
<ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2008.05.008" ext-link-type="DOI">10.1016/j.geoderma.2008.05.008</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Guevara et al.(2018)Guevara, Olmedo, and Vargas</label><mixed-citation>Guevara, M., Olmedo, G. F., and Vargas, R.: DSM-LAC/NoSilverBulletOnDSM: No
Silver Bullets – raw code, Zenodo, <ext-link xlink:href="https://doi.org/10.5281/zenodo.1304392" ext-link-type="DOI">10.5281/zenodo.1304392</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Harden et al.(2017)</label><mixed-citation>Harden, J. W., Hugelius, G., Ahlström, A., Blankinship, J. C.,
Bond-Lamberty,
B., Lawrence, C. R., Loisel, J., Malhotra, A., Jackson, R. B., Ogle, S.,
Phillips, C., Ryals, R., Todd-Brown, K., Vargas, R., Vergara, S. E., Cotrufo,
M. F., Keiluweit, M., Heckman, K. A., Crow, S. E., Silver, W. L., DeLonge,
M., and Nave, L. E.: Networking our science to characterize the state,
vulnerabilities, and management opportunities of soil organic matter, Glob.
Change Biol., 24,  e705–e718, <ext-link xlink:href="https://doi.org/10.1111/gcb.13896" ext-link-type="DOI">10.1111/gcb.13896</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>Hashimoto et al.(2017)</label><mixed-citation>Hashimoto, S., Nanko, K., Ťupek, B., and Lehtonen, A.: Data-mining
analysis of the global distribution of soil carbon in observational databases
and Earth system models, Geosci. Model Dev., 10, 1321–1337,
<ext-link xlink:href="https://doi.org/10.5194/gmd-10-1321-2017" ext-link-type="DOI">10.5194/gmd-10-1321-2017</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>Hechenbichler and Schliep(2004)</label><mixed-citation>Hechenbichler, K. and Schliep, K. P.: Weighted <inline-formula><mml:math id="M156" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>-nearest-neighbor
techniques and ordinal classification. Discussion paper 399, SFB 386,
Ludwig-Maximilians University, Munich, available at:
<uri>http://www.stat.uni-muenchen.de/sfb386/papers/dsp/paper399.ps</uri> (last
access: 16 July 2018), 2004.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>Hengl(2009)</label><mixed-citation>
Hengl, T.: A Practical Guide to Geostatistical Mapping, 2nd Edn., extended
edition of the EUR 22904 EN Scientific and Technical Research series report
published by 10 Office for Official Publications of the European Communities,
Luxembourg, 293 pp., 2009.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Hengl(2017)</label><mixed-citation>Hengl, T.: GSIF: Global Soil Information Facilities, available at:
<uri>https://CRAN.R-project.org/package=GSIF</uri> (last access: 16 July 2018), r
package version
0.5-4, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Hengl et al.(2004)Hengl, Heuvelink, and Stein</label><mixed-citation>Hengl, T., Heuvelink, G. B., and Stein, A.: A generic framework for spatial
prediction of soil variables based on regression-kriging, Geoderma, 120,
75–93, <ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2003.08.018" ext-link-type="DOI">10.1016/j.geoderma.2003.08.018</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx28"><label>Hengl et al.(2015)Hengl, Heuvelink, Kempen, Leenaars, Walsh,
Shepherd, Sila, MacMillan, De Jesus, Tamene, and Tondoh</label><mixed-citation>Hengl, T., Heuvelink, G. B., Kempen, B., Leenaars, J. G., Walsh, M. G.,
Shepherd, K. D., Sila, A., MacMillan, R. A., De Jesus, J. M., Tamene, L.,
and Tondoh, J. E.: Mapping soil properties of Africa at 250 m resolution:
Random forests significantly improve current predictions, PLoS ONE, 10,
1–26, <ext-link xlink:href="https://doi.org/10.1371/journal.pone.0125814" ext-link-type="DOI">10.1371/journal.pone.0125814</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>Hengl et al.(2017)Hengl, Mendes de Jesus, Heuvelink, Ruiperez
Gonzalez, Kilibarda, Blagotić, Shangguan, Wright, Geng,
Bauer-Marschallinger, Guevara, Vargas, MacMillan, Batjes, Leenaars, Ribeiro,
Wheeler, Mantel, and Kempen</label><mixed-citation>Hengl, T., Mendes de Jesus, J., Heuvelink, G. B. M., Ruiperez Gonzalez, M.,
Kilibarda, M., Blagotić, A., Shangguan, W., Wright, M. N., Geng, X.,
Bauer-Marschallinger, B., Guevara, M. A., Vargas, R., MacMillan, R. A.,
Batjes, N. H., Leenaars, J. G. B., Ribeiro, E., Wheeler, I., Mantel, S., and
Kempen, B.: SoilGrids250m: Global gridded soil information based on machine
learning, PLoS ONE, 12, e0169748, <ext-link xlink:href="https://doi.org/10.1371/journal.pone.0169748" ext-link-type="DOI">10.1371/journal.pone.0169748</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Hengl et al.(2018)Hengl, Nussbaum, Wright, and
Heuvelink</label><mixed-citation>Hengl, T., Nussbaum, M., Wright, M. N., and Heuvelink, G. B.: Random Forest as
a generic framework for predictive modeling of spatial and spatio-temporal
variables, PeerJ, 6, e26693v1,
<ext-link xlink:href="https://doi.org/10.7287/peerj.preprints.26693v1" ext-link-type="DOI">10.7287/peerj.preprints.26693v1</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Heumann(2011)</label><mixed-citation>Heumann, B. W.: An object-based classification of mangroves using a hybrid
decision tree-support vector machine approach, Remote Sens., 3,
2440–2460, <ext-link xlink:href="https://doi.org/10.3390/rs3112440" ext-link-type="DOI">10.3390/rs3112440</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Heuvelink(2018)</label><mixed-citation>Heuvelink, G. B. M.: Uncertainty and Uncertainty Propagation in Soil Mapping
and Modelling, Springer International Publishing, Cham,  439–461,
<ext-link xlink:href="https://doi.org/10.1007/978-3-319-63439-5_14" ext-link-type="DOI">10.1007/978-3-319-63439-5_14</ext-link>,  2018.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>Hiemstra et al.(2008)Hiemstra, Pebesma, Twenh”ofel, and
Heuvelink</label><mixed-citation>Hiemstra, P., Pebesma, E., Twenh”ofel, C., and Heuvelink, G.: Real-time
automatic interpolation of ambient gamma dose rates from the Dutch
Radioactivity Monitoring Network, Comput. Geosci., 35, 1711–1721,  <ext-link xlink:href="https://doi.org/10.1016/j.cageo.2008.10.011" ext-link-type="DOI">10.1016/j.cageo.2008.10.011</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx34"><label>Jackson et al.(2017)</label><mixed-citation>Jackson, R. B., Lajtha, K., Crow, S. E., Hugelius, G., Kramer, M. G., and
Piñeiro, G.: The Ecology of Soil Carbon: Pools, Vulnerabilities, and Biotic
and Abiotic Controls, Annu. Rev. Ecol. Evol. S.,
48, 419–445, <ext-link xlink:href="https://doi.org/10.1146/annurev-ecolsys-112414-054234" ext-link-type="DOI">10.1146/annurev-ecolsys-112414-054234</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Karatzoglou et al.(2006)Karatzoglou, Meyer, and
Hornik</label><mixed-citation>
Karatzoglou, A., Meyer, D., and Hornik, K.: Support Vector Algorithm in R,
J. Stat. Softw., 15, 1–28, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx36"><label>Köchy et al.(2015)Köchy, Hiederer, and
Freibauer</label><mixed-citation>Köchy, M., Hiederer, R., and Freibauer, A.: Global distribution of soil
organic carbon – Part 1: Masses and frequency distributions of SOC stocks
for the tropics, permafrost regions, wetlands, and the world, SOIL, 1,
351–365, <ext-link xlink:href="https://doi.org/10.5194/soil-1-351-2015" ext-link-type="DOI">10.5194/soil-1-351-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx37"><label>Kuhn et al.(2017)Kuhn., Wing, Weston, Williams, Keefer, Engelhardt,
Cooper, Mayer, Kenkel, the R Core Team, Benesty, Lescarbeau, Ziem, Scrucca,
Tang, Candan, and Hunt.</label><mixed-citation>Kuhn, M., Wing, J., Weston, S., Williams, A., Keefer, C., Engelhardt, A.,
Cooper, T., Mayer, Z., Kenkel, B., the R Core Team, Benesty, M., Lescarbeau,
R., Ziem, A., Scrucca, L., Tang, Y., Candan, C., and Hunt, T.: caret:
Classification and Regression Training, available at:
<uri>https://CRAN.R-project.org/package=caret</uri> (last access: 16 July 2018), r
package version
6.0-78, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx38"><label>Kumar et al.(2012)Kumar, Lal, and Liu</label><mixed-citation>Kumar, S., Lal, R., and Liu, D.: A geographically weighted regression kriging
approach for mapping soil organic carbon stock, Geoderma, 189–190, 627–634,
<ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2012.05.022" ext-link-type="DOI">10.1016/j.geoderma.2012.05.022</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx39"><label>Ließ et al.(2016)Ließ, Schmidt, and Glaser</label><mixed-citation>Ließ, M., Schmidt, J., and Glaser, B.: Improving the spatial prediction of
soil organic carbon stocks in a complex tropical mountain landscape by
methodological specifications in machine learning approaches, PLoS ONE, 11,
1–22, <ext-link xlink:href="https://doi.org/10.1371/journal.pone.0153673" ext-link-type="DOI">10.1371/journal.pone.0153673</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx40"><label>Malone et al.(2017)Malone, Minasny, and McBratney</label><mixed-citation>Malone, B. P., Minasny, B., and McBratney, A. B.: Using R for Digital Soil
Mapping, Springer International Publishing, <ext-link xlink:href="https://doi.org/10.1007/978-3-319-44327-0" ext-link-type="DOI">10.1007/978-3-319-44327-0</ext-link>,
2017.</mixed-citation></ref>
      <ref id="bib1.bibx41"><label>Marchetti et al.(2012)Marchetti, Piccini, Francaviglia, and
Mabit</label><mixed-citation>Marchetti, A., Piccini, C., Francaviglia, R., and Mabit, L.: Spatial
Distribution of Soil Organic Matter Using Geostatistics:<?pagebreak page192?> A Key Indicator to
Assess Soil Degradation Status in Central Italy, Pedosphere, 22, 230–242,
<ext-link xlink:href="https://doi.org/10.1016/S1002-0160(12)60010-1" ext-link-type="DOI">10.1016/S1002-0160(12)60010-1</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx42"><label>Martin et al.(2011)Martin, Wattenbach, Smith, Meersmans, Jolivet,
Boulonne, and Arrouays</label><mixed-citation>Martin, M. P., Wattenbach, M., Smith, P., Meersmans, J., Jolivet, C.,
Boulonne, L., and Arrouays, D.: Spatial distribution of soil organic carbon
stocks in France, Biogeosciences, 8, 1053–1065,
<ext-link xlink:href="https://doi.org/10.5194/bg-8-1053-2011" ext-link-type="DOI">10.5194/bg-8-1053-2011</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx43"><label>McBratney et al.(2003)McBratney, Santos, and Minasny</label><mixed-citation>McBratney, A., Santos, M. M., and Minasny, B.: On digital soil mapping,
Geoderma, 117, 3–52,
<ext-link xlink:href="https://doi.org/10.1016/S0016-7061(03)00223-4" ext-link-type="DOI">10.1016/S0016-7061(03)00223-4</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bibx44"><label>Meinshausen(2006)</label><mixed-citation>
Meinshausen, N.: Quantile Regression Forests, J. Mach. Learn. Res., 7,
983–999,
2006.</mixed-citation></ref>
      <ref id="bib1.bibx45"><label>Meinshausen(2017)</label><mixed-citation>Meinshausen, N.: quantregForest: Quantile Regression Forests, available at:
<uri>https://CRAN.R-project.org/package=quantregForest</uri> (last access:
16 July 2018), r package
version 1.3-7, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx46"><label>Minasny et al.(2013)Minasny, McBratney, Malone, and
Wheeler</label><mixed-citation>Minasny, B., McBratney, A. B., Malone, B. P., and Wheeler, I.: Digital Mapping
of Soil Carbon, in: Advances in Agronomy,  1–47, Elsevier,
<ext-link xlink:href="https://doi.org/10.1016/b978-0-12-405942-9.00001-3" ext-link-type="DOI">10.1016/b978-0-12-405942-9.00001-3</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx47"><label>Mishra et al.(2009)Mishra, Lal, Slater, Calhoun, Liu, and Van
Meirvenne</label><mixed-citation>Mishra, U., Lal, R., Slater, B., Calhoun, F., Liu, D., and Van Meirvenne, M.:
Predicting Soil Organic Carbon Stock Using Profile Depth Distribution
Functions and Ordinary Kriging, Soil Sci. Soc. Am. J., 73,
614–621, <ext-link xlink:href="https://doi.org/10.2136/sssaj2007.0410" ext-link-type="DOI">10.2136/sssaj2007.0410</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx48"><label>Møller et al.(2018)</label><mixed-citation>Møller, A. B., Beucher, A., Iversen, B. V., and Greve, M. H.: Predicting
artificially drained areas by means of a selective model ensemble, Geoderma,
320, 30–42, <ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2018.01.018" ext-link-type="DOI">10.1016/j.geoderma.2018.01.018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx49"><label>Mondal et al.(2017)Mondal, Khare, Kundu, Mondal, Mukherjee, and
Mukhopadhyay</label><mixed-citation>Mondal, A., Khare, D., Kundu, S., Mondal, S., Mukherjee, S., and Mukhopadhyay,
A.: Spatial soil organic carbon (SOC) prediction by regression kriging using
remote sensing data, Egyptian Journal of Remote Sensing and Space Science,
20, 61–70, <ext-link xlink:href="https://doi.org/10.1016/j.ejrs.2016.06.004" ext-link-type="DOI">10.1016/j.ejrs.2016.06.004</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx50"><label>Nelson and Sommers(1982)</label><mixed-citation>
Nelson, D. W. and Sommers, L. E.: Total carbon, organic carbon and organic matter, in: Methods of soil analysis. Part 2 Chemical and Microbiological Properties, edited by:  Page,
A. L., Miller, R. H., and  Keeney, D. R., 539–579, 1982.</mixed-citation></ref>
      <ref id="bib1.bibx51"><label>Nussbaum et al.(2014)Nussbaum, Papritz, Baltensweiler, and
Walthert</label><mixed-citation>Nussbaum, M., Papritz, A., Baltensweiler, A., and Walthert, L.: Estimating
soil organic carbon stocks of Swiss forest soils by robust external-drift
kriging, Geosci. Model Dev., 7, 1197–1210,
<ext-link xlink:href="https://doi.org/10.5194/gmd-7-1197-2014" ext-link-type="DOI">10.5194/gmd-7-1197-2014</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx52"><label>Nussbaum et al.(2018)Nussbaum, Spiess, Baltensweiler, Grob, Keller,
Greiner, Schaepman, and Papritz</label><mixed-citation>Nussbaum, M., Spiess, K., Baltensweiler, A., Grob, U., Keller, A., Greiner,
L., Schaepman, M. E., and Papritz, A.: Evaluation of digital soil mapping
approaches with large sets of environmental covariates, SOIL, 4, 1–22,
<ext-link xlink:href="https://doi.org/10.5194/soil-4-1-2018" ext-link-type="DOI">10.5194/soil-4-1-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx53"><label>Padarian et al.(2017)Padarian, Minasny, and McBratney</label><mixed-citation>Padarian, J., Minasny, B., and McBratney, A.: Chile and the Chilean soil grid:
A contribution to GlobalSoilMap, Geoderma Regional, 9, 17–28,
<ext-link xlink:href="https://doi.org/10.1016/j.geodrs.2016.12.001" ext-link-type="DOI">10.1016/j.geodrs.2016.12.001</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx54"><label>Peng et al.(2013)Peng, Bing, Guangpo, and Guangcan</label><mixed-citation>Peng, G., Bing, W., Guangpo, G., and Guangcan, Z.: Spatial distribution of
soil organic carbon and total nitrogen based on GIS and geostatistics in a
small watershed in a hilly area of northern China, PLoS ONE, 8, 1–9,
<ext-link xlink:href="https://doi.org/10.1371/journal.pone.0083592" ext-link-type="DOI">10.1371/journal.pone.0083592</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx55"><label>R Core Team(2017)</label><mixed-citation>R Core Team: R: A Language and Environment for Statistical Computing, R
Foundation for Statistical Computing, Vienna, Austria,
available at: <uri>https://www.R-project.org/</uri> (last access: 16 July 2018),
2017.</mixed-citation></ref>
      <ref id="bib1.bibx56"><label>Reyes-Rojas et al.(2018)Reyes-Rojas, L. A., and Ventura</label><mixed-citation>Reyes-Rojas, L. A., Adhikari, K., and Ventura, S. J.: Projecting Soil Organic
Carbon Distribution in Central Chile under Future Climate Scenarios, J.
Environ. Qual., 47, 735–745, <ext-link xlink:href="https://doi.org/10.2134/jeq2017.08.0329" ext-link-type="DOI">10.2134/jeq2017.08.0329</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx57"><label>Rossel and Behrens(2010)</label><mixed-citation>Rossel, R. V. and Behrens, T.: Using data mining to model and interpret soil
diffuse reflectance spectra, Geoderma, 158, 46–54,
<ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2009.12.025" ext-link-type="DOI">10.1016/j.geoderma.2009.12.025</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx58"><label>Samuel-Rosa et al.(2015)Samuel-Rosa, Heuvelink, Vasques, and
Anjos</label><mixed-citation>Samuel-Rosa, A., Heuvelink, G., Vasques, G., and Anjos, L.: Do more detailed
environmental covariates deliver more accurate soil maps?, Geoderma,
243–244, 214–227, <ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2014.12.017" ext-link-type="DOI">10.1016/j.geoderma.2014.12.017</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx59"><label>Sanderman et al.(2017)Sanderman, Hengl, and Fiske</label><mixed-citation>Sanderman, J., Hengl, T., and Fiske, G. J.: Soil carbon debt of 12,000 years
of human land use, P. Natl. Acad. Sci. USA, 114, 9575–9580,
<ext-link xlink:href="https://doi.org/10.1073/pnas.1706103114" ext-link-type="DOI">10.1073/pnas.1706103114</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx60"><label>Shangguan et al.(2017)Shangguan, Hengl, Mendes de Jesus, Yuan, and
Dai</label><mixed-citation>Shangguan, W., Hengl, T., Mendes de Jesus, J., Yuan, H., and Dai, Y.:
Mapping the global depth to bedrock for land surface modeling, J.
Adv. Model. Earth Sy., 9, 65–88, <ext-link xlink:href="https://doi.org/10.1002/2016MS000686" ext-link-type="DOI">10.1002/2016MS000686</ext-link>,
2017.</mixed-citation></ref>
      <ref id="bib1.bibx61"><label>Silverman and Jones(1989)</label><mixed-citation>
Silverman, B. W. and Jones, M. C.: An Important Contribution to Nonparametric
Discriminant Analysis and Density Estimation: Commentary on Fix and
Hodges (1951), Int. Stat. Rev., 57, 233–238, 1989.</mixed-citation></ref>
      <ref id="bib1.bibx62"><label>Sreenivas et al.(2016)Sreenivas, Dadhwal, Kumar, Harsha, Mitran,
Sujatha, Suresh, Fyzee, and Ravisankar</label><mixed-citation>Sreenivas, K., Dadhwal, V. K., Kumar, S., Harsha, G. S., Mitran, T., Sujatha,
G., Suresh, G. J. R., Fyzee, M. A., and Ravisankar, T.: Digital mapping of
soil organic and inorganic carbon status in India, Geoderma, 269, 160–173,
<ext-link xlink:href="https://doi.org/10.1016/j.geoderma.2016.02.002" ext-link-type="DOI">10.1016/j.geoderma.2016.02.002</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx63"><label>Taylor(2001)</label><mixed-citation>Taylor, K. E.: Summarizing multiple aspects of model performance in a single
diagram, J. Geophys. Res.-Atmos., 106,  7183–7192, <ext-link xlink:href="https://doi.org/10.1029/2000JD900719" ext-link-type="DOI">10.1029/2000JD900719</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx64"><label>Tifafi et al.(2018)</label><mixed-citation>Tifafi, M., Guenet, B., and Hatté, C.: Large Differences in Global and
Regional Total Soil Carbon Stock Estimates Based on SoilGrids, HWSD, and
NCSCD: Intercomparison and Evaluation Based on Field Data From USA,
England, Wales, and France,  Global Biogeochem. Cy., 32, 42–56, <ext-link xlink:href="https://doi.org/10.1002/2017GB005678" ext-link-type="DOI">10.1002/2017GB005678</ext-link>,
2018.</mixed-citation></ref>
      <ref id="bib1.bibx65"><label>Vargas et al.(2013)Vargas, Paz, and de Jong</label><mixed-citation>Vargas, R., Paz, F., and de Jong, B.: Quantification of forest degradation and
belowground carbon dynamics: ongoing challenges for monitoring, reporting and
verification activities for REDD+, Carbon Manag., 4, 579–582,
<ext-link xlink:href="https://doi.org/10.4155/cmt.13.63" ext-link-type="DOI">10.4155/cmt.13.63</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx66"><label>Vargas et al.(2017)Vargas, Alcaraz-Segura, Birdsey, Brunsell,
Cruz-Gaistardo, de Jong, Etchevers, Guevara, Hayes, Johnson, Loescher, Paz,
Ryu, Sanchez-Mejia, and Toledo-Gutierrez</label><mixed-citation>Vargas, R., Alcaraz-Segura, D., Birdsey, R., Brunsell, N. A., Cruz-Gaistardo,
C. O., de Jong, B., Etchevers, J., Guevara, M., Hayes, D. J., Johnson, K.,
Loescher, H. W., Paz, F., Ryu, Y., Sanchez-Mejia, Z., and Toledo-Gutierrez,
K. P.: Enhancing interoperability to facilitate implementation of REDD+:
case study of Mexico, Carbon Manag., 8, 57–65,
<ext-link xlink:href="https://doi.org/10.1080/17583004.2017.1285177" ext-link-type="DOI">10.1080/17583004.2017.1285177</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx67"><label>Vasques et al.(2016)Vasques, Coelho, Dart, Oliveira, and
Teixeira</label><mixed-citation>
Vasques, G. M., Coelho, M. A.-C. R., Dart, R. O., Oliveira, R. P., and
Teixeira, W. G.: Mapping soil carbon, particle-size fractions, and water
retention in tropical dry forest in Brazil, Pesqui. Agropecu.
Bras., 51, 1371–1385, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx68"><label>Viscarra Rossel et al.(2014)Viscarra Rossel, Webster, Bui, and
Baldock</label><mixed-citation>Viscarra Rossel, R. A., Webster, R., Bui, E. N., and Baldock, J. A.:
Baseline map of organic carbon in Australian soil to support national carbon
accounting and monitoring under climate change, Glob. Change Biol., 20,
2953–2970, <ext-link xlink:href="https://doi.org/10.1111/gcb.12569" ext-link-type="DOI">10.1111/gcb.12569</ext-link>, 2014.</mixed-citation></ref>
      <?pagebreak page193?><ref id="bib1.bibx69"><label>Wold(1983)</label><mixed-citation>Wold, H.: Systems Analysis by Partial Least Squares, Iiasa collaborative paper,
IIASA, Laxenburg, Austria, available at: <uri>http://pure.iiasa.ac.at/2336/</uri> (last access: 16 July 2018),
1983.</mixed-citation></ref>
      <ref id="bib1.bibx70"><label>Yang et al.(2016)Yang, Zhang, Yang, Zhi, Yang, Liu, Zhao, and
Li</label><mixed-citation>Yang, R.-M., Zhang, G.-L., Yang, F., Zhi, J.-J., Yang, F., Liu, F., Zhao,
Y.-G., and Li, D.-C.: Precise estimation of soil organic carbon stocks in
the northeast Tibetan Plateau, Sci. Rep.-UK, 6, 21842,
<ext-link xlink:href="https://doi.org/10.1038/srep21842" ext-link-type="DOI">10.1038/srep21842</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx71"><label>Yigini and Panagos(2016)</label><mixed-citation>Yigini, Y. and Panagos, P.: Assessment of soil organic carbon stocks under
future climate and land cover changes in Europe, Sci. Total
Environ., 557–558, 838–850, <ext-link xlink:href="https://doi.org/10.1016/j.scitotenv.2016.03.085" ext-link-type="DOI">10.1016/j.scitotenv.2016.03.085</ext-link>, 2016.
</mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bibx72"><label>Yigini et al.(2018)Yigini, Olmedo, Reiter, Baritz, Viatkin, and
Vargas</label><mixed-citation>Yigini, Y., Olmedo, G. F., Reiter, S., Baritz, R., Viatkin, K., and Vargas,
R. R. (Eds.): Soil Organic Carbon Mapping Cookbook, FAO, Rome, Italy, 2nd
Edn., available at: <uri>http://www.fao.org/documents/card/en/c/I8895EN</uri> (last access: 16 July 2018), 2018.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>No silver bullet for digital soil mapping: country-specific soil organic carbon estimates across Latin America</article-title-html>
<abstract-html><p>Country-specific soil organic carbon (SOC) estimates are the baseline for
the Global SOC Map of the Global Soil Partnership (GSOCmap-GSP). This
endeavor is key to explaining the uncertainty of global SOC estimates but
requires harmonizing heterogeneous datasets and building country-specific
capacities for digital soil mapping (DSM). We identified country-specific
predictors for SOC and tested the performance of five predictive algorithms
for mapping SOC across Latin America. The algorithms included support vector
machines (SVMs), random forest (RF), kernel-weighted nearest neighbors (KK),
partial least squares regression (PL), and regression kriging based on
stepwise multiple linear models (RK). Country-specific training data and SOC
predictors (5&thinsp; × &thinsp;5&thinsp;km pixel resolution) were obtained from
ISRIC – World Soil Information. Temperature, soil type, vegetation
indices, and topographic constraints were the best predictors for SOC, but
country-specific predictors and their respective weights varied across Latin
America. We compared a large diversity of country-specific datasets and
models, and were able to explain SOC variability in a range between  ∼ &thinsp;1 and  ∼ &thinsp;60&thinsp;%, with no universal predictive algorithm among
countries. A regional (<i>n</i>&thinsp; = &thinsp;11&thinsp;268 SOC estimates) ensemble of these
five algorithms was able to explain  ∼ &thinsp;39&thinsp;% of SOC variability from
repeated 5-fold cross-validation. We report a combined SOC stock of
77.8&thinsp;±&thinsp;43.6&thinsp;Pg (uncertainty represented by the full conditional
response of independent model residuals) across Latin America. SOC stocks
were higher in tropical forests (30&thinsp;±&thinsp;16.5&thinsp;Pg) and croplands
(13&thinsp;±&thinsp;8.1&thinsp;Pg). Country-specific and regional ensembles revealed
spatial discrepancies across geopolitical borders, higher elevations, and
coastal plains, but provided similar regional stocks (77.8&thinsp;±&thinsp;42.2 and
76.8&thinsp;±&thinsp;45.1&thinsp;Pg, respectively). These results are conservative
compared to global estimates (e.g., SoilGrids250m 185.8&thinsp;Pg, the Harmonized
World Soil Database 138.4&thinsp;Pg, or the GSOCmap-GSP 99.7&thinsp;Pg). Countries with
large area (i.e., Brazil, Bolivia, Mexico, Peru) and large spatial SOC
heterogeneity had lower SOC stocks per unit area and larger uncertainty in
their predictions. We highlight that expert opinion is needed to set boundary
prediction limits to avoid unrealistically high modeling estimates. For
maximizing explained variance while minimizing prediction bias, the selection
of predictive algorithms for SOC mapping should consider density of available
data and variability of country-specific environmental gradients. This study
highlights the large degree of spatial uncertainty in SOC estimates across
Latin America. We provide a framework for improving country-specific mapping
efforts and reducing current discrepancy of global, regional, and
country-specific SOC estimates.</p></abstract-html>
<ref-html id="bib1.bib1"><label>Adhikari et al.(2014)Adhikari, Hartemink, Minasny, Bou Kheir,
Greve, and Greve</label><mixed-citation>
Adhikari, K., Hartemink, A. E., Minasny, B., Bou Kheir, R., Greve, M. B., and
Greve, M. H.: Digital mapping of soil organic carbon contents and stocks in
Denmark, PLoS ONE, 9, e105519, <a href="https://doi.org/10.1371/journal.pone.0105519" target="_blank">https://doi.org/10.1371/journal.pone.0105519</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Angelini et al.(2016)Angelini, Heuvelink, Kempen, and
Morrás</label><mixed-citation>
Angelini, M. E., Heuvelink, G. B., Kempen, B., and Morrás, H. J.: Mapping the
soils of an Argentine Pampas region using structural equation modelling,
Geoderma, 281, 102–118,
<a href="https://doi.org/10.1016/j.geoderma.2016.06.031" target="_blank">https://doi.org/10.1016/j.geoderma.2016.06.031</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Angelini et al.(2017)Angelini, Heuvelink, and Kempen</label><mixed-citation>
Angelini, M. E., Heuvelink, G. B. M., and Kempen, B.: Multivariate mapping of
soil with structural equation modelling, Eur. J. Soil Sci.,
68, 575–591, <a href="https://doi.org/10.1111/ejss.12446" target="_blank">https://doi.org/10.1111/ejss.12446</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Arrouays et al.(2017)</label><mixed-citation>
Arrouays, D., Leenaars, J. G., de Forges, A. C. R., Adhikari, K., Ballabio, C.,
Greve, M., Grundy, M., Guerrero, E., Hempel, J., Hengl, T., Heuvelink, G.,
Batjes, N., Carvalho, E., Hartemink, A., Hewitt, A., Hong, S.-Y.,
Krasilnikov, P., Lagacherie, P., Lelyk, G., Libohova, Z., Lilly, A.,
McBratney, A., McKenzie, N., Vasquez, G. M., Mulder, V. L., Minasny, B.,
Montanarella, L., Odeh, I., Padarian, J., Poggio, L., Roudier, P., Saby, N.,
Savin, I., Searle, R., Solbovoy, V., Thompson, J., Smith, S., Sulaeman, Y.,
Vintila, R., Rossel, R. V., Wilson, P., Zhang, G.-L., Swerts, M., Oorts, K.,
Karklins, A., Feng, L., Navarro, A. R. I., Levin, A., Laktionova, T.,
Dell'Acqua, M., Suvannang, N., Ruam, W., Prasad, J., Patil, N., Husnjak, S.,
Pásztor, L., Okx, J., Hallett, S., Keay, C., Farewell, T., Lilja, H.,
Juilleret, J., Marx, S., Takata, Y., Kazuyuki, Y., Mansuy, N., Panagos, P.,
Liedekerke, M. V., Skalsky, R., Sobocka, J., Kobza, J., Eftekhari, K.,
Alavipanah, S. K., Moussadek, R., Badraoui, M., Silva, M. D., Paterson, G.,
da Conceição Gonçalves, M., Theocharopoulos, S., Yemefack, M., Tedou, S.,
Vrscaj, B., Grob, U., Kozák, J., Boruvka, L., Dobos, E., Taboada, M.,
Moretti, L., and Rodriguez, D.: Soil legacy data rescue via GlobalSoilMap and
other international and national initiatives, GeoResJ, 14, 1–19,
<a href="https://doi.org/10.1016/j.grj.2017.06.001" target="_blank">https://doi.org/10.1016/j.grj.2017.06.001</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Batjes et al.(2017)Batjes, Ribeiro, van Oostrum, Leenaars, Hengl, and
de Jesus</label><mixed-citation>
Batjes, N. H., Ribeiro, E., van Oostrum, A., Leenaars, J., Hengl, T., and Mendes de Jesus, J.:
WoSIS: providing standardised soil profile data for the world, Earth Syst. Sci. Data, 9, 1–14, <a href="https://doi.org/10.5194/essd-9-1-2017" target="_blank">https://doi.org/10.5194/essd-9-1-2017</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Bishop et al.(1999)Bishop, McBratney, and Laslett</label><mixed-citation>
Bishop, T., McBratney, A., and Laslett, G.: Modelling soil attribute depth
functions with equal-area quadratic smoothing splines, Geoderma, 91, 27–45, <a href="https://doi.org/10.1016/S0016-7061(99)00003-8" target="_blank">https://doi.org/10.1016/S0016-7061(99)00003-8</a>, 1999.
</mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Blanco et al.(2013)</label><mixed-citation>
Blanco, P. D., Colditz, R. R., Saldaña, G. L., Hardtke, L. A., Llamas, R. M.,
Mari, N. A., Fischer, A., Caride, C., Aceñolaza, P. G., del Valle, H. F.,
Lillo-Saavedra, M., Coronato, F., Opazo, S. A., Morelli, F., Anaya, J. A.,
Sione, W. F., Zamboni, P., and Arroyo, V. B.: A land cover map of Latin
America and the Caribbean in the framework of the SERENA project, Remote
Sens. Environ., 132, 13–31,
<a href="https://doi.org/10.1016/j.rse.2012.12.025" target="_blank">https://doi.org/10.1016/j.rse.2012.12.025</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Bonfatti et al.(2016)Bonfatti, Hartemink, and Giasson</label><mixed-citation>
Bonfatti, B. R., Hartemink, A. E., and Giasson, E.: Comparing Soil C Stocks
from Soil Profile Data Using Four Different Methods, in: Progress in Soil
Science,   315–329, Springer International Publishing,
<a href="https://doi.org/10.1007/978-3-319-28295-4_20" target="_blank">https://doi.org/10.1007/978-3-319-28295-4_20</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Borra and Di Ciaccio(2010)</label><mixed-citation>
Borra, S. and Di Ciaccio, A.: Measuring the Prediction Error. A Comparison of
Cross-validation, Bootstrap and Covariance Penalty Methods, Comput. Stat.
Data An., 54, 2976–2989, <a href="https://doi.org/10.1016/j.csda.2010.03.004" target="_blank">https://doi.org/10.1016/j.csda.2010.03.004</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Breiman(1996)</label><mixed-citation>
Breiman, L.: Bagging Predictors, Mach. Learn., 24, 123–140,
<a href="https://doi.org/10.1023/A:1018054314350" target="_blank">https://doi.org/10.1023/A:1018054314350</a>,  1996.
</mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Breiman(2001)</label><mixed-citation>
Breiman, L.: Random forests, Mach. Learn., 45, 5–32,
<a href="https://doi.org/10.1023/A:1010933404324" target="_blank">https://doi.org/10.1023/A:1010933404324</a>, 2001.
</mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Carslaw and Ropkins(2012)</label><mixed-citation>
Carslaw, D. C. and Ropkins, K.: openair – An R package for air quality data
analysis, Environ. Modell. Softw., 27–28, 52–61,
<a href="https://doi.org/10.1016/j.envsoft.2011.09.008" target="_blank">https://doi.org/10.1016/j.envsoft.2011.09.008</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Conrad et al.(2015)Conrad, Bechtel, Bock, Dietrich, Fischer, Gerlitz,
Wehberg, Wichmann, and Böhner</label><mixed-citation>
Conrad, O., Bechtel, B., Bock, M., Dietrich, H., Fischer, E., Gerlitz, L.,
Wehberg, J., Wichmann, V., and Böhner, J.: System for Automated
Geoscientific Analyses (SAGA) v. 2.1.4, Geosci. Model Dev., 8, 1991–2007,
<a href="https://doi.org/10.5194/gmd-8-1991-2015" target="_blank">https://doi.org/10.5194/gmd-8-1991-2015</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Deane-Mayer and Knowles(2016)</label><mixed-citation>
Deane-Mayer, Z. A. and Knowles, J. E.: caretEnsemble: Ensembles of Caret
Models, available at: <a href="https://CRAN.R-project.org/package=caretEnsemble" target="_blank">https://CRAN.R-project.org/package=caretEnsemble</a> (last access: 16 July 2018), r
package version 2.0.0, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Delgado-Baquerizo et al.(2017)Delgado-Baquerizo, Eldridge, Maestre,
Karunaratne, Trivedi, Reich, and Singh</label><mixed-citation>
Delgado-Baquerizo, M., Eldridge, D. J., Maestre, F. T., Karunaratne, S. B.,
Trivedi, P., Reich, P. B., and Singh, B. K.: Climate legacies drive global
soil carbon stocks in terrestrial ecosystems, Sci. Adv., 3,
e1602008, <a href="https://doi.org/10.1126/sciadv.1602008" target="_blank">https://doi.org/10.1126/sciadv.1602008</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Drew(1973)</label><mixed-citation>
Drew, L. A.: Bulk Density Estimation Based on Organic Matter Content of Some
Minnesota
Soils, St. Paul, Minn., School of Forestry, University of Minnesota, Digital
Conservancy, available at: <a href="http://hdl.handle.net/11299/58293" target="_blank">http://hdl.handle.net/11299/58293</a> (last access: 16 July 2018), 1973.
</mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>FAO(2017)</label><mixed-citation>
FAO: Fifth Meeting of the Global Soil Partnership Plenary Assembly, available
at: <a href="http://www.fao.org/3/a-bs973e.pdf" target="_blank">http://www.fao.org/3/a-bs973e.pdf</a> (last access: 16 July 2018),
2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Finke(2012)</label><mixed-citation>
Finke, P. A.: On digital soil assessment with models and the Pedometrics
agenda, Geoderma, 171–172, 3–15,
<a href="https://doi.org/10.1016/j.geoderma.2011.01.001" target="_blank">https://doi.org/10.1016/j.geoderma.2011.01.001</a>,   2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Florinsky(2012)</label><mixed-citation>
Florinsky, I. V.: The Dokuchaev hypothesis as a basis for predictive digital
soil mapping (on the 125th anniversary of its publication), Eurasian Soil
Sci., 45, 445–451, <a href="https://doi.org/10.1134/S1064229312040047" target="_blank">https://doi.org/10.1134/S1064229312040047</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Grimm et al.(2008)Grimm, Behrens, Märker, and
Elsenbeer</label><mixed-citation>
Grimm, R., Behrens, T., Märker, M., and Elsenbeer, H.: Soil organic
carbon concentrations and stocks on Barro Colorado Island – Digital soil
mapping using Random Forests analysis, Geoderma, 146, 102–113,
<a href="https://doi.org/10.1016/j.geoderma.2008.05.008" target="_blank">https://doi.org/10.1016/j.geoderma.2008.05.008</a>, 2008.
</mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Guevara et al.(2018)Guevara, Olmedo, and Vargas</label><mixed-citation>
Guevara, M., Olmedo, G. F., and Vargas, R.: DSM-LAC/NoSilverBulletOnDSM: No
Silver Bullets – raw code, Zenodo, <a href="https://doi.org/10.5281/zenodo.1304392" target="_blank">https://doi.org/10.5281/zenodo.1304392</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Harden et al.(2017)</label><mixed-citation>
Harden, J. W., Hugelius, G., Ahlström, A., Blankinship, J. C.,
Bond-Lamberty,
B., Lawrence, C. R., Loisel, J., Malhotra, A., Jackson, R. B., Ogle, S.,
Phillips, C., Ryals, R., Todd-Brown, K., Vargas, R., Vergara, S. E., Cotrufo,
M. F., Keiluweit, M., Heckman, K. A., Crow, S. E., Silver, W. L., DeLonge,
M., and Nave, L. E.: Networking our science to characterize the state,
vulnerabilities, and management opportunities of soil organic matter, Glob.
Change Biol., 24,  e705–e718, <a href="https://doi.org/10.1111/gcb.13896" target="_blank">https://doi.org/10.1111/gcb.13896</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Hashimoto et al.(2017)</label><mixed-citation>
Hashimoto, S., Nanko, K., Ťupek, B., and Lehtonen, A.: Data-mining
analysis of the global distribution of soil carbon in observational databases
and Earth system models, Geosci. Model Dev., 10, 1321–1337,
<a href="https://doi.org/10.5194/gmd-10-1321-2017" target="_blank">https://doi.org/10.5194/gmd-10-1321-2017</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Hechenbichler and Schliep(2004)</label><mixed-citation>
Hechenbichler, K. and Schliep, K. P.: Weighted <i>k</i>-nearest-neighbor
techniques and ordinal classification. Discussion paper 399, SFB 386,
Ludwig-Maximilians University, Munich, available at:
<a href="http://www.stat.uni-muenchen.de/sfb386/papers/dsp/paper399.ps" target="_blank">http://www.stat.uni-muenchen.de/sfb386/papers/dsp/paper399.ps</a> (last
access: 16 July 2018), 2004.
</mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Hengl(2009)</label><mixed-citation>
Hengl, T.: A Practical Guide to Geostatistical Mapping, 2nd Edn., extended
edition of the EUR 22904 EN Scientific and Technical Research series report
published by 10 Office for Official Publications of the European Communities,
Luxembourg, 293 pp., 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Hengl(2017)</label><mixed-citation>
Hengl, T.: GSIF: Global Soil Information Facilities, available at:
<a href="https://CRAN.R-project.org/package=GSIF" target="_blank">https://CRAN.R-project.org/package=GSIF</a> (last access: 16 July 2018), r
package version
0.5-4, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Hengl et al.(2004)Hengl, Heuvelink, and Stein</label><mixed-citation>
Hengl, T., Heuvelink, G. B., and Stein, A.: A generic framework for spatial
prediction of soil variables based on regression-kriging, Geoderma, 120,
75–93, <a href="https://doi.org/10.1016/j.geoderma.2003.08.018" target="_blank">https://doi.org/10.1016/j.geoderma.2003.08.018</a>, 2004.
</mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Hengl et al.(2015)Hengl, Heuvelink, Kempen, Leenaars, Walsh,
Shepherd, Sila, MacMillan, De Jesus, Tamene, and Tondoh</label><mixed-citation>
Hengl, T., Heuvelink, G. B., Kempen, B., Leenaars, J. G., Walsh, M. G.,
Shepherd, K. D., Sila, A., MacMillan, R. A., De Jesus, J. M., Tamene, L.,
and Tondoh, J. E.: Mapping soil properties of Africa at 250&thinsp;m resolution:
Random forests significantly improve current predictions, PLoS ONE, 10,
1–26, <a href="https://doi.org/10.1371/journal.pone.0125814" target="_blank">https://doi.org/10.1371/journal.pone.0125814</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Hengl et al.(2017)Hengl, Mendes de Jesus, Heuvelink, Ruiperez
Gonzalez, Kilibarda, Blagotić, Shangguan, Wright, Geng,
Bauer-Marschallinger, Guevara, Vargas, MacMillan, Batjes, Leenaars, Ribeiro,
Wheeler, Mantel, and Kempen</label><mixed-citation>
Hengl, T., Mendes de Jesus, J., Heuvelink, G. B. M., Ruiperez Gonzalez, M.,
Kilibarda, M., Blagotić, A., Shangguan, W., Wright, M. N., Geng, X.,
Bauer-Marschallinger, B., Guevara, M. A., Vargas, R., MacMillan, R. A.,
Batjes, N. H., Leenaars, J. G. B., Ribeiro, E., Wheeler, I., Mantel, S., and
Kempen, B.: SoilGrids250m: Global gridded soil information based on machine
learning, PLoS ONE, 12, e0169748, <a href="https://doi.org/10.1371/journal.pone.0169748" target="_blank">https://doi.org/10.1371/journal.pone.0169748</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Hengl et al.(2018)Hengl, Nussbaum, Wright, and
Heuvelink</label><mixed-citation>
Hengl, T., Nussbaum, M., Wright, M. N., and Heuvelink, G. B.: Random Forest as
a generic framework for predictive modeling of spatial and spatio-temporal
variables, PeerJ, 6, e26693v1,
<a href="https://doi.org/10.7287/peerj.preprints.26693v1" target="_blank">https://doi.org/10.7287/peerj.preprints.26693v1</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Heumann(2011)</label><mixed-citation>
Heumann, B. W.: An object-based classification of mangroves using a hybrid
decision tree-support vector machine approach, Remote Sens., 3,
2440–2460, <a href="https://doi.org/10.3390/rs3112440" target="_blank">https://doi.org/10.3390/rs3112440</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Heuvelink(2018)</label><mixed-citation>
Heuvelink, G. B. M.: Uncertainty and Uncertainty Propagation in Soil Mapping
and Modelling, Springer International Publishing, Cham,  439–461,
<a href="https://doi.org/10.1007/978-3-319-63439-5_14" target="_blank">https://doi.org/10.1007/978-3-319-63439-5_14</a>,  2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Hiemstra et al.(2008)Hiemstra, Pebesma, Twenh”ofel, and
Heuvelink</label><mixed-citation>
Hiemstra, P., Pebesma, E., Twenh”ofel, C., and Heuvelink, G.: Real-time
automatic interpolation of ambient gamma dose rates from the Dutch
Radioactivity Monitoring Network, Comput. Geosci., 35, 1711–1721,  <a href="https://doi.org/10.1016/j.cageo.2008.10.011" target="_blank">https://doi.org/10.1016/j.cageo.2008.10.011</a>, 2008.
</mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Jackson et al.(2017)</label><mixed-citation>
Jackson, R. B., Lajtha, K., Crow, S. E., Hugelius, G., Kramer, M. G., and
Piñeiro, G.: The Ecology of Soil Carbon: Pools, Vulnerabilities, and Biotic
and Abiotic Controls, Annu. Rev. Ecol. Evol. S.,
48, 419–445, <a href="https://doi.org/10.1146/annurev-ecolsys-112414-054234" target="_blank">https://doi.org/10.1146/annurev-ecolsys-112414-054234</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Karatzoglou et al.(2006)Karatzoglou, Meyer, and
Hornik</label><mixed-citation>
Karatzoglou, A., Meyer, D., and Hornik, K.: Support Vector Algorithm in R,
J. Stat. Softw., 15, 1–28, 2006.
</mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Köchy et al.(2015)Köchy, Hiederer, and
Freibauer</label><mixed-citation>
Köchy, M., Hiederer, R., and Freibauer, A.: Global distribution of soil
organic carbon – Part 1: Masses and frequency distributions of SOC stocks
for the tropics, permafrost regions, wetlands, and the world, SOIL, 1,
351–365, <a href="https://doi.org/10.5194/soil-1-351-2015" target="_blank">https://doi.org/10.5194/soil-1-351-2015</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>Kuhn et al.(2017)Kuhn., Wing, Weston, Williams, Keefer, Engelhardt,
Cooper, Mayer, Kenkel, the R Core Team, Benesty, Lescarbeau, Ziem, Scrucca,
Tang, Candan, and Hunt.</label><mixed-citation>
Kuhn, M., Wing, J., Weston, S., Williams, A., Keefer, C., Engelhardt, A.,
Cooper, T., Mayer, Z., Kenkel, B., the R Core Team, Benesty, M., Lescarbeau,
R., Ziem, A., Scrucca, L., Tang, Y., Candan, C., and Hunt, T.: caret:
Classification and Regression Training, available at:
<a href="https://CRAN.R-project.org/package=caret" target="_blank">https://CRAN.R-project.org/package=caret</a> (last access: 16 July 2018), r
package version
6.0-78, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Kumar et al.(2012)Kumar, Lal, and Liu</label><mixed-citation>
Kumar, S., Lal, R., and Liu, D.: A geographically weighted regression kriging
approach for mapping soil organic carbon stock, Geoderma, 189–190, 627–634,
<a href="https://doi.org/10.1016/j.geoderma.2012.05.022" target="_blank">https://doi.org/10.1016/j.geoderma.2012.05.022</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Ließ et al.(2016)Ließ, Schmidt, and Glaser</label><mixed-citation>
Ließ, M., Schmidt, J., and Glaser, B.: Improving the spatial prediction of
soil organic carbon stocks in a complex tropical mountain landscape by
methodological specifications in machine learning approaches, PLoS ONE, 11,
1–22, <a href="https://doi.org/10.1371/journal.pone.0153673" target="_blank">https://doi.org/10.1371/journal.pone.0153673</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Malone et al.(2017)Malone, Minasny, and McBratney</label><mixed-citation>
Malone, B. P., Minasny, B., and McBratney, A. B.: Using R for Digital Soil
Mapping, Springer International Publishing, <a href="https://doi.org/10.1007/978-3-319-44327-0" target="_blank">https://doi.org/10.1007/978-3-319-44327-0</a>,
2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Marchetti et al.(2012)Marchetti, Piccini, Francaviglia, and
Mabit</label><mixed-citation>
Marchetti, A., Piccini, C., Francaviglia, R., and Mabit, L.: Spatial
Distribution of Soil Organic Matter Using Geostatistics: A Key Indicator to
Assess Soil Degradation Status in Central Italy, Pedosphere, 22, 230–242,
<a href="https://doi.org/10.1016/S1002-0160(12)60010-1" target="_blank">https://doi.org/10.1016/S1002-0160(12)60010-1</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Martin et al.(2011)Martin, Wattenbach, Smith, Meersmans, Jolivet,
Boulonne, and Arrouays</label><mixed-citation>
Martin, M. P., Wattenbach, M., Smith, P., Meersmans, J., Jolivet, C.,
Boulonne, L., and Arrouays, D.: Spatial distribution of soil organic carbon
stocks in France, Biogeosciences, 8, 1053–1065,
<a href="https://doi.org/10.5194/bg-8-1053-2011" target="_blank">https://doi.org/10.5194/bg-8-1053-2011</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>McBratney et al.(2003)McBratney, Santos, and Minasny</label><mixed-citation>
McBratney, A., Santos, M. M., and Minasny, B.: On digital soil mapping,
Geoderma, 117, 3–52,
<a href="https://doi.org/10.1016/S0016-7061(03)00223-4" target="_blank">https://doi.org/10.1016/S0016-7061(03)00223-4</a>, 2003.
</mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Meinshausen(2006)</label><mixed-citation>
Meinshausen, N.: Quantile Regression Forests, J. Mach. Learn. Res., 7,
983–999,
2006.
</mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>Meinshausen(2017)</label><mixed-citation>
Meinshausen, N.: quantregForest: Quantile Regression Forests, available at:
<a href="https://CRAN.R-project.org/package=quantregForest" target="_blank">https://CRAN.R-project.org/package=quantregForest</a> (last access:
16 July 2018), r package
version 1.3-7, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Minasny et al.(2013)Minasny, McBratney, Malone, and
Wheeler</label><mixed-citation>
Minasny, B., McBratney, A. B., Malone, B. P., and Wheeler, I.: Digital Mapping
of Soil Carbon, in: Advances in Agronomy,  1–47, Elsevier,
<a href="https://doi.org/10.1016/b978-0-12-405942-9.00001-3" target="_blank">https://doi.org/10.1016/b978-0-12-405942-9.00001-3</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>Mishra et al.(2009)Mishra, Lal, Slater, Calhoun, Liu, and Van
Meirvenne</label><mixed-citation>
Mishra, U., Lal, R., Slater, B., Calhoun, F., Liu, D., and Van Meirvenne, M.:
Predicting Soil Organic Carbon Stock Using Profile Depth Distribution
Functions and Ordinary Kriging, Soil Sci. Soc. Am. J., 73,
614–621, <a href="https://doi.org/10.2136/sssaj2007.0410" target="_blank">https://doi.org/10.2136/sssaj2007.0410</a>, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>Møller et al.(2018)</label><mixed-citation>
Møller, A. B., Beucher, A., Iversen, B. V., and Greve, M. H.: Predicting
artificially drained areas by means of a selective model ensemble, Geoderma,
320, 30–42, <a href="https://doi.org/10.1016/j.geoderma.2018.01.018" target="_blank">https://doi.org/10.1016/j.geoderma.2018.01.018</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>Mondal et al.(2017)Mondal, Khare, Kundu, Mondal, Mukherjee, and
Mukhopadhyay</label><mixed-citation>
Mondal, A., Khare, D., Kundu, S., Mondal, S., Mukherjee, S., and Mukhopadhyay,
A.: Spatial soil organic carbon (SOC) prediction by regression kriging using
remote sensing data, Egyptian Journal of Remote Sensing and Space Science,
20, 61–70, <a href="https://doi.org/10.1016/j.ejrs.2016.06.004" target="_blank">https://doi.org/10.1016/j.ejrs.2016.06.004</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>Nelson and Sommers(1982)</label><mixed-citation>
Nelson, D. W. and Sommers, L. E.: Total carbon, organic carbon and organic matter, in: Methods of soil analysis. Part 2 Chemical and Microbiological Properties, edited by:  Page,
A. L., Miller, R. H., and  Keeney, D. R., 539–579, 1982.
</mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>Nussbaum et al.(2014)Nussbaum, Papritz, Baltensweiler, and
Walthert</label><mixed-citation>
Nussbaum, M., Papritz, A., Baltensweiler, A., and Walthert, L.: Estimating
soil organic carbon stocks of Swiss forest soils by robust external-drift
kriging, Geosci. Model Dev., 7, 1197–1210,
<a href="https://doi.org/10.5194/gmd-7-1197-2014" target="_blank">https://doi.org/10.5194/gmd-7-1197-2014</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>Nussbaum et al.(2018)Nussbaum, Spiess, Baltensweiler, Grob, Keller,
Greiner, Schaepman, and Papritz</label><mixed-citation>
Nussbaum, M., Spiess, K., Baltensweiler, A., Grob, U., Keller, A., Greiner,
L., Schaepman, M. E., and Papritz, A.: Evaluation of digital soil mapping
approaches with large sets of environmental covariates, SOIL, 4, 1–22,
<a href="https://doi.org/10.5194/soil-4-1-2018" target="_blank">https://doi.org/10.5194/soil-4-1-2018</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>Padarian et al.(2017)Padarian, Minasny, and McBratney</label><mixed-citation>
Padarian, J., Minasny, B., and McBratney, A.: Chile and the Chilean soil grid:
A contribution to GlobalSoilMap, Geoderma Regional, 9, 17–28,
<a href="https://doi.org/10.1016/j.geodrs.2016.12.001" target="_blank">https://doi.org/10.1016/j.geodrs.2016.12.001</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>Peng et al.(2013)Peng, Bing, Guangpo, and Guangcan</label><mixed-citation>
Peng, G., Bing, W., Guangpo, G., and Guangcan, Z.: Spatial distribution of
soil organic carbon and total nitrogen based on GIS and geostatistics in a
small watershed in a hilly area of northern China, PLoS ONE, 8, 1–9,
<a href="https://doi.org/10.1371/journal.pone.0083592" target="_blank">https://doi.org/10.1371/journal.pone.0083592</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>R Core Team(2017)</label><mixed-citation>
R Core Team: R: A Language and Environment for Statistical Computing, R
Foundation for Statistical Computing, Vienna, Austria,
available at: <a href="https://www.R-project.org/" target="_blank">https://www.R-project.org/</a> (last access: 16 July 2018),
2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>Reyes-Rojas et al.(2018)Reyes-Rojas, L. A., and Ventura</label><mixed-citation>
Reyes-Rojas, L. A., Adhikari, K., and Ventura, S. J.: Projecting Soil Organic
Carbon Distribution in Central Chile under Future Climate Scenarios, J.
Environ. Qual., 47, 735–745, <a href="https://doi.org/10.2134/jeq2017.08.0329" target="_blank">https://doi.org/10.2134/jeq2017.08.0329</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>Rossel and Behrens(2010)</label><mixed-citation>
Rossel, R. V. and Behrens, T.: Using data mining to model and interpret soil
diffuse reflectance spectra, Geoderma, 158, 46–54,
<a href="https://doi.org/10.1016/j.geoderma.2009.12.025" target="_blank">https://doi.org/10.1016/j.geoderma.2009.12.025</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>Samuel-Rosa et al.(2015)Samuel-Rosa, Heuvelink, Vasques, and
Anjos</label><mixed-citation>
Samuel-Rosa, A., Heuvelink, G., Vasques, G., and Anjos, L.: Do more detailed
environmental covariates deliver more accurate soil maps?, Geoderma,
243–244, 214–227, <a href="https://doi.org/10.1016/j.geoderma.2014.12.017" target="_blank">https://doi.org/10.1016/j.geoderma.2014.12.017</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>Sanderman et al.(2017)Sanderman, Hengl, and Fiske</label><mixed-citation>
Sanderman, J., Hengl, T., and Fiske, G. J.: Soil carbon debt of 12,000 years
of human land use, P. Natl. Acad. Sci. USA, 114, 9575–9580,
<a href="https://doi.org/10.1073/pnas.1706103114" target="_blank">https://doi.org/10.1073/pnas.1706103114</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>Shangguan et al.(2017)Shangguan, Hengl, Mendes de Jesus, Yuan, and
Dai</label><mixed-citation>
Shangguan, W., Hengl, T., Mendes de Jesus, J., Yuan, H., and Dai, Y.:
Mapping the global depth to bedrock for land surface modeling, J.
Adv. Model. Earth Sy., 9, 65–88, <a href="https://doi.org/10.1002/2016MS000686" target="_blank">https://doi.org/10.1002/2016MS000686</a>,
2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib61"><label>Silverman and Jones(1989)</label><mixed-citation>
Silverman, B. W. and Jones, M. C.: An Important Contribution to Nonparametric
Discriminant Analysis and Density Estimation: Commentary on Fix and
Hodges (1951), Int. Stat. Rev., 57, 233–238, 1989.
</mixed-citation></ref-html>
<ref-html id="bib1.bib62"><label>Sreenivas et al.(2016)Sreenivas, Dadhwal, Kumar, Harsha, Mitran,
Sujatha, Suresh, Fyzee, and Ravisankar</label><mixed-citation>
Sreenivas, K., Dadhwal, V. K., Kumar, S., Harsha, G. S., Mitran, T., Sujatha,
G., Suresh, G. J. R., Fyzee, M. A., and Ravisankar, T.: Digital mapping of
soil organic and inorganic carbon status in India, Geoderma, 269, 160–173,
<a href="https://doi.org/10.1016/j.geoderma.2016.02.002" target="_blank">https://doi.org/10.1016/j.geoderma.2016.02.002</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib63"><label>Taylor(2001)</label><mixed-citation>
Taylor, K. E.: Summarizing multiple aspects of model performance in a single
diagram, J. Geophys. Res.-Atmos., 106,  7183–7192, <a href="https://doi.org/10.1029/2000JD900719" target="_blank">https://doi.org/10.1029/2000JD900719</a>, 2001.
</mixed-citation></ref-html>
<ref-html id="bib1.bib64"><label>Tifafi et al.(2018)</label><mixed-citation>
Tifafi, M., Guenet, B., and Hatté, C.: Large Differences in Global and
Regional Total Soil Carbon Stock Estimates Based on SoilGrids, HWSD, and
NCSCD: Intercomparison and Evaluation Based on Field Data From USA,
England, Wales, and France,  Global Biogeochem. Cy., 32, 42–56, <a href="https://doi.org/10.1002/2017GB005678" target="_blank">https://doi.org/10.1002/2017GB005678</a>,
2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib65"><label>Vargas et al.(2013)Vargas, Paz, and de Jong</label><mixed-citation>
Vargas, R., Paz, F., and de Jong, B.: Quantification of forest degradation and
belowground carbon dynamics: ongoing challenges for monitoring, reporting and
verification activities for REDD+, Carbon Manag., 4, 579–582,
<a href="https://doi.org/10.4155/cmt.13.63" target="_blank">https://doi.org/10.4155/cmt.13.63</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib66"><label>Vargas et al.(2017)Vargas, Alcaraz-Segura, Birdsey, Brunsell,
Cruz-Gaistardo, de Jong, Etchevers, Guevara, Hayes, Johnson, Loescher, Paz,
Ryu, Sanchez-Mejia, and Toledo-Gutierrez</label><mixed-citation>
Vargas, R., Alcaraz-Segura, D., Birdsey, R., Brunsell, N. A., Cruz-Gaistardo,
C. O., de Jong, B., Etchevers, J., Guevara, M., Hayes, D. J., Johnson, K.,
Loescher, H. W., Paz, F., Ryu, Y., Sanchez-Mejia, Z., and Toledo-Gutierrez,
K. P.: Enhancing interoperability to facilitate implementation of REDD+:
case study of Mexico, Carbon Manag., 8, 57–65,
<a href="https://doi.org/10.1080/17583004.2017.1285177" target="_blank">https://doi.org/10.1080/17583004.2017.1285177</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib67"><label>Vasques et al.(2016)Vasques, Coelho, Dart, Oliveira, and
Teixeira</label><mixed-citation>
Vasques, G. M., Coelho, M. A.-C. R., Dart, R. O., Oliveira, R. P., and
Teixeira, W. G.: Mapping soil carbon, particle-size fractions, and water
retention in tropical dry forest in Brazil, Pesqui. Agropecu.
Bras., 51, 1371–1385, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib68"><label>Viscarra Rossel et al.(2014)Viscarra Rossel, Webster, Bui, and
Baldock</label><mixed-citation>
Viscarra Rossel, R. A., Webster, R., Bui, E. N., and Baldock, J. A.:
Baseline map of organic carbon in Australian soil to support national carbon
accounting and monitoring under climate change, Glob. Change Biol., 20,
2953–2970, <a href="https://doi.org/10.1111/gcb.12569" target="_blank">https://doi.org/10.1111/gcb.12569</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib69"><label>Wold(1983)</label><mixed-citation>
Wold, H.: Systems Analysis by Partial Least Squares, Iiasa collaborative paper,
IIASA, Laxenburg, Austria, available at: <a href="http://pure.iiasa.ac.at/2336/" target="_blank">http://pure.iiasa.ac.at/2336/</a> (last access: 16 July 2018),
1983.
</mixed-citation></ref-html>
<ref-html id="bib1.bib70"><label>Yang et al.(2016)Yang, Zhang, Yang, Zhi, Yang, Liu, Zhao, and
Li</label><mixed-citation>
Yang, R.-M., Zhang, G.-L., Yang, F., Zhi, J.-J., Yang, F., Liu, F., Zhao,
Y.-G., and Li, D.-C.: Precise estimation of soil organic carbon stocks in
the northeast Tibetan Plateau, Sci. Rep.-UK, 6, 21842,
<a href="https://doi.org/10.1038/srep21842" target="_blank">https://doi.org/10.1038/srep21842</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib71"><label>Yigini and Panagos(2016)</label><mixed-citation>
Yigini, Y. and Panagos, P.: Assessment of soil organic carbon stocks under
future climate and land cover changes in Europe, Sci. Total
Environ., 557–558, 838–850, <a href="https://doi.org/10.1016/j.scitotenv.2016.03.085" target="_blank">https://doi.org/10.1016/j.scitotenv.2016.03.085</a>, 2016.

</mixed-citation></ref-html>
<ref-html id="bib1.bib72"><label>Yigini et al.(2018)Yigini, Olmedo, Reiter, Baritz, Viatkin, and
Vargas</label><mixed-citation>
Yigini, Y., Olmedo, G. F., Reiter, S., Baritz, R., Viatkin, K., and Vargas,
R. R. (Eds.): Soil Organic Carbon Mapping Cookbook, FAO, Rome, Italy, 2nd
Edn., available at: <a href="http://www.fao.org/documents/card/en/c/I8895EN" target="_blank">http://www.fao.org/documents/card/en/c/I8895EN</a> (last access: 16 July 2018), 2018.
</mixed-citation></ref-html>--></article>
