Articles | Volume 6, issue 2
https://doi.org/10.5194/soil-6-565-2020
© Author(s) 2020. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
https://doi.org/10.5194/soil-6-565-2020
© Author(s) 2020. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
The influence of training sample size on the accuracy of deep learning models for the prediction of soil properties with near-infrared spectroscopy data
School of Life and Environmental Sciences and Sydney Institute of
Agriculture, The University of Sydney, NSW, Australia
Budiman Minasny
School of Life and Environmental Sciences and Sydney Institute of
Agriculture, The University of Sydney, NSW, Australia
Wanderson de Sousa Mendes
Department of Soil Science, Luiz de Queiroz College of
Agriculture, University of São Paulo, Piracicaba, São Paulo, 13418-900, Brazil
José Alexandre Melo Demattê
Department of Soil Science, Luiz de Queiroz College of
Agriculture, University of São Paulo, Piracicaba, São Paulo, 13418-900, Brazil
Related authors
Wartini Ng, Budiman Minasny, Alex McBratney, Patrice de Caritat, and John Wilford
Earth Syst. Sci. Data, 15, 2465–2482, https://doi.org/10.5194/essd-15-2465-2023, https://doi.org/10.5194/essd-15-2465-2023, 2023
Short summary
Short summary
With a higher demand for lithium (Li), a better understanding of its concentration and spatial distribution is important to delineate potential anomalous areas. This study uses a framework that combines data from recent geochemical surveys and relevant environmental factors to predict and map Li content across Australia. The map shows high Li concentration around existing mines and other potentially anomalous Li areas. The same mapping principles can potentially be applied to other elements.
Yin-Chung Huang, José Padarian, Budiman Minasny, and Alex B. McBratney
SOIL, 11, 553–563, https://doi.org/10.5194/soil-11-553-2025, https://doi.org/10.5194/soil-11-553-2025, 2025
Short summary
Short summary
Uncertainty quantification plays a crucial role in reporting machine learning models in soil spectroscopy. This study introduces Monte Carlo conformal prediction (MC-CP), a novel method for uncertainty quantification in deep-learning soil spectral models. MC-CP outperformed two established methods, providing the most reliable results. Its efficiency and robustness make it a practical choice for implementing soil spectral models in decision making.
Marliana Tri Widyastuti, José Padarian, Budiman Minasny, Mathew Webb, Muh Taufik, and Darren Kidd
SOIL, 11, 287–307, https://doi.org/10.5194/soil-11-287-2025, https://doi.org/10.5194/soil-11-287-2025, 2025
Short summary
Short summary
This work aims to predict soil water content at a fine spatiotemporal resolution (80 m grids, daily) to support agricultural management in Tasmania. It proves that transfer learning can improve the accuracy of deep learning models to predict multilevel soil moisture. We address the challenge of mapping soil moisture at field-scale resolution and integrate the model into a near-real-time monitoring system.
Marliana Tri Widyastuti, Budiman Minasny, José Padarian, Federico Maggi, Matt Aitkenhead, Amélie Beucher, John Connolly, Dian Fiantis, Darren Kidd, Yuxin Ma, Fraser Macfarlane, Ciaran Robb, Rudiyanto, Budi Indra Setiawan, and Muh Taufik
Earth Syst. Sci. Data Discuss., https://doi.org/10.5194/essd-2024-333, https://doi.org/10.5194/essd-2024-333, 2024
Preprint withdrawn
Short summary
Short summary
PEATGRIDS, the first dataset containing maps of global peat thickness and carbon stock at 1 km resolution. The dataset has been publicly available at Zenodo to support further analyses and modelling of peatlands across the globe. This work employed the random forest machine learning model to provide spatially explicit peat carbon stock at pixel basis.
Tobias Karl David Weber, Lutz Weihermüller, Attila Nemes, Michel Bechtold, Aurore Degré, Efstathios Diamantopoulos, Simone Fatichi, Vilim Filipović, Surya Gupta, Tobias L. Hohenbrink, Daniel R. Hirmas, Conrad Jackisch, Quirijn de Jong van Lier, John Koestel, Peter Lehmann, Toby R. Marthews, Budiman Minasny, Holger Pagel, Martine van der Ploeg, Shahab Aldin Shojaeezadeh, Simon Fiil Svane, Brigitta Szabó, Harry Vereecken, Anne Verhoef, Michael Young, Yijian Zeng, Yonggen Zhang, and Sara Bonetti
Hydrol. Earth Syst. Sci., 28, 3391–3433, https://doi.org/10.5194/hess-28-3391-2024, https://doi.org/10.5194/hess-28-3391-2024, 2024
Short summary
Short summary
Pedotransfer functions (PTFs) are used to predict parameters of models describing the hydraulic properties of soils. The appropriateness of these predictions critically relies on the nature of the datasets for training the PTFs and the physical comprehensiveness of the models. This roadmap paper is addressed to PTF developers and users and critically reflects the utility and future of PTFs. To this end, we present a manifesto aiming at a paradigm shift in PTF research.
Danilo César de Mello, Clara Glória Oliveira Baldi, Cássio Marques Moquedace, Isabelle de Angeli Oliveira, Gustavo Vieira Veloso, Lucas Carvalho Gomes, Márcio Rocha Francelino, Carlos Ernesto Gonçalves Reynaud Schaefer, Elpídio Inácio Fernandes-Filho, Edgar Batista de Medeiros Júnior, Fabio Soares de Oliveira, José João Lelis Leal de Souza Souza, Tiago Ferreira, and José A. M. Demattê
Geosci. Model Dev. Discuss., https://doi.org/10.5194/gmd-2024-2, https://doi.org/10.5194/gmd-2024-2, 2024
Revised manuscript under review for GMD
Short summary
Short summary
The study explores Maritime Antarctica's geology, shaped by periglacial forces, using pioneering gamma-spectrometric and magnetic surveys on igneous rocks due to limited Antarctic surveys. Machine learning predicts radionuclide and magnetic content based on terrain features, linking their distribution to landscape processes, morphometrics, lithology, and pedogeomorphology. Inaccuracies arise due to complex periglacial processes and landscape complexities.
Frisa Irawan Ginting, Rudiyanto Rudiyanto, Fatchurahman, Ramisah Mohd Shah, Norhidayah Che Soh, Sunny Goh Eng Giap, Dian Fiantis, Budi Indra Setiawan, Sam Schiller, Aaron Davitt, and Budiman Minasny
Earth Syst. Sci. Data Discuss., https://doi.org/10.5194/essd-2024-90, https://doi.org/10.5194/essd-2024-90, 2024
Preprint withdrawn
Short summary
Short summary
This study is the first to map rice cropping intensity and the harvested area across Southeast Asia at a spatial resolution of 10 m (SEA-Rice-Ci10). We have developed a geospatial inventory of paddy rice parcels and rice cropping intensity by integrating Sentinel-1 and 2 time-series data in a framework called LUCK-PALM, based on local phenological expert interpretation. According to our best knowledge, it is the finest-resolution and most accurate database of paddy rice in Southeast Asia.
Wartini Ng, Budiman Minasny, Alex McBratney, Patrice de Caritat, and John Wilford
Earth Syst. Sci. Data, 15, 2465–2482, https://doi.org/10.5194/essd-15-2465-2023, https://doi.org/10.5194/essd-15-2465-2023, 2023
Short summary
Short summary
With a higher demand for lithium (Li), a better understanding of its concentration and spatial distribution is important to delineate potential anomalous areas. This study uses a framework that combines data from recent geochemical surveys and relevant environmental factors to predict and map Li content across Australia. The map shows high Li concentration around existing mines and other potentially anomalous Li areas. The same mapping principles can potentially be applied to other elements.
Mercedes Román Dobarco, Alexandre M. J-C. Wadoux, Brendan Malone, Budiman Minasny, Alex B. McBratney, and Ross Searle
Biogeosciences, 20, 1559–1586, https://doi.org/10.5194/bg-20-1559-2023, https://doi.org/10.5194/bg-20-1559-2023, 2023
Short summary
Short summary
Soil organic carbon (SOC) is of a heterogeneous nature and varies in chemistry, stabilisation mechanisms, and persistence in soil. In this study we mapped the stocks of SOC fractions with different characteristics and turnover rates (presumably PyOC >= MAOC > POC) across Australia, combining spectroscopy and digital soil mapping. The SOC stocks (0–30 cm) were estimated as 13 Pg MAOC, 2 Pg POC, and 5 Pg PyOC.
Danilo César de Mello, Tiago Osório Ferreira, Gustavo Vieira Veloso, Marcos Guedes de Lana, Fellipe Alcantara de Oliveira Mello, Luis Augusto Di Loreto Di Raimo, Diego Ribeiro Oquendo Cabrero, José João Lelis Leal de Souza, Elpídio Inácio Fernandes-Filho, Márcio Rocha Francelino, Carlos Ernesto Gonçalves Reynaud Schaefer, and José A. M. Demattê
SOIL Discuss., https://doi.org/10.5194/soil-2022-17, https://doi.org/10.5194/soil-2022-17, 2022
Revised manuscript not accepted
Short summary
Short summary
We proposed a different method to evaluate different intensities of weathering in a heterogeneous area (soils, geology and relief) and small number of samples. We use combined data from three geophysical sensors, clustering and machine learning (nested-leave-one-out-cross-validation) to distinguish weathering intensities and assess the relationship of these variables with weathering, relief, geology, and soil types and attributes. and we obtained satisfactory performances of models evaluation.
Danilo César de Mello, Gustavo Vieira Veloso, Marcos Guedes de Lana, Fellipe Alcantara de Oliveira Mello, Raul Roberto Poppiel, Diego Ribeiro Oquendo Cabrero, Luis Augusto Di Loreto Di Raimo, Carlos Ernesto Gonçalves Reynaud Schaefer, Elpídio Inácio Fernandes Filho, Emilson Pereira Leite, and José Alexandre Melo Demattê
Geosci. Model Dev., 15, 1219–1246, https://doi.org/10.5194/gmd-15-1219-2022, https://doi.org/10.5194/gmd-15-1219-2022, 2022
Short summary
Short summary
We used soil parent material, terrain attributes, and geophysical data from the soil surface to test and compare different and unprecedented geophysical sensor combination, as well as different machine learning algorithms to model and predict several soil attributes. Also, we analyzed the importance of pedoenvironmental variables. The soil attributes were modeled throughout different machine learning algorithms and related to different geophysical sensor combinations.
José Padarian, Budiman Minasny, Alex B. McBratney, and Pete Smith
SOIL Discuss., https://doi.org/10.5194/soil-2021-73, https://doi.org/10.5194/soil-2021-73, 2021
Manuscript not accepted for further review
Short summary
Short summary
Soil organic carbon sequestration is considered an attractive technology to partially mitigate climate change. Here, we show how the SOC storage potential varies globally. The estimated additional SOC storage potential in the topsoil of global croplands (29–67 Pg C) equates to only 2 to 5 years of emissions offsetting and 32 % of agriculture's 92 Pg historical carbon debt. Since SOC is temperature-dependent, this potential is likely to reduce by 18 % by 2040 due to climate change.
Cited articles
Acquarelli, J., van Laarhoven, T., Gerretzen, J., Tran, T. N., Buydens, L.
M. C., and Marchiori, E.: Convolutional neural networks for vibrational
spectroscopic data analysis, Anal. Chim. Acta, 954, 22–31,
https://doi.org/10.1016/j.aca.2016.12.010, 2017.
Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M., Ghemawat, S., Irving, G., Isard, M., Kudlur, M., Levenberg, J., Monga, R., Moore, S., Murray, D. G., Steiner, B., Tucker, P., Vasudevan, V., Warden, P., Wicke, M., Yu, Y., and Zheng, X.: TensorFlow: Large-scale machine learning on heterogeneous systems, Software available from tensorflow.org, available at: https://www.tensorflow.org/ (last access: 1 July 2019), 2015.
Barnes, R. J., Dhanoa, M. S., and Lister, S. J.: Standard Normal Variate
Transformation and De-Trending of near-Infrared Diffuse Reflectance Spectra,
Appl. Spectrosc., 43, 772–777, https://doi.org/10.1366/0003702894202201, 1989.
Bellinaso, H., Demattê, J. A. M., and Romeiro, S. A.: Soil Spectral
Library and Its Use in Soil Classification, Rev. Bras. Cienc. Solo, 34,
861–870, https://doi.org/10.1590/S0100-06832010000300027, 2010.
Bendor, E. and Banin, A.: Near-Infrared Analysis as a Rapid Method to
Simultaneously Evaluate Several Soil Properties, Soil Sci. Soc. Am. J., 59,
364–372, https://doi.org/10.2136/sssaj1995.03615995005900020014x, 1995.
Chang, C. W., Laird, D. A., Mausbach, M. J., and Hurburgh, C. R.:
Near-infrared reflectance spectroscopy-principal components regression
analyses of soil properties, Soil Sci. Soc. Am. J., 65, 480–490, 2001.
Chen, H. Z., Liu, Z. Y., Gu, J., Ai, W., Wen, J. B., and Cai, K.:
Quantitative analysis of soil nutrition based on FT-NIR spectroscopy
integrated with BP neural deep learning, Anal. Methods-UK, 10, 5004–5013,
https://doi.org/10.1039/c8ay01076e, 2018.
Chollet, F.: Keras, available at: https://github.com/fchollet/keras (last access: 1 July 2019), 2015.
Clark, R. N.: Chapter 1: Spectroscopy of Rocks and Minerals, and Principles
of Spectroscopy, in: Manual of Remote Sensing, edited by: Rencz, A. N.,
Remote Sensing for the Earth Sciences, John Wiley and Sons, New York, 3–58,
1999.
Cui, C. H. and Fearn, T.: Modern practical convolutional neural networks
for multivariate regression: Applications to NIR calibration, Chemometr.
Intell. Lab., 182, 9–20, https://doi.org/10.1016/j.chemolab.2018.07.008, 2018.
Dalal, R. C. and Henry, R. J.: Simultaneous Determination of Moisture,
Organic-Carbon, and Total Nitrogen by near-Infrared Reflectance
Spectrophotometry, Soil Sci. Soc. Am. J., 50, 120–123, https://doi.org/10.2136/sssaj1986.03615995005000010023x, 1986.
Dangal, S., Sanderman, J., Wills, S., and Ramirez-Lopez, L.: Accurate and
Precise Prediction of Soil Properties from a Large Mid-Infrared Spectral
Library, Soil Systems, 3, 11, https://doi.org/10.3390/soilsystems3010011, 2019.
Demattê, J. A. M.: Characterization and discrimination of soils by their
reflected electromagnetic energy, Pesqui. Agropecu. Bras., 37, 1445–1458, https://doi.org/10.1590/S0100-204x2002001000013, 2002.
Demattê, J. A. M., Bellinaso, H., Araujo, S. R., Rizzo, R., and Souza,
A. B.: Spectral regionalization of tropical soils in the estimation of soil
attributes, Rev. Cienc. Agron., 47, 589–598, 2016.
Donagema, G. K., de Campos, D. B., Calderano, S. B., Teixeira, W., and
Viana, J. M.: Manual de métodos de análise de solo, Embrapa
Solos-Documentos (INFOTECA-E), 2011.
Fang, Q., Hanlie, H., Zhao, L., Kukolich, S., Yin, K., and Wang, C.: Visible and near-infrared reflectance spectroscopy for investigating soil mineralogy, 2018, https://doi.org/10.1155/2018/3168974, 2018.
Figueroa, R. L., Zeng-Treitler, Q., Kandula, S., and Ngo, L. H.: Predicting
sample size required for classification performance, BMC Med. Inform.
Decis., 12, 8, https://doi.org/10.1186/1472-6947-12-8, 2012.
He, K. M., Zhang, X. Y., Ren, S. Q., and Sun, J.: Deep Residual Learning for
Image Recognition, 2016 Ieee Conference on Computer Vision and Pattern
Recognition (Cvpr), IEEE, 770–778, https://doi.org/10.1109/Cvpr.2016.90, 2016.
Islam, K., Singh, B., and McBratney, A.: Simultaneous estimation of several
soil properties by ultra-violet, visible, and near-infrared reflectance
spectroscopy, Aust. J. Soil Res., 41, 1101–1114, https://doi.org/10.1071/Sr02137, 2003.
Kingma, D. P. and Ba, J.: Adam: A method for stochastic optimization, arXiv
[preprint], arXiv:1412.6980, 2014.
Krizhevsky, A., Sutskever, I., and Hinton, G. E.: ImageNet classification
with deep convolutional neural networks, Proceedings of the 25th
International Conference on Neural Information Processing Systems – Volume
1, Lake Tahoe, Nevada, New York: Curran Associates, Inc., 2012.
Kuang, B. and Mouazen, A. M.: Influence of the number of samples on
prediction error of visible and near infrared spectroscopy of selected soil
properties at the farm scale, Eur. J. Soil Sci., 63, 421–429,
https://doi.org/10.1111/j.1365-2389.2012.01456.x, 2012.
Kuhn, M. and Quinlan, R.: Cubist: Rule- And Instance-Based Regression Modeling, R package version 0.2.2, available at: https://CRAN.R-project.org/package=Cubist (last access: 18 July 2019), 2018.
LeCun, Y., Bengio, Y., and Hinton, G.: Deep learning, Nature, 521, 436–444,
https://doi.org/10.1038/nature14539, 2015.
Lee, K. S., Lee, D. H., Sudduth, K. A., Chung, S. O., Kitchen, N. R., and
Drummond, S. T.: Wavelength Identification and Diffuse Reflectance
Estimation for Surface and Profile Soil Properties, T. ASABE, 52, 683–695,
2009.
Liu, L., Ji, M., and Buchroithner, M.: Transfer Learning for Soil
Spectroscopy Based on Convolutional Neural Networks and Its Application in
Soil Clay Content Mapping Using Hyperspectral Imagery, Sensors-Basel, 18, https://doi.org/10.3390/s18093169, 2018.
Mahapatra, S.: Why Deep Learning over Traditional Machine Learning?, available at: https://towardsdatascience.com/why-deep-learning-is-needed-over-traditional-machine-learning-1b6a99177063 (last access: 1 July 2019), 2018.
Mevik, B.-H., Wehrens, R., and Liland, K. H.: pls: Partial Least Squares and Principal Component Regression. R package version 2.7-0, available at: https://CRAN.R-project.org/package=pls (last access: 18 July 2019), 2018.
Ng, W., Minasny, B., Malone, B., and Filippi, P.: In search of an optimum
sampling algorithm for prediction of soil properties from infrared spectra,
PeerJ, 6, e5722, https://doi.org/10.7717/peerj.5722, 2018.
Ng, W., Minasny, B., Montazerolghaem, M., Padarian, J., Ferguson, R.,
Bailey, S., and McBratney, A. B.: Convolutional neural network for
simultaneous prediction of several soil properties using
visible/near-infrared, mid-infrared, and their combined spectra, Geoderma,
352, 251–267, https://doi.org/10.1016/j.geoderma.2019.06.016,
2019.
Padarian, J., Minasny, B., and McBratney, A. B.: Using deep learning to
predict soil properties from regional spectral data, Geoderma Regional, 16,
e00198, https://doi.org/10.1016/j.geodrs.2018.e00198, 2019.
Pinheiro, E. F. M., Ceddia, M. B., Clingensmith, C. M., Grunwald, S., and
Vasques, G. M.: Prediction of Soil Physical and Chemical Properties by
Visible and Near-Infrared Diffuse Reflectance Spectroscopy in the Central
Amazon, Remote Sens-Basel, 9, 293, https://doi.org/10.3390/rs9040293, 2017.
Python Software Foundation: Python Language Reference, Python Software Foundation, available at: https://www.python.org (last access: 1 July 2019), 2017.
Quinlan, J. R.: C4.5: Programs for Machine Learning, Morgan Kaufmann
Publishers Inc., San Mateo, California, 1993.
R Core Team: R: A language and environment for statistical computing, available at: https://www.R-project.org/ (last access: 18 July 2019), 2019.
Ramirez-Lopez, L., Schmidt, K., Behrens, T., van Wesemael, B., Dematte, J.
A. M., and Scholten, T.: Sampling optimal calibration sets in soil infrared
spectroscopy, Geoderma, 226, 140–150, https://doi.org/10.1016/j.geoderma.2014.02.002, 2014.
Rinnan, A., van den Berg, F., and Engelsen, S. B.: Review of the most common
pre-processing techniques for near-infrared spectra, Trac-Trend Anal. Chem.,
28, 1201–1222, https://doi.org/10.1016/j.trac.2009.07.007,
2009.
Savitzky, A. and Golay, M. J. E.: Smoothing and Differentiation of Data by
Simplified Least Squares Procedures, Anal. Chem., 36, 1627–1639,
https://doi.org/10.1021/ac60214a047, 1964.
Shepherd, K. D. and Walsh, M. G.: Development of Reflectance Spectral
Libraries for Characterization of Soil Properties, Soil Sci. Soc. Am. J., 66,
988–998, https://doi.org/10.2136/sssaj2002.9880, 2002.
Shibusawa, S., Imade Anom, S. W., Sato, S., Sasao, A., and Hirako, S.: Soil
mapping using the real-time soil spectrophotometer, in: ECPA, Third European Conference on Precision
Agriculture, edited by: Grenier, G. and
Blackmore, S., Agro Montpellier, 1. Montpellier, France, 497–508,
2001.
Simonyan, K. and Zisserman, A.: Very Deep Convolutional Networks for
Large-Scale Image Recognition, CoRR, abs/1409.1556, 2014.
Stenberg, B., Viscarra Rossel, R. A., Mouazen, A. M., and Wetterlind, J.:
Chapter Five – Visible and Near Infrared Spectroscopy in Soil Science, in:
Adv Agron, edited by: Sparks, D. L., Academic Press, Burlington, MA, 163–215, 2010.
Szegedy, C., Liu, W., Jia, Y. Q., Sermanet, P., Reed, S., Anguelov, D.,
Erhan, D., Vanhoucke, V., and Rabinovich, A.: Going Deeper with
Convolutions, Proc. Cvpr Ieee, Piscataway: IEEE, 1–9, 2015.
Terra, F. S., Dematte, J. A. M., and Rossel, R. A. V.: Proximal spectral
sensing in pedological assessments: vis-NIR spectra for soil classification
based on weathering and pedogenesis, Geoderma, 318, 123–136,
https://doi.org/10.1016/j.geoderma.2017.10.053, 2018.
Tsakiridis, N. L., Keramaris, K. D., Theocharis, J. B., and Zalidis, G. C.:
Simultaneous prediction of soil properties from VNIR-SWIR spectra using a
localized multi-channel 1-D convolutional neural network, Geoderma, 367,
114208, https://doi.org/10.1016/j.geoderma.2020.114208, 2020.
Viscarra Rossel, R. A. and Behrens, T.: Using data mining to model and
interpret soil diffuse reflectance spectra, Geoderma, 158, 46–54,
https://doi.org/10.1016/j.geoderma.2009.12.025, 2010.
Viscarra Rossel, R. A., Cattle, S. R., Ortega, A., and Fouad, Y.: In situ
measurements of soil colour, mineral composition and clay content by
vis–NIR spectroscopy, Geoderma, 150, 253–266, https://doi.org/10.1016/j.geoderma.2009.01.025, 2009.
Walkley, A. and Black, I. A.: An examination of the Degtjareff method for
determining soil organic matter, and a proposed modification of the chromic
acid titration method, Soil Sci., 37, 29–38, https://doi.org/10.1097/00010694-193401000-00003, 1934.
Wold, S., Martens, H., and Wold, H.: The Multivariate Calibration-Problem in
Chemistry Solved by the Pls Method, Lect. Notes Math., 973, 286–293, 1983.
Yang, J., Xu, J. F., Zhang, X. L., Wu, C. Y., Lin, T., and Ying, Y. B.: Deep
learning for vibrational spectral analysis: Recent progress and a practical
guide, Anal. Chim. Acta, 1081, 6–17, https://doi.org/10.1016/j.aca.2019.06.012, 2019.
Zhang, X. L., Xu, J. F., Yang, J., Chen, L., Zhou, H. B., Liu, X. J., Li, H.
F., Lin, T., and Ying, Y. B.: Understanding the learning mechanism of
convolutional neural networks in spectral analysis, Anal. Chim. Acta, 1119,
41–51, https://doi.org/10.1016/j.aca.2020.03.055, 2020.
Zhu, X., Vondrick, C., Fowlkes, C. C., and Ramanan, D.: Do We Need More
Training Data?, International Journal of Computer Vision, 119, 76–92,
https://doi.org/10.1007/s11263-015-0812-2, 2016.
Short summary
The number of samples utilised to create predictive models affected model performance. This research compares the number of samples needed by a deep learning model to outperform the traditional machine learning models using visible near-infrared spectroscopy data for soil properties predictions. The deep learning model was found to outperform machine learning models when the sample size was above 2000.
The number of samples utilised to create predictive models affected model performance. This...