A scalable framework for soil property mapping tested across a highly diverse tropical data-scarce region

Text (Open Access)
- Published Version
· Available under License Creative Commons Attribution Non-commercial.


de Q. Miranda, R., Nóbrega, R. L. B., Verhoef, A. ORCID: https://orcid.org/0000-0002-9498-6696, da Silva, E. L. R., da Silva, J. F., de Araújo Filho, J. C., de Moura, M. S. B., Alexandre H. C., B., Souza, A. G. S. S., Yang, W., Shao, H., Srinivasan, R., Ziadat, F., Montenegro, S. M. G. L., do S. B. Araújo, M. and Galvíncio, J. D. (2025) A scalable framework for soil property mapping tested across a highly diverse tropical data-scarce region. Soil Advances, 4. 100064. ISSN 2950-2896 doi: 10.1016/j.soilad.2025.100064

Abstract/Summary

Reliable soil property maps are essential for environmental modeling, yet conventional mapping methods remain costly and time-consuming. We developed a machine learning framework that integrates the Soil-Landscape Estimation and Evaluation Program (SLEEP) with gradient boosting to predict soil properties at regional scales and multiple depths. Our approach addresses multicollinearity through a recursive feature selection algorithm. We applied this framework to a tropical region characterized by a ~700-km longitudinal gradient of contrasting topography, climate, and vegetation (~98,000 km²; NE Brazil), where scarce soil physicochemical data limit environmental modeling. We used six topographical, ten climate, and two vegetation covariates, along with data from 223 soil profiles (~1 profile per 440 km²). Training and testing of our framework demonstrated strong spatial performance (r² = 0.79–0.98 and percent bias = -1.39 to 1.14%). Topographic and climatic factors held greater weight than other variables in predicting soil layers, texture, and sum of bases. Moreover, we used our soil parameters combined with multiple pedotransfer functions (PTFs) to derive soil hydraulic properties. Our PTF-derived estimates of hydraulic conductivity were considerably lower than high-resolution global predictions available for our study area due to differences in clay fraction and mineralogy. Therefore, we recommend the use of region-specific PTFs for hydraulic properties based on multi-covariate soil property maps. This cost-effective framework accurately integrates diverse environmental covariates, adapts to varying soil data availability, and scales across spatial resolutions, making it highly transferable to other data-scarce regions.
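
The two performance statistics quoted in the abstract (r² and percent bias) can be illustrated with a minimal sketch. This is not the authors' code, and the abstract does not define either statistic. The sketch assumes that r² is the squared Pearson correlation between observed and predicted values, and that percent bias is 100 × Σ(predicted − observed) / Σ(observed); some studies use the opposite sign. The function names and the sample clay values are hypothetical.

```python
def r_squared(observed, predicted):
    """Squared Pearson correlation (assumed definition of r²)."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov ** 2 / (var_o * var_p)


def percent_bias(observed, predicted):
    """Percent bias; positive means overprediction (assumed sign convention)."""
    total_error = sum(p - o for o, p in zip(observed, predicted))
    return 100.0 * total_error / sum(observed)


# Hypothetical clay contents (%) for five test profiles, not data from the paper.
observed = [12.0, 25.0, 33.0, 41.0, 18.0]
predicted = [14.0, 23.0, 35.0, 39.0, 19.0]

print(f"r2 = {r_squared(observed, predicted):.2f}")
print(f"percent bias = {percent_bias(observed, predicted):.2f}%")
```

Under these assumptions, a percent bias near zero alongside a high r² indicates predictions that track the observations closely without systematic over- or underestimation.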

Item Type Article
URI https://centaur.reading.ac.uk/id/eprint/123785
Identification Number/DOI 10.1016/j.soilad.2025.100064
Refereed Yes
Divisions Science > School of Archaeology, Geography and Environmental Science > Earth Systems Science
Science > School of Archaeology, Geography and Environmental Science > Department of Geography and Environmental Science
Publisher Elsevier