Authors / CoAuthors
Wilford, J. | Basak, S. | Hassan, R. | Moushall, B. | McCalman, L. | Steinberg, D. | Zhang, F.
Abstract
The geosciences are a data-rich domain where Earth materials and processes are analysed from local to global scales. However, often we only have discrete measurements at specific locations, and a limited understanding of how these features vary across the landscape. Earth system processes are inherently complex, and trans-disciplinary science will likely become increasingly important in finding solutions to future challenges associated with the environment, mineral/petroleum resources and food security. Machine learning is an important approach to synthesise the increasing complexity and sheer volume of Earth science data, and is now widely used in prediction across many scientific disciplines. In this context, we have built a machine learning pipeline, called Uncover-ML, for both supervised and unsupervised learning, prediction and classification. The Uncover-ML pipeline was developed from a partnership between CSIRO and Geoscience Australia, and is largely built around the Python scikit-learn machine learning libraries. In this paper, we briefly describe the architecture and components of Uncover-ML for feature extraction, data scaling, sample selection, predictive mapping, estimating model performance, model optimisation and estimating model uncertainties. Links to download the source code and information on how to implement the algorithms are also provided. <b>Citation:</b> Wilford, J., Basak, S., Hassan, R., Moushall, B., McCalman, L., Steinberg, D. and Zhang, F, 2020. Uncover-ML: a machine learning pipeline for geoscience data analysis. In: Czarnota, K., Roach, I., Abbott, S., Haynes, M., Kositcin, N., Ray, A. and Slatter, E. (eds.) Exploring for the Future: Extended Abstracts, Geoscience Australia, Canberra, 1–4.
Product Type
document
eCat Id
134466
Contact for the resource
Resource provider
Point of contact
Cnr Jerrabomberra Ave and Hindmarsh Dr GPO Box 378
Canberra
ACT
2601
Australia
Point of contact
- Contact instructions
- MEG
Digital Object Identifier
Keywords
- theme.ANZRC Fields of Research.rdf
-
- EARTH SCIENCESINFORMATION AND COMPUTING SCIENCES
- ( Theme )
-
- machine learning
- ( Project )
-
- EFTF
-
- information and computer sciences
-
- data analytics
-
- Exploring for the Future
-
- Toolbox
-
- Published_External
Publication Date
2020-06-22T08:08:04
Creation Date
Security Constraints
Legal Constraints
Status
completed
Purpose
EFTF Extedned Abstract
Maintenance Information
asNeeded
Topic Category
geoscientificInformation
Series Information
Lineage
The uncover-ML code was developed from a partnership between Data61 (CSIRO) and Geoscience Australia. A large proportion of the code draws on the scikit-learn – machine learning in python resource (https://scikit-learn.org/stable/ ).
Parent Information
Extents
[-44.00, -9.00, 112.00, 154.00]
Reference System
Spatial Resolution
Service Information
Associations
Source Information