Improving Subsurface Characterisation with ‘Big Data’ Mining and Machine Learning

Rachel Brackenridge; Vasily Demyanov; Oleg Vashutin; Ruslan Nigmatullin

doi:10.3390/en15031070

Rachel Brackenridge^* (Corresponding Author), Vasily Demyanov, Oleg Vashutin, Ruslan Nigmatullin

^*Corresponding author for this work

Geology and Geophysics

Research output: Contribution to journal › Article › peer-review

1 Citation (Scopus)

7 Downloads (Pure)

Abstract

Large databases of legacy hydrocarbon reservoir and well data provide an opportunity to use modern data mining techniques to improve our understanding of the subsurface in the presence of uncertainty and improve predictability of reservoir properties. A data mining approach provides a way to screen dependencies in reservoir and fluid data and enable subsurface specialists to estimate absent properties in partial or incomplete datasets. This allows for uncertainty to be managed and reduced. An improvement in reservoir characterisation using machine learning results from the capacity of machine learning methods to detect and model hidden dependencies in large multivariate datasets with noisy and missing data. This study presents a workflow applied to a large basin-scale reservoir characterization database. The study aims to understand the dependencies between reservoir attributes in order to allow for predictions to be made to improve the data coverage. The machine learning workflow comprises the following steps: (i) exploratory data analysis; (ii) detection of outliers and data partitioning into groups showing similar trends using clustering; (iii) identification of dependencies within reservoir data in multivariate feature space with self-organising maps; and (iv) feature selection using supervised learning to identify relevant properties to use for predictions where data are absent. This workflow provides an opportunity to reduce the cost and increase accuracy of hydrocarbon exploration and production in mature basins.
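As a rough illustration of steps (ii) and (iv) of the workflow described in the abstract, the sketch below screens outliers and then selects features for estimating an absent property. It is not the authors' implementation: the acknowledgments name Orange Data Mining and ML Office as the tools used, whereas this sketch swaps in a Tukey IQR fence for outlier screening and a leave-one-out k-nearest-neighbour wrapper for feature selection. The records are synthetic, and every name in it (`RECORDS`, `depth_km`, `noise`, `drop_iqr_outliers`, `knn_predict`, `loo_error`, `select_features`) is hypothetical.

```python
import itertools
import math
import statistics

# Synthetic, hypothetical records (NOT data from the study).
# Porosity falls with depth; "noise" is unrelated to porosity by construction.
RECORDS = [
    {"depth_km": 1.0, "noise": 0.9, "porosity": 0.30},
    {"depth_km": 1.5, "noise": 0.1, "porosity": 0.27},
    {"depth_km": 2.0, "noise": 0.5, "porosity": 0.24},
    {"depth_km": 2.5, "noise": 0.8, "porosity": 0.21},
    {"depth_km": 3.0, "noise": 0.2, "porosity": 0.18},
    {"depth_km": 3.5, "noise": 0.7, "porosity": 0.15},
    {"depth_km": 4.0, "noise": 0.3, "porosity": 0.12},
    {"depth_km": 2.2, "noise": 0.4, "porosity": 0.95},  # implausible value
]
TARGET = "porosity"
FEATURES = ("depth_km", "noise")


def drop_iqr_outliers(records, key, whisker=1.5):
    """Keep records whose `key` value lies within the Tukey IQR fences."""
    q1, _, q3 = statistics.quantiles([r[key] for r in records], n=4)
    low, high = q1 - whisker * (q3 - q1), q3 + whisker * (q3 - q1)
    return [r for r in records if low <= r[key] <= high]


def knn_predict(train, query, features, k=2):
    """Average the target over the k training records nearest to `query`."""
    def distance(record):
        return math.dist([record[f] for f in features],
                         [query[f] for f in features])
    nearest = sorted(train, key=distance)[:k]
    return statistics.fmean(r[TARGET] for r in nearest)


def loo_error(records, features, k=2):
    """Leave-one-out mean absolute error of the k-NN predictor."""
    errors = []
    for i, held_out in enumerate(records):
        train = records[:i] + records[i + 1:]
        prediction = knn_predict(train, held_out, features, k)
        errors.append(abs(prediction - held_out[TARGET]))
    return statistics.fmean(errors)


def select_features(records, candidates=FEATURES):
    """Pick the subset with the lowest error; prefer fewer features on ties."""
    subsets = [s for n in range(1, len(candidates) + 1)
               for s in itertools.combinations(candidates, n)]
    return min(subsets, key=lambda s: (round(loo_error(records, s), 6), len(s)))


clean = drop_iqr_outliers(RECORDS, TARGET)   # step (ii), outlier screening only
best = select_features(clean)                # step (iv), wrapper-style selection
# Estimate the absent porosity of a record that has only predictor values.
estimate = knn_predict(clean, {"depth_km": 2.75, "noise": 0.6}, best)
print(len(clean), best, round(estimate, 3))
```

The features are left unscaled here for brevity; with real reservoir attributes on different units, rescaling before computing distances would normally be needed. Clustering and self-organising maps (the rest of step (ii) and step (iii)) are not covered by this sketch.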

Original language	English
Article number	1070
Number of pages	23
Journal	Energies
Volume	15
Issue number	3
Early online date	31 Jan 2022
DOIs	https://doi.org/10.3390/en15031070
Publication status	Published - 31 Jan 2022

Bibliographical note

Funding: This research was supported by Wood Mackenzie through funding of a Postdoctoral Research Associate position at Heriot-Watt University, and through access to data from two basins.
Acknowledgments: This work was supported by Wood Mackenzie through funding research collaboration with Heriot-Watt University. All the data were anonymised and supplied by Wood Mackenzie and authors are thankful for the opportunity to publish the outcomes of this research. Authors also thank Mikhail Kanevski of University of Lausanne for the peer exchange on feature selection and the opportunities opened during his course on Machine Learning hands-on applications. Authors acknowledge the use of Orange Data Mining [27] and ML Office for SOM application [30]. We thank Susan Agar, who reviewed the paper most comprehensively and helped improve it along with two anonymous reviewers.

Data Availability Statement

The data used in this study are held by Wood Mackenzie.

Keywords

reservoir
subsurface characterisation
big data
unsupervised learning
supervised learning
multivariant analysis
machine learning
hydrocarbon exploration

Access to Document

10.3390/en15031070 (Licence: CC BY)

Brackenridge_etal_E_Improving_Subfarce_Characterisation_VoR
This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Final published version, 12 MB (Licence: CC BY)

Cite this

@article{0f8480115b114ea0b487996629afbb57,

title = "Improving Subsurface Characterisation with {\textquoteleft}Big Data{\textquoteright} Mining and Machine Learning",

abstract = "Large databases of legacy hydrocarbon reservoir and well data provide an opportunity to use modern data mining techniques to improve our understanding of the subsurface in the presence of uncertainty and improve predictability of reservoir properties. A data mining approach provides a way to screen dependencies in reservoir and fluid data and enable subsurface specialists to estimate absent properties in partial or incomplete datasets. This allows for uncertainty to be managed and reduced. An improvement in reservoir characterisation using machine learning results from the capacity of machine learning methods to detect and model hidden dependencies in large multivariate datasets with noisy and missing data. This study presents a workflow applied to a large basin-scale reservoir characterization database. The study aims to understand the dependencies between reservoir attributes in order to allow for predictions to be made to improve the data coverage. The machine learning workflow comprises the following steps: (i) exploratory data analysis; (ii) detection of outliers and data partitioning into groups showing similar trends using clustering; (iii) identification of dependencies within reservoir data in multivariate feature space with self-organising maps; and (iv) feature selection using supervised learning to identify relevant properties to use for predictions where data are absent. This workflow provides an opportunity to reduce the cost and increase accuracy of hydrocarbon exploration and production in mature basins.",

keywords = "reservoir, subsurface characterisation, big data, unsupervised learning, supervised learning, multivariant analysis, machine learning, hydrocarbon exploration",

author = "Rachel Brackenridge and Vasily Demyanov and Oleg Vashutin and Ruslan Nigmatullin",

note = "Funding: This research was supported by Wood Mackenzie through funding of a Postdoctoral Research Associate position at Heriot-Watt University, and through access to data from two basins. Acknowledgments: This work was supported by Wood Mackenzie through funding research collaboration with Heriot-Watt University. All the data were anonymised and supplied by Wood Mackenzie and authors are thankful for the opportunity to publish the outcomes of this research. Authors also thank Mikhail Kanevski of University of Lausanne for the peer exchange on feature selection and the opportunities opened during his course on Machine Learning hands-on applications. Authors acknowledge the use of Orange Data Mining [27] and ML Office for SOM application [30]. We thank Susan Agar, who reviewed the paper most comprehensively and helped improve it along with two anonymous reviewers.",

year = "2022",

month = jan,

day = "31",

doi = "10.3390/en15031070",

language = "English",

volume = "15",

journal = "Energies",

issn = "1996-1073",

publisher = "MDPI",

number = "3",

}

TY - JOUR

T1 - Improving Subsurface Characterisation with ‘Big Data’ Mining and Machine Learning

AU - Brackenridge, Rachel

AU - Demyanov, Vasily

AU - Vashutin, Oleg

AU - Nigmatullin, Ruslan

N1 - Funding: This research was supported by Wood Mackenzie through funding of a Postdoctoral Research Associate position at Heriot-Watt University, and through access to data from two basins. Acknowledgments: This work was supported by Wood Mackenzie through funding research collaboration with Heriot-Watt University. All the data were anonymised and supplied by Wood Mackenzie and authors are thankful for the opportunity to publish the outcomes of this research. Authors also thank Mikhail Kanevski of University of Lausanne for the peer exchange on feature selection and the opportunities opened during his course on Machine Learning hands-on applications. Authors acknowledge the use of Orange Data Mining [27] and ML Office for SOM application [30]. We thank Susan Agar, who reviewed the paper most comprehensively and helped improve it along with two anonymous reviewers.

PY - 2022/1/31

Y1 - 2022/1/31

N2 - Large databases of legacy hydrocarbon reservoir and well data provide an opportunity to use modern data mining techniques to improve our understanding of the subsurface in the presence of uncertainty and improve predictability of reservoir properties. A data mining approach provides a way to screen dependencies in reservoir and fluid data and enable subsurface specialists to estimate absent properties in partial or incomplete datasets. This allows for uncertainty to be managed and reduced. An improvement in reservoir characterisation using machine learning results from the capacity of machine learning methods to detect and model hidden dependencies in large multivariate datasets with noisy and missing data. This study presents a workflow applied to a large basin-scale reservoir characterization database. The study aims to understand the dependencies between reservoir attributes in order to allow for predictions to be made to improve the data coverage. The machine learning workflow comprises the following steps: (i) exploratory data analysis; (ii) detection of outliers and data partitioning into groups showing similar trends using clustering; (iii) identification of dependencies within reservoir data in multivariate feature space with self-organising maps; and (iv) feature selection using supervised learning to identify relevant properties to use for predictions where data are absent. This workflow provides an opportunity to reduce the cost and increase accuracy of hydrocarbon exploration and production in mature basins.

AB - Large databases of legacy hydrocarbon reservoir and well data provide an opportunity to use modern data mining techniques to improve our understanding of the subsurface in the presence of uncertainty and improve predictability of reservoir properties. A data mining approach provides a way to screen dependencies in reservoir and fluid data and enable subsurface specialists to estimate absent properties in partial or incomplete datasets. This allows for uncertainty to be managed and reduced. An improvement in reservoir characterisation using machine learning results from the capacity of machine learning methods to detect and model hidden dependencies in large multivariate datasets with noisy and missing data. This study presents a workflow applied to a large basin-scale reservoir characterization database. The study aims to understand the dependencies between reservoir attributes in order to allow for predictions to be made to improve the data coverage. The machine learning workflow comprises the following steps: (i) exploratory data analysis; (ii) detection of outliers and data partitioning into groups showing similar trends using clustering; (iii) identification of dependencies within reservoir data in multivariate feature space with self-organising maps; and (iv) feature selection using supervised learning to identify relevant properties to use for predictions where data are absent. This workflow provides an opportunity to reduce the cost and increase accuracy of hydrocarbon exploration and production in mature basins.

KW - reservoir

KW - subsurface characterisation

KW - big data

KW - unsupervised learning

KW - supervised learning

KW - multivariant analysis

KW - machine learning

KW - hydrocarbon exploration

UR - http://www.scopus.com/inward/record.url?scp=85124023265&partnerID=8YFLogxK

U2 - 10.3390/en15031070

DO - 10.3390/en15031070

M3 - Article

SN - 1996-1073

VL - 15

JO - Energies

JF - Energies

IS - 3

M1 - 1070

ER -
