Machine-learning algorithm for estimating oil-recovery factor using a combination of engineering and stratigraphic dependent parameters

Kachalla Aliyuda; John Howell

doi:10.1190/INT-2018-0211.1

Machine-learning algorithm for estimating oil-recovery factor using a combination of engineering and stratigraphic dependent parameters

Kachalla Aliyuda^* (Corresponding Author), John Howell

^*Corresponding author for this work

Geology and Geophysics

Research output: Contribution to journal › Article › peer-review

17 Citations (Scopus)

Abstract

The methods used to estimate recovery factor change through the life cycle of a field. During appraisal, prior to development when there are no production data, we typically rely on analog fields and empirical methods. Given the absence of a perfect analog, these methods are typically associated with a wide range of uncertainty. During plateau, recovery factors are typically associated with simulation and dynamic modeling, whereas in later field life, once the field drops off the plateau, a decline curve analysis is also used. The use of different methods during different stages of the field life leads to uncertainty and potential inconsistencies in recovery estimates. A wide range of interacting, partially related, reservoir and production variables controls the production and recovery factor. Machine learning allows more complex multivariate analysis that can be used to investigate the roles of these variables using a training data set and then to ultimately predict future performance in fields. To investigate this approach, we used a data set consisting of producing reservoirs all of which are at plateau or in decline to train a series of machine-learning algorithms that can potentially predict the recovery factor with minimal percentage error. The database for this study consists of categorical and numerical properties for 93 reservoirs from the Norwegian Continental Shelf. Of these, 75 are from the Norwegian Sea, the Norwegian North Sea, and the Barents Sea, whereas the remaining 18 reservoirs are from the Viking Graben in the UK sector of the North Sea. The data set was divided into training and testing sets: The training set comprised approximately 80% of the total data, and the remaining 20% was the testing set. Linear regression models and a support vector machine (SVM) models were trained with all parameters in the data set (30 parameters); then with the 16 most influential parameters in the data set, the performance of these models was compared from results of fivefold crossvalidation. SVM training using a combination of 16 geologic/engineering parameters models with Gaussian kernel function has a root-mean-square error of 0.12, mean square error of 0.01, and R-squared of 0.76. This model was tested on 18 reservoirs from the testing set; the test results are very similar to crossvalidation results during models training phase, suggesting that this method can potentially be used to predict the future recovery factor.

Original language	English
Pages (from-to)	SE151-SE159
Number of pages	10
Journal	Interpretation
Volume	7
Issue number	3
Early online date	16 Jul 2019
DOIs	https://doi.org/10.1190/INT-2018-0211.1
Publication status	Published - 1 Aug 2019

Bibliographical note

Funding Information:
This work has been supported by SAFARI III and the Petroleum Technology Development Fund, we are grateful to them for providing funds for this project. We greatly appreciate all of the anonymous reviewers, whose comments have greatly improved the manuscript.

Data Availability Statement

No data availiability statement.

Keywords

algorithm
artificial intelligence
stratigraphy

Access to Document

10.1190/INT-2018-0211.1Licence: Unspecified

Cite this

@article{6dbe825f4e434a5889241115c5eefd0d,

title = "Machine-learning algorithm for estimating oil-recovery factor using a combination of engineering and stratigraphic dependent parameters",

abstract = "The methods used to estimate recovery factor change through the life cycle of a field. During appraisal, prior to development when there are no production data, we typically rely on analog fields and empirical methods. Given the absence of a perfect analog, these methods are typically associated with a wide range of uncertainty. During plateau, recovery factors are typically associated with simulation and dynamic modeling, whereas in later field life, once the field drops off the plateau, a decline curve analysis is also used. The use of different methods during different stages of the field life leads to uncertainty and potential inconsistencies in recovery estimates. A wide range of interacting, partially related, reservoir and production variables controls the production and recovery factor. Machine learning allows more complex multivariate analysis that can be used to investigate the roles of these variables using a training data set and then to ultimately predict future performance in fields. To investigate this approach, we used a data set consisting of producing reservoirs all of which are at plateau or in decline to train a series of machine-learning algorithms that can potentially predict the recovery factor with minimal percentage error. The database for this study consists of categorical and numerical properties for 93 reservoirs from the Norwegian Continental Shelf. Of these, 75 are from the Norwegian Sea, the Norwegian North Sea, and the Barents Sea, whereas the remaining 18 reservoirs are from the Viking Graben in the UK sector of the North Sea. The data set was divided into training and testing sets: The training set comprised approximately 80% of the total data, and the remaining 20% was the testing set. Linear regression models and a support vector machine (SVM) models were trained with all parameters in the data set (30 parameters); then with the 16 most influential parameters in the data set, the performance of these models was compared from results of fivefold crossvalidation. SVM training using a combination of 16 geologic/engineering parameters models with Gaussian kernel function has a root-mean-square error of 0.12, mean square error of 0.01, and R-squared of 0.76. This model was tested on 18 reservoirs from the testing set; the test results are very similar to crossvalidation results during models training phase, suggesting that this method can potentially be used to predict the future recovery factor. ",

keywords = "algorithm, artificial intelligence, stratigraphy",

author = "Kachalla Aliyuda and John Howell",

note = "Funding Information: This work has been supported by SAFARI III and the Petroleum Technology Development Fund, we are grateful to them for providing funds for this project. We greatly appreciate all of the anonymous reviewers, whose comments have greatly improved the manuscript. ",

year = "2019",

month = aug,

day = "1",

doi = "10.1190/INT-2018-0211.1",

language = "English",

volume = "7",

pages = "SE151--SE159",

journal = "Interpretation",

issn = "2324-8858",

publisher = "Society of Exploration Geophysicists",

number = "3",

}

TY - JOUR

T1 - Machine-learning algorithm for estimating oil-recovery factor using a combination of engineering and stratigraphic dependent parameters

AU - Aliyuda, Kachalla

AU - Howell, John

N1 - Funding Information: This work has been supported by SAFARI III and the Petroleum Technology Development Fund, we are grateful to them for providing funds for this project. We greatly appreciate all of the anonymous reviewers, whose comments have greatly improved the manuscript.

PY - 2019/8/1

Y1 - 2019/8/1

N2 - The methods used to estimate recovery factor change through the life cycle of a field. During appraisal, prior to development when there are no production data, we typically rely on analog fields and empirical methods. Given the absence of a perfect analog, these methods are typically associated with a wide range of uncertainty. During plateau, recovery factors are typically associated with simulation and dynamic modeling, whereas in later field life, once the field drops off the plateau, a decline curve analysis is also used. The use of different methods during different stages of the field life leads to uncertainty and potential inconsistencies in recovery estimates. A wide range of interacting, partially related, reservoir and production variables controls the production and recovery factor. Machine learning allows more complex multivariate analysis that can be used to investigate the roles of these variables using a training data set and then to ultimately predict future performance in fields. To investigate this approach, we used a data set consisting of producing reservoirs all of which are at plateau or in decline to train a series of machine-learning algorithms that can potentially predict the recovery factor with minimal percentage error. The database for this study consists of categorical and numerical properties for 93 reservoirs from the Norwegian Continental Shelf. Of these, 75 are from the Norwegian Sea, the Norwegian North Sea, and the Barents Sea, whereas the remaining 18 reservoirs are from the Viking Graben in the UK sector of the North Sea. The data set was divided into training and testing sets: The training set comprised approximately 80% of the total data, and the remaining 20% was the testing set. Linear regression models and a support vector machine (SVM) models were trained with all parameters in the data set (30 parameters); then with the 16 most influential parameters in the data set, the performance of these models was compared from results of fivefold crossvalidation. SVM training using a combination of 16 geologic/engineering parameters models with Gaussian kernel function has a root-mean-square error of 0.12, mean square error of 0.01, and R-squared of 0.76. This model was tested on 18 reservoirs from the testing set; the test results are very similar to crossvalidation results during models training phase, suggesting that this method can potentially be used to predict the future recovery factor.

AB - The methods used to estimate recovery factor change through the life cycle of a field. During appraisal, prior to development when there are no production data, we typically rely on analog fields and empirical methods. Given the absence of a perfect analog, these methods are typically associated with a wide range of uncertainty. During plateau, recovery factors are typically associated with simulation and dynamic modeling, whereas in later field life, once the field drops off the plateau, a decline curve analysis is also used. The use of different methods during different stages of the field life leads to uncertainty and potential inconsistencies in recovery estimates. A wide range of interacting, partially related, reservoir and production variables controls the production and recovery factor. Machine learning allows more complex multivariate analysis that can be used to investigate the roles of these variables using a training data set and then to ultimately predict future performance in fields. To investigate this approach, we used a data set consisting of producing reservoirs all of which are at plateau or in decline to train a series of machine-learning algorithms that can potentially predict the recovery factor with minimal percentage error. The database for this study consists of categorical and numerical properties for 93 reservoirs from the Norwegian Continental Shelf. Of these, 75 are from the Norwegian Sea, the Norwegian North Sea, and the Barents Sea, whereas the remaining 18 reservoirs are from the Viking Graben in the UK sector of the North Sea. The data set was divided into training and testing sets: The training set comprised approximately 80% of the total data, and the remaining 20% was the testing set. Linear regression models and a support vector machine (SVM) models were trained with all parameters in the data set (30 parameters); then with the 16 most influential parameters in the data set, the performance of these models was compared from results of fivefold crossvalidation. SVM training using a combination of 16 geologic/engineering parameters models with Gaussian kernel function has a root-mean-square error of 0.12, mean square error of 0.01, and R-squared of 0.76. This model was tested on 18 reservoirs from the testing set; the test results are very similar to crossvalidation results during models training phase, suggesting that this method can potentially be used to predict the future recovery factor.

KW - algorithm

KW - artificial intelligence

KW - stratigraphy

UR - http://www.scopus.com/inward/record.url?scp=85071136228&partnerID=8YFLogxK

U2 - 10.1190/INT-2018-0211.1

DO - 10.1190/INT-2018-0211.1

M3 - Article

AN - SCOPUS:85071136228

SN - 2324-8858

VL - 7

SP - SE151-SE159

JO - Interpretation

JF - Interpretation

IS - 3

ER -

Machine-learning algorithm for estimating oil-recovery factor using a combination of engineering and stratigraphic dependent parameters

Abstract

Bibliographical note

Data Availability Statement

Keywords

Access to Document

Other files and links

Fingerprint

Cite this