DEVELOPMENT OF TREE-BASED ENSEMBLE LEARNING ALGORITHMS FOR ESTIMATING TOTAL ORGANIC CARBON FROM WIRELINE LOGS

Total organic carbon (TOC) is a significant factor in evaluating the hydrocarbon generation potential of rock strata. Estimating TOC from well logs is considered challenging, and direct laboratory measurement on rock specimens is costly and time-consuming. Therefore, due to the complex and n...


Main Author: A RAHAMAN, MD SHOKOR
Format: Thesis
Language: English
Institution: Universiti Teknologi Petronas
Record Id / ISBN-0: utp-utpedia.22657 /
Published: 2021
Subjects: Q Science (General)
Online Access: http://utpedia.utp.edu.my/22657/1/Md%20Shokor_17009796.pdf
http://utpedia.utp.edu.my/22657/
id utp-utpedia.22657
recordtype eprints
spelling utp-utpedia.22657 2022-02-22T07:12:20Z http://utpedia.utp.edu.my/22657/
2021-11 Thesis NonPeerReviewed application/pdf en http://utpedia.utp.edu.my/22657/1/Md%20Shokor_17009796.pdf
A RAHAMAN, MD SHOKOR (2021) DEVELOPMENT OF TREE-BASED ENSEMBLE LEARNING ALGORITHMS FOR ESTIMATING TOTAL ORGANIC CARBON FROM WIRELINE LOGS. Masters thesis, Universiti Teknologi PETRONAS.
institution Universiti Teknologi Petronas
collection UTPedia
language English
topic Q Science (General)
description Total organic carbon (TOC) is a significant factor in evaluating the hydrocarbon generation potential of rock strata. Estimating TOC from well logs is considered challenging, and direct laboratory measurement on rock specimens is costly and time-consuming. Because the relationship between well logs and TOC is complex and nonlinear, researchers have begun to use artificial intelligence (AI) techniques. The prediction accuracy of the Passey method is low, and AI techniques such as Artificial Neural Network (ANN) and Support Vector Machine (SVM) can get trapped in local optima, resulting in overfitting, heavy computational work, and even errors if the technique is not suitable. In this thesis, seven AI algorithms are used for TOC prediction: an Artificial Neural Network (ANN); four efficient tree-based ensemble techniques, namely Random Forest (RF), Extra Trees (extremely randomized trees, ET), Gradient Boosting (GB), and Extreme Gradient Boosting (XGB); and two hybrid AI models, namely Genetic Algorithm-Artificial Neural Network (GA-ANN) and Particle Swarm Optimization-Artificial Neural Network (PSO-ANN). Among the seven algorithms studied, the four tree-based ensemble models are capable of fitting highly nonlinear data and require minimal data pre-processing. Specifically, 205 data points and seven well logs (GR, DT, RHOB, SP, NPHI, LLD, and LLS) from the Goldwyer Formation of the Canning Basin were used to train and test the seven AI models, in order to evaluate their efficiency and provide comparable results for TOC estimation. The results validate that the tree-based ensemble techniques reach an exemplary level of accuracy for TOC content estimation, and that the XGB model outperformed all the other AI models on both the training and testing data sets, including the other tree-based ensemble techniques (RF, ET, and GB). These robust tree-based ensemble models not only guard against overfitting but also achieve better prediction results when dealing with multidimensional data. Finally, some possible combinations that have not yet been investigated are proposed.
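The gradient-boosting idea named in the description can be illustrated with a minimal pure-Python sketch. It is not the thesis's implementation: it assumes squared-error loss, uses one-split regression stumps instead of full trees, and runs on synthetic data. The seven log names are borrowed from the record only as column labels, and the target function, sample sizes, and hyperparameters are hypothetical choices made for illustration.

```python
import random

# Labels borrowed from the record's list of well logs; all values below are synthetic.
LOG_NAMES = ["GR", "DT", "RHOB", "SP", "NPHI", "LLD", "LLS"]


def fit_stump(X, residuals):
    """Return (feature, threshold, left_mean, right_mean) minimising squared error."""
    best = None
    for j in range(len(X[0])):
        values = sorted(set(row[j] for row in X))
        # Candidate thresholds: midpoints between consecutive distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2.0
            left = [r for row, r in zip(X, residuals) if row[j] <= t]
            right = [r for row, r in zip(X, residuals) if row[j] > t]
            lm, rm = sum(left) / len(left), sum(right) / len(right)
            sse = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
            if best is None or sse < best[0]:
                best = (sse, j, t, lm, rm)
    return best[1:]


def predict_stump(stump, row):
    j, t, lm, rm = stump
    return lm if row[j] <= t else rm


def fit_gradient_boosting(X, y, n_rounds=50, learning_rate=0.1):
    """Start from the mean, then repeatedly fit a stump to the current residuals."""
    base = sum(y) / len(y)
    preds = [base] * len(y)
    stumps = []
    for _ in range(n_rounds):
        residuals = [yi - pi for yi, pi in zip(y, preds)]
        stump = fit_stump(X, residuals)
        stumps.append(stump)
        preds = [p + learning_rate * predict_stump(stump, row)
                 for p, row in zip(preds, X)]
    return base, learning_rate, stumps


def predict(model, row):
    base, learning_rate, stumps = model
    return base + learning_rate * sum(predict_stump(s, row) for s in stumps)


def mse(model, X, y):
    return sum((predict(model, row) - yi) ** 2 for row, yi in zip(X, y)) / len(y)


# Synthetic stand-in data: 60 samples, seven features scaled to [0, 1).
rng = random.Random(0)
X = [[rng.random() for _ in LOG_NAMES] for _ in range(60)]
# Hypothetical nonlinear target, chosen only to give the stumps something to fit.
y = [2.0 * row[0] ** 2 + (1.0 if row[2] < 0.5 else 0.0) + rng.gauss(0, 0.05)
     for row in X]

X_train, y_train = X[:45], y[:45]
X_test, y_test = X[45:], y[45:]

mean_only = fit_gradient_boosting(X_train, y_train, n_rounds=0)
model = fit_gradient_boosting(X_train, y_train, n_rounds=50)

train_mse_mean = mse(mean_only, X_train, y_train)
train_mse_boosted = mse(model, X_train, y_train)
print("train MSE, mean only:", train_mse_mean)
print("train MSE, boosted:  ", train_mse_boosted)
print("test MSE, boosted:   ", mse(model, X_test, y_test))
```

Each round fits a stump to the residuals of the current prediction and adds a shrunken copy of it, which is why the training error falls as rounds accumulate. In practice a library implementation with full trees and a held-out test set would be used rather than this sketch.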
format Thesis
author A RAHAMAN, MD SHOKOR
title DEVELOPMENT OF TREE-BASED ENSEMBLE LEARNING ALGORITHMS FOR ESTIMATING TOTAL ORGANIC CARBON FROM WIRELINE LOGS
publishDate 2021
url http://utpedia.utp.edu.my/22657/1/Md%20Shokor_17009796.pdf
http://utpedia.utp.edu.my/22657/