Metocean Prediction using Hadoop, Spark & R

This project is the development of an analysis system for historical Metocean Data. it is also partial recreation of a previous system overcoming some of its shortcomings. The new system will be a single page reactive web application with shiny web UI containing forecasting model, ARIMA developed wi...

Full description

Main Author: Sumayema, Kabir Ricky
Format: Final Year Project
Language: English
Institution: Universiti Teknologi Petronas
Record Id / ISBN-0: utp-utpedia.20962 /
Published: IRC 2019
Subjects:
Online Access: http://utpedia.utp.edu.my/20962/1/SUMAYEMA%20KABIR%20RICKY_24802.pdf
http://utpedia.utp.edu.my/20962/
Tags: Add Tag
No Tags, Be the first to tag this record!
Summary: This project is the development of an analysis system for historical Metocean Data. it is also partial recreation of a previous system overcoming some of its shortcomings. The new system will be a single page reactive web application with shiny web UI containing forecasting model, ARIMA developed with R for the variables of Metocean data stored in Hadoop and spark is integrated to make the computations happen in-memory. The objective is to solve the problem of low speed of matlab and inefficiency of the previous RDBMS. Here, R is replacing functionality of matlab as the backend and Hadoop is replacing the RDBMS as the storage function but distributed file system.. The prediction from arima will be compared to an ML algorithm, Linear Regression, H2O AutoML and the actual data to see its correctness.