Semantic-k-NN algorithm: An enhanced version of traditional k-NN algorithm

The k-NN algorithm is one of the most renowned ML algorithms widely used in the area of data classification research. With the emergence of big data, the performance and the efficiency of the traditional k-NN algorithm is fast becoming a critical issue. The traditional k-NN algorithm is inefficient...

Full description

Main Authors: Ali, M., Jung, L.T., Abdel-Aty, A.-H., Abubakar, M.Y., Elhoseny, M., Ali, I.
Format: Article
Institution: Universiti Teknologi Petronas
Record Id / ISBN-0: utp-eprints.23434 /
Published: Elsevier Ltd 2020
Online Access: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85082131409&doi=10.1016%2fj.eswa.2020.113374&partnerID=40&md5=5b7d85aeca053fbfddcfacd44a1a09de
http://eprints.utp.edu.my/23434/
Tags: Add Tag
No Tags, Be the first to tag this record!
id utp-eprints.23434
recordtype eprints
spelling utp-eprints.234342021-08-19T07:20:17Z Semantic-k-NN algorithm: An enhanced version of traditional k-NN algorithm Ali, M. Jung, L.T. Abdel-Aty, A.-H. Abubakar, M.Y. Elhoseny, M. Ali, I. The k-NN algorithm is one of the most renowned ML algorithms widely used in the area of data classification research. With the emergence of big data, the performance and the efficiency of the traditional k-NN algorithm is fast becoming a critical issue. The traditional k-NN algorithm is inefficient to solve the high volume multi-categorical training datasets Traditional k-NN algorithm has a constraint in filtering the training dataset to yield training data that are most relevant to the intended or the targeted test dataset/file. It has to scan through all the training datasets categories to classify the intended/targeted data. As such, traditional k-NN is considered not intelligent and consequently is suffering poor accuracy performance with high computational complexity. A Semantic-kNN (Sk-NN) algorithm for ML is thus proposed in this paper to address the limitations in the traditional k-NN. The proposed Sk-NN deploys a process by leveraging on the semantic itemization and bigram model to filter the training dataset in accordance with the relevant information engaged in the test dataset. It is aimed for general security applications such as finding (the confidentiality level of the data when the algorithm is trained with multiple training categories during the data classification phase. Ultimately, Sk-NN is to elevate the ML performance in pattern extraction and labeling in the big data context. © 2020 Elsevier Ltd 2020 Article NonPeerReviewed https://www.scopus.com/inward/record.uri?eid=2-s2.0-85082131409&doi=10.1016%2fj.eswa.2020.113374&partnerID=40&md5=5b7d85aeca053fbfddcfacd44a1a09de Ali, M. and Jung, L.T. and Abdel-Aty, A.-H. and Abubakar, M.Y. and Elhoseny, M. and Ali, I. (2020) Semantic-k-NN algorithm: An enhanced version of traditional k-NN algorithm. Expert Systems with Applications, 151 . http://eprints.utp.edu.my/23434/
institution Universiti Teknologi Petronas
collection UTP Institutional Repository
description The k-NN algorithm is one of the most renowned ML algorithms widely used in the area of data classification research. With the emergence of big data, the performance and the efficiency of the traditional k-NN algorithm is fast becoming a critical issue. The traditional k-NN algorithm is inefficient to solve the high volume multi-categorical training datasets Traditional k-NN algorithm has a constraint in filtering the training dataset to yield training data that are most relevant to the intended or the targeted test dataset/file. It has to scan through all the training datasets categories to classify the intended/targeted data. As such, traditional k-NN is considered not intelligent and consequently is suffering poor accuracy performance with high computational complexity. A Semantic-kNN (Sk-NN) algorithm for ML is thus proposed in this paper to address the limitations in the traditional k-NN. The proposed Sk-NN deploys a process by leveraging on the semantic itemization and bigram model to filter the training dataset in accordance with the relevant information engaged in the test dataset. It is aimed for general security applications such as finding (the confidentiality level of the data when the algorithm is trained with multiple training categories during the data classification phase. Ultimately, Sk-NN is to elevate the ML performance in pattern extraction and labeling in the big data context. © 2020
format Article
author Ali, M.
Jung, L.T.
Abdel-Aty, A.-H.
Abubakar, M.Y.
Elhoseny, M.
Ali, I.
spellingShingle Ali, M.
Jung, L.T.
Abdel-Aty, A.-H.
Abubakar, M.Y.
Elhoseny, M.
Ali, I.
Semantic-k-NN algorithm: An enhanced version of traditional k-NN algorithm
author_sort Ali, M.
title Semantic-k-NN algorithm: An enhanced version of traditional k-NN algorithm
title_short Semantic-k-NN algorithm: An enhanced version of traditional k-NN algorithm
title_full Semantic-k-NN algorithm: An enhanced version of traditional k-NN algorithm
title_fullStr Semantic-k-NN algorithm: An enhanced version of traditional k-NN algorithm
title_full_unstemmed Semantic-k-NN algorithm: An enhanced version of traditional k-NN algorithm
title_sort semantic-k-nn algorithm: an enhanced version of traditional k-nn algorithm
publisher Elsevier Ltd
publishDate 2020
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85082131409&doi=10.1016%2fj.eswa.2020.113374&partnerID=40&md5=5b7d85aeca053fbfddcfacd44a1a09de
http://eprints.utp.edu.my/23434/
_version_ 1741196675022585856
score 11.62408