Iterative Fuzzy C Means, Fuzzy Silhouette, and Imputation for Missing Values in a Dataset

A missing value is an error that always happened, and it is unavoidable. This error should be handled correctly before data is processed into the processing model. This paper proposes a method of imputation by employing iterative Fuzzy C Means (FCM), centroid values and, fuzzy silhouette to handle m...

Full description

Main Authors: Mausor, F.H., Jaafar, J., Taib, S.M., Razali, R.
Format: Conference or Workshop Item
Institution: Universiti Teknologi Petronas
Record Id / ISBN-0: utp-eprints.29131 /
Published: Institute of Electrical and Electronics Engineers Inc. 2021
Online Access: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85125305962&doi=10.1109%2fICOCO53166.2021.9673541&partnerID=40&md5=84bd38086f154b2dbc30c8932c25a78c
http://eprints.utp.edu.my/29131/
Tags: Add Tag
No Tags, Be the first to tag this record!
Summary: A missing value is an error that always happened, and it is unavoidable. This error should be handled correctly before data is processed into the processing model. This paper proposes a method of imputation by employing iterative Fuzzy C Means (FCM), centroid values and, fuzzy silhouette to handle missing values problem. Missing values can be treated by imputing the missing values. The advantage of FCM is it can provide a better separation of instances when an object is not well separated. It is a well-known clustering method that can provide better clustering result. The optimal clustering value can be measure by using fuzzy silhouette. In this paper, the relationship between imputation based on FCM, fuzzy silhouette and the optimal cluster is identified. Also, the factors that can give impact to accuracy of imputation is recognized, © 2021 IEEE.