Application of Natural Neighbor-based Algorithm on Oversampling SMOTE Algorithms

dc.contributor.authorChutimet Srinilta
dc.contributor.authorSivakorn Kanharattanachai
dc.date.accessioned2026-05-08T19:15:16Z
dc.date.issued2021-4-1
dc.description.abstractClassification performance depends highly on data distribution. In real life, data often come imbalanced where one class is found more often than others. SMOTE-based algorithms are usually used to handle the class imbalance problem. One key parameter that algorithms in SMOTE family require is k-the number of nearest neighbors with respect to a certain data point. K that fits the dataset the most gives the optimum performance. This paper proposes an approach to suggest a value of the parameter k using Natural Neighbor algorithm. Datasets are made balanced by four SMOTE-based algorithms-standard SMOTE, Safe-Level-SMOTE, ModifiedSMOTE and Weighted-SMOTE. The F-measure and Recall matrices are used to evaluate classification performance of a Support Vector Machine classifier running against six datasets with different imbalance ratios. The results show that, the average classification performance achieved by the proposed k's is closer to the optimum when compared with the performance given by the default value of k.
dc.identifier.doi10.1109/iceast52143.2021.9426310
dc.identifier.urihttps://dspace.kmitl.ac.th/handle/123456789/14952
dc.subjectImbalanced Data Classification Techniques
dc.subjectMachine Learning and Data Classification
dc.subjectAnomaly Detection Techniques and Applications
dc.titleApplication of Natural Neighbor-based Algorithm on Oversampling SMOTE Algorithms
dc.typeArticle

Files

Collections