NONLINEAR PRINCIPAL COMPONENT ANALYSIS AND PRINCIPAL COMPONENT ANALYSIS WITH SUCCESSIVE INTERVAL IN K-MEANS CLUSTER ANALYSIS
Abstract
K-Means Cluster is a cluster analysis for continuous variables with the concept of distance used is a euclidean distance where that distance is used as observation variables which are uncorrelated with each other. The case with the type data that is correlated categorical can be solved either by Nonlinear Principal Component Analysis or by making categorical data into numerical data by the method called successive interval and then used Principal Component Analysis. By comparing the ratio of the variance within cluster and between cluster in poverty data of East Nusa Tenggara Province in K-Means cluster obtained that Principal Component Analysis with Successive interval has a smaller variance ratio than Nonlinear Principal Component Analysis. Variables that take effect to the clusterformation are toilet, fuel,and job.
Keywords: K-Means Cluster Analysis, Nonlinear Principal Component Analysis, Principal Component Analysis, Successive interval.