An Enhanced Spectral Clustering Algorithm with S-Distance
by: Krishna Kumar Sharma, Ayan Seal, Enrique Herrera-Viedma, Ondrej Krejcar
Format: | Article |
---|---|
Published: | MDPI AG 2021-04-01 |
Description
Calculating and monitoring customer churn metrics is important for companies that want to retain customers and increase profit. In this study, a churn prediction framework is developed using a modified spectral clustering (SC) algorithm. Because the similarity measure plays a critical role in how accurately clustering predicts churn from industrial data, the linear Euclidean distance in traditional SC is replaced by the non-linear S-distance (<i>Sd</i>). <i>Sd</i> is derived from the concept of S-divergence (<i>SD</i>). Several properties of <i>Sd</i> are discussed in this work. Experiments are conducted to validate the proposed clustering algorithm on four synthetic databases, eight UCI databases, two industrial databases and one telecommunications database related to customer churn. Three existing clustering algorithms, <i>k</i>-means, density-based spatial clustering of applications with noise (DBSCAN) and conventional SC, are also run on the same 15 databases. The empirical results show that the proposed clustering algorithm outperforms the three existing algorithms in terms of Jaccard index, F-score, recall, precision and accuracy. Finally, the significance of the clustering results is tested with Wilcoxon's signed-rank test, Wilcoxon's rank-sum test and the sign test. The comparative study shows that the proposed algorithm gives promising results, especially for clusters of arbitrary shape.