"Inital Starting Point Analysis for K-Means Clustering: A Case Study" by Amy Apon, Frank Robinson et al.

Publications

Title

Inital Starting Point Analysis for K-Means Clustering: A Case Study

Authors

Amy Apon, Clemson UniversityFollow
Frank Robinson, Vanderbilt University
Denny Brewer, Acxiom Corporation
Larry Dowdy, Vanderbilt University
Doug Hoffman, Acxiom Corporation
Baochuan Lu, University of Arkansas - Main Campus

Publication Date

3-2006

Publication Title

Proceedings of ALAR 2006 Conference on Applied Research in Information Technology

Abstract

Workload characterization is an important part of systems performance modeling. Clustering is a method used to find classes of jobs within workloads. K-Means is one of the most popular clustering algorithms. Initial starting point values are needed as input parameters when performing k-means clustering. This paper shows that the results of the running the k-means algorithm on the same workload will vary depending on the values chosen as initial starting points. Fourteen methods of composing initial starting point values are compared in a case study. The results indicate that a synthetic method, scrambled midpoints, is an effective starting point method for k-means clustering.

Recommended Citation

Apon, Amy; Robinson, Frank; Brewer, Denny; Dowdy, Larry; Hoffman, Doug; and Lu, Baochuan, "Inital Starting Point Analysis for K-Means Clustering: A Case Study" (2006). Publications. 22.
https://tigerprints.clemson.edu/computing_pubs/22

Download

Included in

Computer Sciences Commons

COinS

Publications

Title

Authors

Publication Date

Publication Title

Abstract

Recommended Citation

Included in

Search

Browse by

Useful Links

Publications

Title

Authors

Publication Date

Publication Title

Abstract

Recommended Citation

Included in

Share

Search

Browse by

Useful Links