Loading...
Thumbnail Image
Publication

Methods in data clustering

Parker, Charley
Coffey, Mark
Citations
Altmetric:
Advisor
Editor
Date
2011-08
Date Issued
Date Submitted
Keywords
Research Projects
Organizational Units
Journal Issue
Embargo Expires
Abstract
Data clustering methods explored included: K-means algorithm, which minimizes the distances from all data points to the centroid of each point's associated cluster; power iteration, which approximates eigenvectors of a similarity matrix used to embed the data into a space where K-means can be useful; and wordplay for clustering a set of documents and using a word count to construct a similarity matrix.
Associated Publications
Rights
Copyright of the original work is retained by the author.
Embedded videos