Thursday, October 20, 2011

10/13/11

Any clustering method has an internal bias
 to find a particular shape of cluster. 
K-Means is looking for spheres (based on
its distance measure) so it Will find spheres.

Consequently, if you have a chain of data, k-means
may not be the best clustering method.  Fortunately,
it is pretty good methodology for text documents.

M.