K-Means clustering generates a specific number of disjoint, non-hierarchical clusters. It is well suited to generating globular clusters. The K-Means method is numerical, unsupervised, non-deterministic and iterative. Every member of a cluster is closer to its cluster center than the center of any other cluster.
The choice of initial partition can greatly affect the final clusters that result, in terms of inter-cluster and intracluster distances and cohesion. As a result k means is best run multiple times in order to avoid the trap of a local minimum.
K-Means clustering generates a specific number of disjoint, non-hierarchical clusters. It is well suited to generating globular clusters. The K-Means method is numerical, unsupervised, non-deterministic and iterative. Every member of a cluster is closer to its cluster center than the center of any other cluster. The choice of initial partition can greatly affect the final clusters that result, in terms of inter-cluster and intracluster distances and cohesion. As a result k means is best run multiple times in order to avoid the trap of a local minimum.
(assignments centroids distance-fn points)
Returns the assignments of the points to the centroids.
Returns the assignments of the points to the centroids.
(classify centroids distance-fn point)
Returns the index of the centroid that is closest to the point.
Returns the index of the centroid that is closest to the point.
(cost centroids distance-fn assignment point)
Returns the distance of an assigned point from its centroid.
Returns the distance of an assigned point from its centroid.
(costs centroids distance-fn assignments points)
Returns the distances of the points from their centroids.
Returns the distances of the points from their centroids.
(dataset-assignments centroids distance-fn cols points)
Updates the assignments dataset with the new assignments.
Updates the assignments dataset with the new assignments.
(dataset-assignments-seq centroids distance-fn cols points-seq)
Updates a sequence of assignment datasets with the new assignments.
Updates a sequence of assignment datasets with the new assignments.
(dataset-costs centroids distance-fn cols points)
Returns the distances of the points from their centroids.
Returns the distances of the points from their centroids.
(dataset-costs-seq centroids distance-fn cols points-seq)
Returns the distances of the points from their centroids.
Returns the distances of the points from their centroids.
(distances centroids distance-fn point)
Returns a vector of distance of the centroids from the point.
Returns a vector of distance of the centroids from the point.
(estimate-size config)
Update the number of records in the dataset.
Update the number of records in the dataset.
(initialize-centroids! s)
Calls initialize-centroids and writes the returned dataset to the centroids file.
Calls initialize-centroids and writes the returned dataset to the centroids file.
(initialize-k-means-state points-file k options)
Sets initial configuration options for the k means calculation.
Sets initial configuration options for the k means calculation.
(k-means-seq dataset k & options)
Returns a lazy sequence of m ClusterResult.
Returns a lazy sequence of m ClusterResult.
(max-index coll)
Returns the index of the minimum value in a collection.
Returns the index of the minimum value in a collection.
(min-index coll)
Returns the index of the minimum value in a collection.
Returns the index of the minimum value in a collection.
(regenerate-assignments! s)
Writes the new assignments to the assignments file.
Writes the new assignments to the assignments file.
(stabilized? centroids-1 centroids-2)
K-means is said to be stabilized when performing an iterative refinement (often called a lloyd iteration), does not result in any shifting of points between clusters. A stabilized k-means calculation can be stopped, because further refinement won't produce any changes.
K-means is said to be stabilized when performing an iterative refinement (often called a lloyd iteration), does not result in any shifting of points between clusters. A stabilized k-means calculation can be stopped, because further refinement won't produce any changes.
(sum coll)
Returns the sum of the numbers in the sequence.
Returns the sum of the numbers in the sequence.
cljdoc is a website building & hosting documentation for Clojure/Script libraries
× close