Liking cljdoc? Tell your friends :D

Change Log

All notable changes to this project will be documented in this file. This change log follows the conventions of keepachangelog.com.

[0.1.5] - 2022-06-08

Default calling convention now runs multiple instances of k means clustering.
k-means can now be called with an option to specify which distance function to use.
k-means now supports many different distance functions.

Now supporting initialization via k-means++.
Now supporting initialization via k-means||.
Now supporting initialization via k-means||.
Now supporting initialization via naive uniform sampling.
k-means can now be called with options to determine the initialization method.
Now supporting parquet format.
Now supporting arrow format.
Now supporting arrows format.
k-means can now be called with options to determine preferred file format.
Added logs to help make progress of computations more obvious.
Added the initialize-centroids multimethod; choice of initialization method can now be controlled by callers via multimethods.
Now support calling k-means with lazyseqs which are ->dataset compatible.

Fixed an issue with processing large datasets by moving from arrow files to arrows IPC format.
Rebranded from clj-kmeans to josh.meanings, because I like the name meanings but figure I should namespace it.

Dramatically improved performance on large datasets accomplished by switching from buffered csv reading to optimized tech.ml.dataset usage.

Initial project created with support for k-means clustering on larger than memory datasets.
Only configuration provided is choice of k. Distance function used is earth mover distance.

Can you improve this documentation?Edit on GitHub

cljdoc builds & hosts documentation for Clojure/Script libraries

Keyboard shortcuts