
consimilo.minhash


build-minhash (clj)

(build-minhash feature-coll)
(build-minhash [feature & features] hashvalues)

Iterates through a document feature collection, ['token-1 'token-2 ... 'token-n], updating the minhash with each feature. Returns the completed minhash.
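The two arities above suggest a fold over the feature collection. A minimal sketch of that shape, not the library's source: `toy-update` is a hypothetical stand-in for update-minhash built on Clojure's own `hash`, and the signature length of 4 and the 32-bit starting value are assumptions chosen for illustration.

```clojure
;; Hypothetical stand-in for update-minhash: folds one feature into the
;; signature, using Clojure's built-in `hash` seeded by the slot index.
(defn toy-update [hashvalues feature]
  (vec (map-indexed
        (fn [i v] (min v (bit-and (hash [i feature]) 0xFFFFFFFF)))
        hashvalues)))

(defn build-minhash-sketch
  ;; One-argument arity: start from a signature of maximum values (assumed).
  ([feature-coll]
   (build-minhash-sketch feature-coll (vec (repeat 4 0xFFFFFFFF))))
  ;; Two-argument arity: consume one feature per step until none remain.
  ([[feature & features] hashvalues]
   (if (nil? feature)
     hashvalues
     (recur features (toy-update hashvalues feature)))))

;; Usage: every slot keeps a minimum, so feature order does not matter.
(build-minhash-sketch ['token-1 'token-2 'token-3])
```

Because each slot only ever takes a minimum, the same result comes from `(reduce toy-update init feature-coll)`.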


large-prime (clj)


merge-minhash (clj)

(merge-minhash minhash1 minhash2)

Merges two minhashes by taking the elementwise minimum of the two minhash vectors.
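The elementwise minimum can be sketched in plain Clojure as below. This is an illustration, not the library's source: `merge-minhash-sketch` is a hypothetical name, and the equal-length precondition is an added assumption. When both signatures were built with the same permutations, the merged vector equals the signature of the union of the two feature sets, since the minimum over a union is the minimum of the two minimums.

```clojure
(defn merge-minhash-sketch
  "Elementwise minimum of two equal-length minhash vectors."
  [minhash1 minhash2]
  ;; Assumed guard: signatures of different lengths are not comparable.
  {:pre [(= (count minhash1) (count minhash2))]}
  (mapv min minhash1 minhash2))

(merge-minhash-sketch [3 9 4] [5 2 4]) ;; => [3 2 4]
```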


permutations (clj)


update-minhash (clj)

(update-minhash hashvalues feature)

Updates the minhash with one document feature (token, shingle, n-gram, etc.). Features are hashed with SHA-1 and the hash is truncated at max-hash, so documents with varying feature sizes can be hashed. Create one minhash per document.
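One common way to realize this update step is sketched below, under stated assumptions: SHA-1 of the feature reduced to 32 bits, then a linear permutation `(a * hv + b) mod prime` per signature slot, keeping the minimum. Every name and constant here (`sketch-max-hash`, `sketch-prime`, `sketch-perms`, the four fixed `[a b]` pairs) is hypothetical; the library's own max-hash, large-prime and permutations values are not shown on this page and may differ.

```clojure
(import 'java.security.MessageDigest)

;; Illustrative constants only, not consimilo's actual values.
(def sketch-max-hash 0xFFFFFFFF)                    ; keep the low 32 bits
(def sketch-prime 2305843009213693951)              ; 2^61 - 1
(def sketch-perms [[3 7] [5 11] [17 23] [101 13]])  ; one [a b] pair per slot

(defn feature-hash
  "SHA-1 of the feature's string form, reduced to its first 4 digest bytes."
  [feature]
  (let [md     (MessageDigest/getInstance "SHA-1")
        digest (.digest md (.getBytes (str feature) "UTF-8"))]
    (bit-and sketch-max-hash
             (reduce (fn [acc b] (+ (* acc 256) (bit-and b 0xff)))
                     0
                     (take 4 digest)))))

(defn update-minhash-sketch
  "Folds one feature into hashvalues, keeping the minimum in each slot."
  [hashvalues feature]
  (let [hv (feature-hash feature)]
    (mapv (fn [current [a b]]
            ;; *' and +' promote to BigInt instead of overflowing.
            (let [permuted (bit-and sketch-max-hash
                                    (long (mod (+' (*' a hv) b) sketch-prime)))]
              (min current permuted)))
          hashvalues
          sketch-perms)))

;; Usage: start each document from a vector of maximum values.
(def init-hashvalues (vec (repeat (count sketch-perms) sketch-max-hash)))
(update-minhash-sketch init-hashvalues "token-1")
```

Reducing every feature hash to the same fixed width is what lets features of any size share one signature.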

