tech.ml.dataset

Column-major dataset abstraction for efficiently manipulating in-memory datasets.

tech.ml.dataset.categorical

Dealing with categorical data in a dataset involves two mapping systems. The first is a map of category to integer within the same column. The second is a 'one-hot' encoding, where you generate more columns but each has a reduced number of possible categories, usually one categorical value per column.
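
The two mappings can be illustrated with a minimal sketch in plain Clojure. It does not use this namespace's functions; the `colors` column and the names `label->int`, `encoded`, and `one-hot` are hypothetical and exist only for illustration.

```clojure
;; Hypothetical column of categorical values (not taken from the library).
(def colors ["red" "green" "red" "blue"])

;; Mapping 1: category -> integer, staying within the same column.
;; Labels are sorted so the assignment is deterministic.
(def label->int
  (zipmap (sort (distinct colors)) (range)))
;; => {"blue" 0, "green" 1, "red" 2}

(def encoded (mapv label->int colors))
;; => [2 1 2 0]

;; Mapping 2: one-hot, producing one new 0/1 column per category.
(def one-hot
  (into {}
        (for [label (keys label->int)]
          [label (mapv #(if (= % label) 1 0) colors)])))
;; => {"blue" [0 0 0 1], "green" [0 1 0 0], "red" [1 0 1 0]}
```

The integer mapping keeps the column count fixed, while the one-hot form trades extra columns for values that carry no implied ordering.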

tech.ml.dataset.dynamic-int-list

An int-list implementation that resizes its backing store as it is required to hold wider data.
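
As a rough sketch of the idea, and not the namespace's actual implementation, the following plain Clojure keeps integers in the narrowest primitive array that can hold them and switches to a wider array type when a larger value arrives. All names here (`required-width`, `empty-list`, `append`) are hypothetical, and the sketch copies the array on every append for simplicity.

```clojure
;; Widths from narrowest to widest, with the matching array constructors.
(def width-order [:int8 :int16 :int32 :int64])
(def array-ctor {:int8 byte-array :int16 short-array
                 :int32 int-array :int64 long-array})

(defn required-width
  "Narrowest width that can represent the integer v."
  [v]
  (cond
    (<= Byte/MIN_VALUE v Byte/MAX_VALUE)       :int8
    (<= Short/MIN_VALUE v Short/MAX_VALUE)     :int16
    (<= Integer/MIN_VALUE v Integer/MAX_VALUE) :int32
    :else                                      :int64))

(defn wider
  "Returns whichever of the two widths is wider."
  [a b]
  (if (> (.indexOf width-order a) (.indexOf width-order b)) a b))

(defn empty-list []
  {:width :int8 :data (byte-array 0)})

(defn append
  "Returns a new list with v added, widening the backing array if needed."
  [{:keys [width data]} v]
  (let [new-width (wider width (required-width v))
        ctor      (array-ctor new-width)]
    {:width new-width
     :data  (ctor (concat (seq data) [v]))}))

;; Usage: the store starts as bytes and widens to shorts once 300 arrives.
(def example (reduce append (empty-list) [1 2 300]))
```

A practical implementation would presumably grow capacity in chunks rather than copy per element; the point of the sketch is only the widening step.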

tech.ml.dataset.format-sequence

This code was initially provided by genmeblog after careful consideration of the R print code.

tech.ml.dataset.options

The etl pipeline and dataset operators are built to produce a metadata options map. Their API access to the options is centralized in this file.

tech.ml.dataset.parse.mapseq

Sequences of maps are perhaps the most basic pure data structure for data. Converting them into a more structured form (and back) is a key component of dealing with datasets.
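
The round trip can be sketched in plain Clojure without the library. The helpers `mapseq->columns` and `columns->mapseq` below are hypothetical names, and treating `nil` as a missing value is an assumption of this sketch, not a statement about how the namespace handles missing data.

```clojure
;; Hypothetical input: a sequence of maps, one map per row.
(def rows [{:a 1 :b "x"} {:a 2 :b "y"} {:a 3}])

(defn mapseq->columns
  "Turns a sequence of row maps into a map of column name -> vector.
  Keys absent from a row become nil in that column."
  [rows]
  (let [col-names (distinct (mapcat keys rows))]
    (into {}
          (for [k col-names]
            [k (mapv #(get % k) rows)]))))

(defn columns->mapseq
  "Turns a map of column name -> vector back into a sequence of row maps,
  dropping nil entries (assumed here to mean 'missing')."
  [columns]
  (let [n (count (first (vals columns)))]
    (for [i (range n)]
      (into {}
            (for [[k v] columns
                  :when (some? (nth v i))]
              [k (nth v i)])))))

(def columns (mapseq->columns rows))
;; => {:a [1 2 3], :b ["x" "y" nil]}
```

Calling `(columns->mapseq columns)` returns the original rows, which shows that the column-major form carries the same information as the sequence of maps.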

tech.ml.dataset.parse.spreadsheet

Spreadsheets in general are stored in a cell-based format. This means that any cell could have data of any type. Commonalities around parsing spreadsheet-type systems are captured here.

tech.ml.dataset.pca

PCA and K-PCA using smile implementations.

tech.ml.dataset.tensor

Conversion mechanisms from dataset to tensor and back
