Liking cljdoc? Tell your friends :D

tech.ml.dataset.categorical

Dealing with categorical dataset data involves having two mapping systems. The first is a map of category to integer within the same column. The second is a 'one-hot' encoding where you generate more columns but those have a reduced number of possible categories, usually one categorical value per column.

Dealing with categorical dataset data involves having two mapping systems.
The first is a map of category to integer within the same column.
The second is a 'one-hot' encoding where you generate more columns but those have
a reduced number of possible categories, usually one categorical value per
column.
raw docstring

build-categorical-mapclj

(build-categorical-map dataset column-name-seq & [table-value-list])

Given a dataset and these columns, produce a label-map of column-name to specific categorical label-map.

Given a dataset and these columns, produce a label-map of
column-name to specific categorical label-map.
sourceraw docstring

build-one-hot-mapclj

(build-one-hot-map dataset column-name-seq & [one-hot-table-args])
source

column-categorical-mapclj

(column-categorical-map categorical-map new-dtype old-column)

Given a categorical map for a given column, produce a new column of the desired datatype with the values mapped to the table values.

Given a categorical map for a given column, produce a new column
of the desired datatype with the values mapped to the table values.
sourceraw docstring

column-one-hot-mapclj

(column-one-hot-map one-hot-map new-dtype dataset column-name)

Using one hot map, produce Y new columns while removing existing column.

Using one hot map, produce Y new columns while removing existing column.
sourceraw docstring

column-values->categoricalclj

(column-values->categorical dataset src-column categorical-map)

Given a column encoded via either string->number or one-hot, reverse map to the a sequence of the original string column values.

Given a column encoded via either string->number or one-hot, reverse
map to the a sequence of the original string column values.
sourceraw docstring

inverse-map-categorical-col-fnclj

(inverse-map-categorical-col-fn src-column column-categorical-map)
source

inverse-map-categorical-columnsclj

(inverse-map-categorical-columns dataset src-column column-categorical-map)
source

is-one-hot-label-map?clj

(is-one-hot-label-map? label-map)
source

make-string-table-from-table-argsclj

(make-string-table-from-table-args table-value-list)

Make a mapping of value->index from a list of either string values or [valname idx] pairs. Returns map of value->index.

Make a mapping of value->index from a list of either string values or [valname idx]
pairs.
Returns map of value->index.
sourceraw docstring

cljdoc is a website building & hosting documentation for Clojure/Script libraries

× close