Liking cljdoc? Tell your friends :D

tech.ml.dataset.column


cloneclj

(clone col)

Clone this column not changing anything.

Clone this column not changing anything.
sourceraw docstring

column-nameclj

(column-name col)
source

correlationclj

(correlation lhs rhs correlation-type)

Correlation coefficient for given 2 columns. Available correlation types are: :pearson :spearman :kendall

Returns floating point number between [-1 1]

Correlation coefficient for given 2 columns.  Available correlation types
are:
:pearson
:spearman
:kendall

Returns floating point number between [-1 1]
sourceraw docstring

extend-column-with-emptyclj

(extend-column-with-empty column n-empty)
source

is-column?clj

(is-column? item)

Return true if this item is a column.

Return true if this item is a column.
sourceraw docstring

is-missing?clj

(is-missing? col idx)

Return true if this index is missing.

Return true if this index is missing.
sourceraw docstring

merge-metadataclj

(merge-metadata col data-map)

Merge metadata in column with this map. Beware this could change the name of the column.

Merge metadata in column with this map.
Beware this could change the name of the column.
sourceraw docstring

metadataclj

(metadata col)

Return the metadata map for this column. Metadata must contain :name :datatype :size. Categorical columns must have :categorical? true and the inference target should have :target? true.

Return the metadata map for this column.
Metadata must contain :name :datatype :size.  Categorical columns must have
:categorical? true and the inference target should have :target? true.
sourceraw docstring

missingclj

(missing col)

Indexes of missing values. Both iterable and reader.

Indexes of missing values.  Both iterable and reader.
sourceraw docstring

new-columnclj

(new-column name data)
(new-column name data metadata)
(new-column name data metadata missing)
source

parse-columnclj

(parse-column datatype col)
(parse-column datatype options col)

parse a text or a str column, returning a new column with the same name but with a different datatype. This method is single-threaded.

parser-fn-or-kwd is nil by default and can the keyword :relaxed? or a function that must return one of parsed-value, :tech.ml.dataset.parse/missing in which case a missing value will be added or :tech.ml.dataset.parse/parse-failure in which case the a missing index will be added and the string value will be recorded in the metadata's :unparsed-data, :unparsed-indexes entries.

parse a text or a str column, returning a new column with the same name but with
a different datatype.  This method is single-threaded.

parser-fn-or-kwd is nil by default and can the keyword :relaxed?  or a function that
must return one of parsed-value, :tech.ml.dataset.parse/missing in which case a
missing value will be added or :tech.ml.dataset.parse/parse-failure in which case the
a missing index will be added and the string value will be recorded in the metadata's
:unparsed-data, :unparsed-indexes entries.
sourceraw docstring

prepend-column-with-emptyclj

(prepend-column-with-empty column n-empty)
source

selectclj

(select col idx-seq)

Return a new column with the subset of indexes

Return a new column with the subset of indexes
sourceraw docstring

set-metadataclj

(set-metadata col data-map)

Set the metadata on the column returning a new column. Beware this could change the name.

Set the metadata on the column returning a new column.
Beware this could change the name.
sourceraw docstring

set-missingclj

(set-missing col idx-seq)

Set the missing indexes for a column. This doesn't change any values in the underlying data store.

Set the missing indexes for a column.  This doesn't change any values in the
underlying data store.
sourceraw docstring

set-nameclj

(set-name col name)

Return a new column.

Return a new column.
sourceraw docstring

statsclj

(stats col stats-set)

Return a map of stats. Stats set is a set of the desired stats in keyword form. Guaranteed support across implementations for :mean :variance :median :skew. Implementations should check their metadata before doing calculations.

Return a map of stats.  Stats set is a set of the desired stats in keyword
form.  Guaranteed support across implementations for :mean :variance :median :skew.
Implementations should check their metadata before doing calculations.
sourceraw docstring

string-table-keysetclj

(string-table-keyset col)

Get the string table for this column. Returns nil if this isn't a string column. This doesn't necessarily tell you the unique set of the column unless you have just parsed a file. It is, when non-nil, a strict superset of the strings in the columns.

Get the string table for this column.  Returns nil if this isn't a string column.
This doesn't necessarily tell you the unique set of the column unless you have just
parsed a file.  It is, when non-nil, a strict superset of the strings in the
columns.
sourceraw docstring

supported-statsclj

(supported-stats col)

List of available stats for the column

List of available stats for the column
sourceraw docstring

to-double-arrayclj

(to-double-array col & [error-on-missing?])

Convert to a java primitive array of a given datatype. For strings, an implicit string->double mapping is expected. For booleans, true=1 false=0. Finally, any missing values should be indicated by a NaN of the expected type.

Convert to a java primitive array of a given datatype.  For strings,
an implicit string->double mapping is expected.  For booleans, true=1 false=0.
Finally, any missing values should be indicated by a NaN of the expected type.
sourceraw docstring

uniqueclj

(unique col)

Set of all unique values

Set of all unique values
sourceraw docstring

cljdoc is a website building & hosting documentation for Clojure/Script libraries

× close