(clone col)
Clone this column not changing anything.
Clone this column not changing anything.
(correlation lhs rhs correlation-type)
Correlation coefficient for given 2 columns. Available correlation types are: :pearson :spearman :kendall
Returns floating point number between [-1 1]
Correlation coefficient for given 2 columns. Available correlation types are: :pearson :spearman :kendall Returns floating point number between [-1 1]
(is-column? item)
Return true if this item is a column.
Return true if this item is a column.
(is-missing? col idx)
Return true if this index is missing.
Return true if this index is missing.
(merge-metadata col data-map)
Merge metadata in column with this map. Beware this could change the name of the column.
Merge metadata in column with this map. Beware this could change the name of the column.
(metadata col)
Return the metadata map for this column. Metadata must contain :name :datatype :size. Categorical columns must have :categorical? true and the inference target should have :target? true.
Return the metadata map for this column. Metadata must contain :name :datatype :size. Categorical columns must have :categorical? true and the inference target should have :target? true.
(missing col)
Indexes of missing values. Both iterable and reader.
Indexes of missing values. Both iterable and reader.
(new-column name data)
(new-column name data metadata)
(new-column name data metadata missing)
(parse-column datatype col)
(parse-column datatype options col)
parse a text or a str column, returning a new column with the same name but with a different datatype. This method is single-threaded.
parser-fn-or-kwd is nil by default and can the keyword :relaxed? or a function that must return one of parsed-value, :tech.ml.dataset.parse/missing in which case a missing value will be added or :tech.ml.dataset.parse/parse-failure in which case the a missing index will be added and the string value will be recorded in the metadata's :unparsed-data, :unparsed-indexes entries.
parse a text or a str column, returning a new column with the same name but with a different datatype. This method is single-threaded. parser-fn-or-kwd is nil by default and can the keyword :relaxed? or a function that must return one of parsed-value, :tech.ml.dataset.parse/missing in which case a missing value will be added or :tech.ml.dataset.parse/parse-failure in which case the a missing index will be added and the string value will be recorded in the metadata's :unparsed-data, :unparsed-indexes entries.
(select col idx-seq)
Return a new column with the subset of indexes
Return a new column with the subset of indexes
(set-metadata col data-map)
Set the metadata on the column returning a new column. Beware this could change the name.
Set the metadata on the column returning a new column. Beware this could change the name.
(set-missing col idx-seq)
Set the missing indexes for a column. This doesn't change any values in the underlying data store.
Set the missing indexes for a column. This doesn't change any values in the underlying data store.
(stats col stats-set)
Return a map of stats. Stats set is a set of the desired stats in keyword form. Guaranteed support across implementations for :mean :variance :median :skew. Implementations should check their metadata before doing calculations.
Return a map of stats. Stats set is a set of the desired stats in keyword form. Guaranteed support across implementations for :mean :variance :median :skew. Implementations should check their metadata before doing calculations.
(string-table-keyset col)
Get the string table for this column. Returns nil if this isn't a string column. This doesn't necessarily tell you the unique set of the column unless you have just parsed a file. It is, when non-nil, a strict superset of the strings in the columns.
Get the string table for this column. Returns nil if this isn't a string column. This doesn't necessarily tell you the unique set of the column unless you have just parsed a file. It is, when non-nil, a strict superset of the strings in the columns.
(supported-stats col)
List of available stats for the column
List of available stats for the column
(to-double-array col & [error-on-missing?])
Convert to a java primitive array of a given datatype. For strings, an implicit string->double mapping is expected. For booleans, true=1 false=0. Finally, any missing values should be indicated by a NaN of the expected type.
Convert to a java primitive array of a given datatype. For strings, an implicit string->double mapping is expected. For booleans, true=1 false=0. Finally, any missing values should be indicated by a NaN of the expected type.
cljdoc is a website building & hosting documentation for Clojure/Script libraries
× close