TMD upgrade, no changes in TC
TMD upgrade
reorder-columns on empty dataset returns nilaggregate-columns didn't keep column order (#35)pipeline functions have doc copied from original onessplit can turn off shuffling now (:shuffle? option)split :holdouts - sequence of consecutive holdoutstech.ml.dataset version bump, this introduces the change of the order of the groups after group-by operation
split :holdout supports any number of splits (minimum 2) [#28]split supports split-names to provide custom names for subdatasetsconcat and concat-copying are working with grouped datasetskfold split failed on small number of rows (due to partition-all behavioursplit->seq to return train/test splits as a sequence or datasets or as map of sequences for grouped datasetstablecloth.pipeline returns a map with dataset under :metamorph/data key (see metamorph)split returns now a dataset or grouped dataset with two new columns indicating train/test and split id. See split->seq for previous behaviour.without-grouping-> threading macro which allows operations on grouping dataset treated as a regular one.group-by accepts any java.util.Map for a collection of indexes (use LinkedHashMap to persist an order)tablecloth.api.group-by functions moved to tablecloth.api.utils, no changes to APIadd-or-replace-column(s) replaced by add-column(s) (add-or-replace-column(s) is marked as deprecated) (#16)mark-as-group wasn't visible in API (#18)map-columns didn't propagate new-type for grouped case (#20)let-dataset - to simulate tibble from Rrows and columns new result: :as-double-arrays - convert rows to 2d double arraytablecloth.pipeline for pipeline operationsconcat-copying exposed.split function for splitting into train-test pairs with :kfold, :bootstrap, :loo and holdout strategies + stratified versionsreplace-missing with new strategy :midpointt.m.d update
t.m.d update
t.m.d update
write-nippy! and read-nippy are deprecated, replaced by write! and datasettech.ml.dataset version 5.0-alpha*
map-columns accepts optional target datatypeds/column->dataset functionality introduced in separate-column:text among others)write-csv! replaced by write! (write-csv! is marked as deprecated)info field :size is replaced by :n-elemsseparate-column 3-arity version accepts separator instead target-columns nowtech.ml.dataset version 4.04
tech.ml.dataset version 4.03
parallel? option set to true). These are: aggregate, unique-by, order-by, join-columns, separate-columns, ungroupaggregation uses now in-place ungrouping which is much fastertech.ml.dataset version 3.06
fill-range-replace to inject data to make continuous seqence in columnwrite-nippy! and read-nippytech.ml.dataset version 2.13
replace-missing new strategies: :mid and :lerp, working also for dates.replace-missing has different conctract and default strategy :mid. value argument is the last argument now.replace-missing :up and :down strategies, when value is nil fills border missing values with nearest value.tech.ml.dataset version 2.06
asof-join addedreshape testspivot->wider accepts :drop-missing? option (default: true)pivot->wider drops missing rows by defaultpivto->wider order of concatenated column names is reversed (first: colnames, last: value), was opposite.pivot->longer :splitter accepts string used for splitting column nameCan you improve this documentation? These fine people already did:
GenerateMe, apanj00, ashimapanjwani & genmeblogEdit on GitHub
cljdoc builds & hosts documentation for Clojure/Script libraries
| Ctrl+k | Jump to recent docs |
| ← | Move to previous article |
| → | Move to next article |
| Ctrl+/ | Jump to the search field |