
hfds-clj.core

Dataset fetching and storing from HuggingFace.

HF Datasets provide rich functionality; see https://huggingface.co/docs/datasets/index


download-ds (clj)

(download-ds {:keys [dataset offset length split] :as params}
             {:hfds/keys [cache-dir limit] :or {cache-dir default-cache-dir}})

Download a dataset from HuggingFace. See `load-dataset` for parameter documentation.


load-dataset (clj)

(load-dataset ds-params-or-ds-name)
(load-dataset {:keys [dataset split config offset length]
               :or {split "train" config "default" offset 0 length 100}}
              {:hfds/keys [cache-dir download-mode]
               :or {download-mode :reuse-dataset-if-exists
                    cache-dir default-cache-dir}
               :as read-params})

Download a dataset from HuggingFace. The dataset name is usually given on the dataset's HuggingFace web page, typically in the form `org-name/ds-name`.

The first argument can be either:

  1. a map of HuggingFace HTTP call parameters, which is used as is for the HF REST API HTTP calls, or
  2. a string with the dataset name, which is converted into a `{:dataset ds-name}` param map.

The second argument is a map specifying how to read the dataset.
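The two accepted shapes of the first argument can be illustrated with a small standalone sketch. It does not call hfds-clj; `normalize-ds-params` and `default-params` are hypothetical names introduced here, and the default values are copied from the `:or` clause in the signature above. Whether the library merges defaults into the map it sends is an assumption; the sketch only shows the effective values the destructuring would bind.

```clojure
;; Defaults taken from the :or clause of the load-dataset signature.
(def default-params
  {:split "train" :config "default" :offset 0 :length 100})

(defn normalize-ds-params
  "Hypothetical helper, not part of hfds-clj. Accepts either a dataset
  name string or a param map and returns a param map with the
  documented defaults filled in for any missing keys."
  [ds-params-or-ds-name]
  (let [params (if (string? ds-params-or-ds-name)
                 ;; A bare name becomes {:dataset ds-name}.
                 {:dataset ds-params-or-ds-name}
                 ds-params-or-ds-name)]
    ;; Keys supplied by the caller override the defaults.
    (merge default-params params)))

;; A string and an equivalent map normalize to the same params.
(normalize-ds-params "org-name/ds-name")
(normalize-ds-params {:dataset "org-name/ds-name"})

;; Explicit keys win over the defaults.
(normalize-ds-params {:dataset "org-name/ds-name" :split "test" :length 10})
```

Under these assumptions, passing the string form is a shorthand for the map form with only `:dataset` set, so the remaining parameters fall back to `split "train"`, `config "default"`, `offset 0` and `length 100`.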


read-ds (clj)

(read-ds {:keys [dataset split]}
         {:hfds/keys [limit cache-dir] :or {cache-dir default-cache-dir}})

Read a dataset from the cache; the dataset is assumed to be cached already. See `load-dataset` for parameter documentation.

