zero-one.geni.core.foreign-idioms

Liking cljdoc? Tell your friends :D

Clojure only.

->dataset
clip
cut
name-value-seq->dataset
nlargest
nsmallest
nunique
qcut
random-choice
random-exp
random-int
random-norm
random-uniform
rchoice
rexp
rnorm
runif
runiform
select-columns
shape
value-counts

->dataset^cljmultimethod

Create a Dataset from a path or a collection of records.

Create a Dataset from a path or a collection of records.

source raw docstring

clip^clj

(clip expr low high)

Returns a new Column where values outside [low, high] are clipped to the interval edges.

Returns a new Column where values outside `[low, high]` are clipped to the interval edges.

source raw docstring

cut^clj

(cut expr bins)

Returns a new Column of discretised expr into the intervals of bins.

Returns a new Column of discretised `expr` into the intervals of bins.

source raw docstring

name-value-seq->dataset^clj

(name-value-seq->dataset map-of-values)

(name-value-seq->dataset spark map-of-values)

Construct a Dataset from an associative map.

(g/show (g/map->dataset {:a [1 2], :b [3 4]}))
; +---+---+
; |a  |b  |
; +---+---+
; |1  |3  |
; |2  |4  |
; +---+---+

Construct a Dataset from an associative map.

```clojure
(g/show (g/map->dataset {:a [1 2], :b [3 4]}))
; +---+---+
; |a  |b  |
; +---+---+
; |1  |3  |
; |2  |4  |
; +---+---+
```

source raw docstring

nlargest^clj

(nlargest dataframe n-rows expr)

Return the Dataset with the first n-rows rows ordered by expr in descending order.

Return the Dataset with the first `n-rows` rows ordered by `expr` in descending order.

source raw docstring

nsmallest^clj

(nsmallest dataframe n-rows expr)

Return the Dataset with the first n-rows rows ordered by expr in ascending order.

Return the Dataset with the first `n-rows` rows ordered by `expr` in ascending order.

source raw docstring

nunique^clj

(nunique dataframe)

Count distinct observations over all columns in the Dataset.

Count distinct observations over all columns in the Dataset.

source raw docstring

qcut^clj

(qcut expr num-buckets-or-probs)

Returns a new Column of discretised expr into equal-sized buckets based on rank or based on sample quantiles.

Returns a new Column of discretised `expr` into equal-sized buckets based
on rank or based on sample quantiles.

source raw docstring

random-choice^clj

(random-choice choices)

(random-choice choices probs)

(random-choice choices probs seed)

Returns a new Column of a random sample from a given collection of choices.

Returns a new Column of a random sample from a given collection of `choices`.

source raw docstring

random-exp^clj

(random-exp)

(random-exp rate)

(random-exp rate seed)

Returns a new Column of draws from an exponential distribution.

Returns a new Column of draws from an exponential distribution.

source raw docstring

random-int^clj

(random-int)

(random-int low high)

(random-int low high seed)

Returns a new Column of random integers from low (inclusive) to high (exclusive).

Returns a new Column of random integers from `low` (inclusive) to `high` (exclusive).

source raw docstring

random-norm^clj

(random-norm)

(random-norm mu sigma)

(random-norm mu sigma seed)

Returns a new Column of draws from a normal distribution.

Returns a new Column of draws from a normal distribution.

source raw docstring

random-uniform^clj

(random-uniform)

(random-uniform low high)

(random-uniform low high seed)

Returns a new Column of draws from a uniform distribution.

Returns a new Column of draws from a uniform distribution.

source raw docstring

rchoice^clj

(rchoice choices)

(rchoice choices probs)

(rchoice choices probs seed)

Returns a new Column of a random sample from a given collection of choices.

Returns a new Column of a random sample from a given collection of `choices`.

source raw docstring

rexp^clj

(rexp)

(rexp rate)

(rexp rate seed)

Returns a new Column of draws from an exponential distribution.

Returns a new Column of draws from an exponential distribution.

source raw docstring

rnorm^clj

(rnorm)

(rnorm mu sigma)

(rnorm mu sigma seed)

Returns a new Column of draws from a normal distribution.

Returns a new Column of draws from a normal distribution.

source raw docstring

runif^clj

(runif)

(runif low high)

(runif low high seed)

Returns a new Column of draws from a uniform distribution.

Returns a new Column of draws from a uniform distribution.

source raw docstring

runiform^clj

(runiform)

(runiform low high)

(runiform low high seed)

Returns a new Column of draws from a uniform distribution.

Returns a new Column of draws from a uniform distribution.

source raw docstring

select-columns^clj

(select-columns dataframe & exprs)

Params: (cols: Column*)

Result: DataFrame

Selects a set of column based expressions.

2.0.0

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/sql/Dataset.html

Timestamp: 2020-10-19T01:56:20.931Z

Params: (cols: Column*)

Result: DataFrame

Selects a set of column based expressions.

2.0.0

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/sql/Dataset.html

Timestamp: 2020-10-19T01:56:20.931Z

source raw docstring

shape^clj

(shape dataframe)

Returns a vector representing the dimensionality of the Dataset.

Returns a vector representing the dimensionality of the Dataset.

source raw docstring

value-counts^clj

(value-counts dataframe)

Returns a Dataset containing counts of unique rows.

The resulting object will be in descending order so that the first element is the most frequently-occurring element.

Returns a Dataset containing counts of unique rows.

The resulting object will be in descending order so that the
first element is the most frequently-occurring element.

source raw docstring

cljdoc builds & hosts documentation for Clojure/Script libraries

Keyboard shortcuts

`Ctrl`+`k`	Jump to recent docs
`←`	Move to previous article
`→`	Move to next article
`Ctrl`+`/`	Jump to the search field

Raise an issue Browse cljdoc source Chat on Slack

× close