Clojure only.

zero-one.geni.core


!

(! expr)

Params: (e: Column)
Result: Column
Inversion of boolean expression, i.e. NOT.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.985Z
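A minimal usage sketch, assuming `zero-one.geni.core` is required as `g` and `df` is an existing dataset; the column name `:active` is hypothetical:

```clojure
(require '[zero-one.geni.core :as g])

;; Keep only the rows where the boolean column :active is false.
(-> df
    (g/filter (g/! :active)))
```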

%

(% left-expr right-expr)

&

(& left-expr right-expr)

&&

(&& & exprs)

*

(* & exprs)

**

(** base exponent)

Params: (l: Column, r: Column)
Result: Column
Returns the value of the first argument raised to the power of the second argument.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.009Z
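A sketch of the arithmetic operators above, assuming `g` aliases `zero-one.geni.core` and `df` has a numeric column `:x` (all column names here are hypothetical):

```clojure
(require '[zero-one.geni.core :as g])

(-> df
    ;; Power: x raised to 2.
    (g/with-column :x-squared (g/** :x 2))
    ;; Modulo: remainder of x divided by 3.
    (g/with-column :remainder (g/% :x 3)))
```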

+

(+ & exprs)

-

(- & exprs)

->col-array

(->col-array args)

->column (multimethod)

->dataset (multimethod)

->date-col

(->date-col expr)
(->date-col expr date-format)

Params: (e: Column)
Result: Column
Converts the column into DateType by casting rules to DateType.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.115Z
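A sketch of the two-arity form, which takes an explicit date format. Assumes `g` aliases `zero-one.geni.core`; the column `:dob` and its format are hypothetical:

```clojure
(require '[zero-one.geni.core :as g])

;; Parse the string column :dob into a DateType column.
(-> df
    (g/with-column :dob-date (g/->date-col :dob "dd/MM/yyyy")))
```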

->debug-string

->kebab-columns

(->kebab-columns dataset)

->string

->timestamp-col

(->timestamp-col expr)
(->timestamp-col expr date-format)

Params: (s: Column)
Result: Column
Converts to a timestamp by casting rules to TimestampType.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.123Z

->utc-timestamp

(->utc-timestamp expr)

Params: (ts: Column, tz: String)
Result: Column
Given a timestamp like '2017-07-14 02:40:00.0', interprets it as a time in the given time zone, and renders that time as a timestamp in UTC. For example, 'GMT+1' would yield '2017-07-14 01:40:00.0'.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.125Z

/

(/ & exprs)

<

<=

<=>

=

(= l-expr r-expr)

=!=

===

>

>=

abs

(abs expr)

Params: (e: Column)
Result: Column
Computes the absolute value of a numeric value.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.577Z

acos

(acos expr)

Params: (e: Column)
Result: Column
Inverse cosine of e in radians, as if computed by java.lang.Math.acos.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.580Z

add

(add cms item)
(add cms item cnt)

add-months

(add-months expr months)

Params: (startDate: Column, numMonths: Int)
Result: Column
Returns the date that is numMonths after startDate.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.583Z
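A sketch of shifting a date column forward, assuming `g` aliases `zero-one.geni.core`; the column names are hypothetical:

```clojure
(require '[zero-one.geni.core :as g])

;; Renewal date is three months after the start date.
(-> df
    (g/with-column :renewal-date (g/add-months :start-date 3)))
```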

agg

(agg dataframe & args)

agg-all

(agg-all dataframe agg-fn)

aggregate

(aggregate expr init merge-fn)
(aggregate expr init merge-fn finish-fn)

Params: (expr: Column, initialValue: Column, merge: (Column, Column) ⇒ Column, finish: (Column) ⇒ Column)
Result: Column
Applies a binary operator to an initial state and all elements in the array, and reduces this to a single state. The final state is converted into the final result by applying a finish function.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.587Z
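A sketch of folding over an array column. This assumes `g` aliases `zero-one.geni.core`, that `:xs` is an existing array-of-numbers column, and that a Geni column function such as `g/+` can serve as the merge step; treat that last point as an assumption rather than a documented guarantee:

```clojure
(require '[zero-one.geni.core :as g])

;; Sum the elements of the array column :xs, starting from literal 0.
(-> df
    (g/with-column :xs-sum (g/aggregate :xs (g/lit 0) g/+)))
```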

alias (multimethod)

app-name

(app-name)
(app-name spark)

approx-count-distinct

(approx-count-distinct expr)
(approx-count-distinct expr rsd)

Params: (e: Column)
Result: Column
(Since version 2.1.0) Use approx_count_distinct
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.231Z

approx-quantile

(approx-quantile dataframe col-or-cols probs rel-error)

array

(array & exprs)

Params: (cols: Column*)
Result: Column
Creates a new array column. The input columns must all have the same data type.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.594Z

array-contains

(array-contains expr value)

Params: (column: Column, value: Any)
Result: Column
Returns null if the array is null, true if the array contains value, and false otherwise.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.595Z
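A sketch combining `array` and `array-contains`, assuming `g` aliases `zero-one.geni.core` and `df` has numeric columns `:a` and `:b` (hypothetical names):

```clojure
(require '[zero-one.geni.core :as g])

(-> df
    ;; Pack two same-typed columns into an array column.
    (g/with-column :pair (g/array :a :b))
    ;; Boolean column: does the array contain 0?
    (g/with-column :has-zero (g/array-contains :pair 0)))
```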

array-distinct

(array-distinct expr)

Params: (e: Column)
Result: Column
Removes duplicate values from the array.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.596Z

array-except

(array-except left right)

Params: (col1: Column, col2: Column)
Result: Column
Returns an array of the elements in the first array but not in the second array, without duplicates. The order of elements in the result is not determined.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.597Z

array-intersect

(array-intersect left right)

Params: (col1: Column, col2: Column)
Result: Column
Returns an array of the elements in the intersection of the given two arrays, without duplicates.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.598Z

array-join

(array-join expr delimiter)
(array-join expr delimiter null-replacement)

Params: (column: Column, delimiter: String, nullReplacement: String)
Result: Column
Concatenates the elements of column using the delimiter. Null values are replaced with nullReplacement.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.601Z
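A sketch of the three-arity form, assuming `g` aliases `zero-one.geni.core` and `:tags` is an existing array-of-strings column:

```clojure
(require '[zero-one.geni.core :as g])

;; Join the array column :tags into one comma-separated string,
;; substituting "?" for any null elements.
(-> df
    (g/with-column :tags-csv (g/array-join :tags "," "?")))
```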

array-max

(array-max expr)

Params: (e: Column)
Result: Column
Returns the maximum value in the array.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.602Z

array-min

(array-min expr)

Params: (e: Column)
Result: Column
Returns the minimum value in the array.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.603Z

array-position

(array-position expr value)

Params: (column: Column, value: Any)
Result: Column
Locates the position of the first occurrence of the value in the given array as long. Returns null if either of the arguments are null.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.604Z

array-remove

(array-remove expr element)

Params: (column: Column, element: Any)
Result: Column
Removes all elements that are equal to element from the given array.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.606Z

array-repeat

(array-repeat left right)

Params: (left: Column, right: Column)
Result: Column
Creates an array containing the left argument repeated the number of times given by the right argument.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.608Z

array-sort

(array-sort expr)

Params: (e: Column)
Result: Column
Sorts the input array in ascending order. The elements of the input array must be orderable. Null elements will be placed at the end of the returned array.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.609Z

array-type

(array-type val-type nullable)

array-union

(array-union left right)

Params: (col1: Column, col2: Column)
Result: Column
Returns an array of the elements in the union of the given two arrays, without duplicates.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.610Z

arrays-overlap

(arrays-overlap left right)

Params: (a1: Column, a2: Column)
Result: Column
Returns true if a1 and a2 have at least one non-null element in common. If not and both the arrays are non-empty and any of them contains a null, it returns null. It returns false otherwise.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.616Z

arrays-zip

(arrays-zip & exprs)

Params: (e: Column*)
Result: Column
Returns a merged array of structs in which the N-th struct contains all N-th values of input arrays.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.617Z

as (multimethod)

asc

(asc expr)

asc-nulls-first

(asc-nulls-first expr)

asc-nulls-last

(asc-nulls-last expr)

ascii

(ascii expr)

Params: (e: Column)
Result: Column
Computes the numeric value of the first character of the string column, and returns the result as an int column.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.623Z

asin

(asin expr)

Params: (e: Column)
Result: Column
Inverse sine of e in radians, as if computed by java.lang.Math.asin.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.626Z

assoc (multimethod)

atan

(atan expr)

Params: (e: Column)
Result: Column
Inverse tangent of e, as if computed by java.lang.Math.atan.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.628Z

atan-2

(atan-2 expr-x expr-y)

Params: (y: Column, x: Column)
Result: Column
Coordinate on y-axis.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.642Z

atan2

(atan2 expr-x expr-y)

Params: (y: Column, x: Column)
Result: Column
Coordinate on y-axis.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.642Z

base-64

(base-64 expr)

Params: (e: Column)
Result: Column
Computes the BASE64 encoding of a binary column and returns it as a string column. This is the reverse of unbase64.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.646Z

base64

(base64 expr)

Params: (e: Column)
Result: Column
Computes the BASE64 encoding of a binary column and returns it as a string column. This is the reverse of unbase64.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.646Z

between

(between expr lower-bound upper-bound)
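A sketch of range filtering with `between`, assuming `g` aliases `zero-one.geni.core` and `:age` is a hypothetical numeric column:

```clojure
(require '[zero-one.geni.core :as g])

;; Keep rows whose :age falls in the inclusive range [18, 65].
(-> df
    (g/filter (g/between :age 18 65)))
```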

bin

(bin expr)

Params: (e: Column)
Result: Column
An expression that returns the string representation of the binary value of the given long column. For example, bin("12") returns "1100".
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.648Z

binary-files (multimethod)

bit-size

(bit-size bloom)

bitwise-and

bitwise-not

(bitwise-not expr)

Params: (e: Column)
Result: Column
Computes bitwise NOT (~) of a number.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.649Z

bitwise-or

bitwise-xor

(bitwise-xor left-expr right-expr)

bloom-filter

(bloom-filter dataframe expr expected-num-items num-bits-or-fpp)

boolean

(boolean expr)

broadcast

(broadcast dataframe)

Params: (df: Dataset[T])
Result: Dataset[T]
Marks a DataFrame as small enough for use in broadcast joins.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.650Z
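A sketch of hinting a broadcast join, assuming `g` aliases `zero-one.geni.core`, `lookup` is a hypothetical small dataset, and Geni's `join` accepts a shared column name in this position (an assumption on the join signature):

```clojure
(require '[zero-one.geni.core :as g])

;; Mark the small side as broadcastable so Spark can plan a
;; broadcast join instead of a shuffle join.
(g/join df (g/broadcast lookup) :id)
```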

bround

(bround expr)

Params: (e: Column)
Result: Column
Returns the value of the column e rounded to 0 decimal places with HALF_EVEN round mode.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.653Z

byte

(byte expr)

cache

(cache dataframe)

case

(case expr & clauses)

cast

(cast expr new-type)

cbrt

(cbrt expr)

Params: (e: Column)
Result: Column
Computes the cube-root of the given value.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.663Z

ceil

(ceil expr)

Params: (e: Column)
Result: Column
Computes the ceiling of the given value.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.665Z

checkpoint

(checkpoint dataframe)
(checkpoint dataframe eager)

checkpoint-dir

(checkpoint-dir)
(checkpoint-dir spark)

clip

(clip expr low high)

coalesce (multimethod)

col (multimethod)

col-regex

(col-regex dataframe col-name)

collect

(collect dataframe)

collect-col

(collect-col dataframe col-name)

collect-list

(collect-list expr)

Params: (e: Column)
Result: Column
Aggregate function: returns a list of objects with duplicates.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.680Z

collect-set

(collect-set expr)

Params: (e: Column)
Result: Column
Aggregate function: returns a set of objects with duplicate elements eliminated.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.682Z

collect-vals

(collect-vals dataframe)

column-names

(column-names dataframe)

columns

(columns dataframe)

compatible?

concat

(concat & exprs)

Params: (exprs: Column*)
Result: Column
Concatenates multiple input columns together into a single column. The function works with strings, binary and compatible array columns.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.686Z

concat-ws

(concat-ws sep & exprs)

Params: (sep: String, exprs: Column*)
Result: Column
Concatenates multiple input string columns together into a single string column, using the given separator.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.687Z
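A sketch of `concat-ws`, assuming `g` aliases `zero-one.geni.core`; the name columns are hypothetical. Unlike `concat`, Spark's `concat_ws` skips null inputs rather than nulling the whole result:

```clojure
(require '[zero-one.geni.core :as g])

;; Build a display name with a space separator.
(-> df
    (g/with-column :full-name (g/concat-ws " " :first-name :last-name)))
```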

cond

(cond & clauses)

condp

(condp pred expr & clauses)

conf

(conf)
(conf spark)

confidence

(confidence cms)

contains

(contains expr literal)

conv

(conv expr from-base to-base)

Params: (num: Column, fromBase: Int, toBase: Int)
Result: Column
Convert a number in a string column from one base to another.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.688Z

corr (multimethod)

cos

(cos expr)

Params: (e: Column)
Result: Column
Angle in radians.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.695Z

cosh

(cosh expr)

Params: (e: Column)
Result: Column
Hyperbolic angle.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.699Z

count (multimethod)

count-distinct

(count-distinct & exprs)

Params: (expr: Column, exprs: Column*)
Result: Column
Aggregate function: returns the number of distinct items in a group.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.706Z
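A sketch of `count-distinct` inside an aggregation, assuming `g` aliases `zero-one.geni.core`, the columns are hypothetical, and `agg` accepts a map of result-name to aggregate expression (the style used in Geni's README):

```clojure
(require '[zero-one.geni.core :as g])

;; Count distinct (country, city) pairs across the whole dataset.
(-> df
    (g/agg {:n-locations (g/count-distinct :country :city)})
    g/collect-vals)
```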

count-min-sketchclj

(count-min-sketch dataframe expr eps-or-depth confidence-or-width seed)
source

covclj

(cov dataframe col-name1 col-name2)
source

covarclj

(covar l-expr r-expr)

Params: (column1: Column, column2: Column) Result: Column Aggregate function: returns the sample covariance for two columns.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html Timestamp: 2020-10-02T14:21:40.714Z

Params: (column1: Column, column2: Column)
Result: Column
Aggregate function: returns the sample covariance for two columns.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.714Z
sourceraw docstring

covar-popclj

(covar-pop l-expr r-expr)

Params: (column1: Column, column2: Column) Result: Column Aggregate function: returns the population covariance for two columns.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html Timestamp: 2020-10-02T14:21:40.710Z

Params: (column1: Column, column2: Column)
Result: Column
Aggregate function: returns the population covariance for two columns.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.710Z
sourceraw docstring

covar-sampclj

(covar-samp l-expr r-expr)

Params: (column1: Column, column2: Column) Result: Column Aggregate function: returns the sample covariance for two columns.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html Timestamp: 2020-10-02T14:21:40.714Z

Params: (column1: Column, column2: Column)
Result: Column
Aggregate function: returns the sample covariance for two columns.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.714Z
sourceraw docstring

crc-32clj

(crc-32 expr)

Params: (e: Column) Result: Column Calculates the cyclic redundancy check value (CRC32) of a binary column and returns the value as a bigint.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html Timestamp: 2020-10-02T14:21:40.717Z

Params: (e: Column)
Result: Column
Calculates the cyclic redundancy check value  (CRC32) of a binary column and
returns the value as a bigint.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.717Z
sourceraw docstring

crc32clj

(crc32 expr)

Params: (e: Column) Result: Column Calculates the cyclic redundancy check value (CRC32) of a binary column and returns the value as a bigint.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html Timestamp: 2020-10-02T14:21:40.717Z

Params: (e: Column)
Result: Column
Calculates the cyclic redundancy check value  (CRC32) of a binary column and
returns the value as a bigint.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.717Z
sourceraw docstring

create-dataframeclj

(create-dataframe rows schema)
(create-dataframe spark rows schema)
source

create-spark-sessionclj

(create-spark-session
  {:keys [app-name master configs log-level checkpoint-dir]
   :or {app-name "Geni App" master "local[*]" configs {} log-level "WARN"}})
source
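A minimal usage sketch. All keys are optional, with the defaults shown in the signature; the :configs entry below (including whether its keys are keywords or strings) is illustrative rather than confirmed:

```clojure
(require '[zero-one.geni.core :as g])

(def spark
  (g/create-spark-session
    {:app-name  "demo"        ; defaults to "Geni App"
     :master    "local[2]"    ; defaults to "local[*]"
     :log-level "ERROR"       ; defaults to "WARN"
     ;; extra Spark conf entries; key/value shapes are an assumption
     :configs   {:spark.sql.shuffle.partitions "4"}}))
```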

cross-joinclj

(cross-join left right)
source

crosstabclj

(crosstab dataframe col-name1 col-name2)
source

cubeclj

(cube dataframe & exprs)
source

cube-rootclj

(cube-root expr)

Params: (e: Column)
Result: Column
Computes the cube-root of the given value.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.663Z
source

cume-distclj

(cume-dist)

Params: ()
Result: Column
Window function: returns the cumulative distribution of values within a window partition,
i.e. the fraction of rows that are below the current row.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.719Z
source

current-dateclj

(current-date)

Params: ()
Result: Column
Returns the current date as a date column.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.720Z
source

current-timestampclj

(current-timestamp)

Params: ()
Result: Column
Returns the current timestamp as a timestamp column.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.722Z
source

cutclj

(cut expr bins)
source

date-addclj

(date-add expr days)

Params: (start: Column, days: Int)
Result: Column
Returns the date that is days days after start

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.735Z
source

date-diffclj

(date-diff l-expr r-expr)

Params: (end: Column, start: Column)
Result: Column
Returns the number of days from start to end.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.747Z
source

date-formatclj

(date-format expr date-fmt)

Params: (dateExpr: Column, format: String)
Result: Column
Converts a date/timestamp/string to a value of string in the format specified by the date
format given by the second argument.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.739Z
source

date-subclj

(date-sub expr days)

Params: (start: Column, days: Int)
Result: Column
Returns the date that is days days before start

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.742Z
source
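Both date-add and date-sub take a column (or column name) and a literal day count. A minimal sketch, assuming a dataframe df with a :date column and Geni's g/with-column (not shown in this section):

```clojure
(require '[zero-one.geni.core :as g])

(-> df
    (g/with-column :next-week (g/date-add :date 7))   ; 7 days after :date
    (g/with-column :prev-week (g/date-sub :date 7)))  ; 7 days before :date
```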

date-truncclj

(date-trunc fmt expr)

Params: (format: String, timestamp: Column)
Result: Column
Returns timestamp truncated to the unit specified by the format.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.744Z
source

datediffclj

(datediff l-expr r-expr)

Params: (end: Column, start: Column)
Result: Column
Returns the number of days from start to end.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.747Z
source

day-of-monthclj

(day-of-month expr)

Params: (e: Column)
Result: Column
Extracts the day of the month as an integer from a given date/timestamp/string.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.749Z
source

day-of-weekclj

(day-of-week expr)

Params: (e: Column)
Result: Column
Extracts the day of the week as an integer from a given date/timestamp/string.
Ranges from 1 for a Sunday through to 7 for a Saturday
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.751Z
source
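Because the result runs from 1 (Sunday) through 7 (Saturday), a weekend flag falls out directly. A sketch, assuming a dataframe df with a :date column (g/with-column is assumed from Geni's dataset API; isin appears later in this namespace):

```clojure
(require '[zero-one.geni.core :as g])

(-> df
    (g/with-column :dow      (g/day-of-week :date))
    (g/with-column :weekend? (g/isin (g/day-of-week :date) [1 7])))  ; Sunday or Saturday
```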

day-of-yearclj

(day-of-year expr)

Params: (e: Column)
Result: Column
Extracts the day of the year as an integer from a given date/timestamp/string.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.752Z
source

dayofmonthclj

(dayofmonth expr)

Params: (e: Column)
Result: Column
Extracts the day of the month as an integer from a given date/timestamp/string.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.749Z
source

dayofweekclj

(dayofweek expr)

Params: (e: Column)
Result: Column
Extracts the day of the week as an integer from a given date/timestamp/string.
Ranges from 1 for a Sunday through to 7 for a Saturday
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.751Z
source

dayofyearclj

(dayofyear expr)

Params: (e: Column)
Result: Column
Extracts the day of the year as an integer from a given date/timestamp/string.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.752Z
source

decclj

(dec expr)
source

decodeclj

(decode expr charset)

Params: (value: Column, charset: String)
Result: Column
Computes the first argument into a string from a binary using the provided character set
(one of 'US-ASCII', 'ISO-8859-1', 'UTF-8', 'UTF-16BE', 'UTF-16LE', 'UTF-16').
If either argument is null, the result will also be null.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.756Z
source

default-min-partitionsclj

(default-min-partitions)
(default-min-partitions spark)
source

default-parallelismclj

(default-parallelism)
(default-parallelism spark)
source

degreesclj

(degrees expr)

Params: (e: Column)
Result: Column
Converts an angle measured in radians to an approximately equivalent angle measured in degrees.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.759Z
source

denseclj

(dense & values)
source

dense-rankclj

(dense-rank)

Params: ()
Result: Column
Window function: returns the rank of rows within a window partition, without any gaps.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.760Z
source

depthclj

(depth cms)
source

descclj

(desc expr)
source

desc-nulls-firstclj

(desc-nulls-first expr)
source

desc-nulls-lastclj

(desc-nulls-last expr)
source

describeclj

(describe dataframe & col-names)
source

disk-onlyclj

source

disk-only-2clj

source

dissoccljmultimethod

source

distinctclj

(distinct dataframe)
source

doubleclj

(double expr)
source

dropclj

(drop dataframe & col-names)
source

drop-duplicatesclj

(drop-duplicates dataframe & col-names)
source

drop-naclj

(drop-na dataframe)
(drop-na dataframe min-non-nulls-or-cols)
(drop-na dataframe min-non-nulls cols)
source
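The drop-na arities appear to mirror Spark's DataFrameNaFunctions.drop: no extra argument drops rows containing any null, an integer keeps rows with at least that many non-null values, and a vector of columns restricts the check. A sketch of the assumed semantics:

```clojure
(require '[zero-one.geni.core :as g])

(g/drop-na df)                 ; drop rows containing any null
(g/drop-na df 2)               ; keep rows with at least 2 non-null values
(g/drop-na df [:age :city])    ; only consider :age and :city
(g/drop-na df 1 [:age :city])  ; at least 1 non-null among :age and :city
```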

dtypesclj

(dtypes dataframe)
source

element-atclj

(element-at expr value)

Params: (column: Column, value: Any)
Result: Column
Returns element of array at given index in value if column is array. Returns value for
the given key in value if column is map.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.765Z
source
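Note that Spark's element_at uses 1-based indices for arrays, while map columns are looked up by key. A sketch, assuming a dataframe df with an array column :tags and a map column :names (g/with-column assumed from Geni's dataset API):

```clojure
(require '[zero-one.geni.core :as g])

(-> df
    (g/with-column :first-tag (g/element-at :tags 1))       ; 1-based, not 0-based
    (g/with-column :en-name   (g/element-at :names "en")))  ; lookup by map key
```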

empty?clj

source

encodeclj

(encode expr charset)

Params: (value: Column, charset: String)
Result: Column
Computes the first argument into a binary from a string using the provided character set
(one of 'US-ASCII', 'ISO-8859-1', 'UTF-8', 'UTF-16BE', 'UTF-16LE', 'UTF-16').
If either argument is null, the result will also be null.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.767Z
source
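encode and decode are inverses for a given character set. A round-trip sketch, assuming a dataframe df with a string column :text:

```clojure
(require '[zero-one.geni.core :as g])

(-> df
    (g/with-column :bin       (g/encode :text "UTF-8"))  ; string -> binary
    (g/with-column :roundtrip (g/decode :bin "UTF-8")))  ; binary -> string
```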

ends-withclj

(ends-with expr literal)
source

estimate-countclj

(estimate-count cms item)
source

even?clj

(even? expr)
source

exceptclj

(except dataframe other)
source

except-allclj

(except-all dataframe other)
source

existsclj

(exists expr predicate)

Params: (column: Column, f: (Column) ⇒ Column)
Result: Column
Returns whether a predicate holds for one or more elements in the array.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.770Z
source

expclj

(exp expr)

Params: (e: Column)
Result: Column
Computes the exponential of the given value.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.773Z
source

expected-fppclj

(expected-fpp bloom)
source

explaincljmultimethod

source

explodeclj

(explode expr)

Params: (e: Column)
Result: Column
Creates a new row for each element in the given array or map column.
Uses the default column name col for elements in the array and
key and value for elements in the map unless specified otherwise.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.774Z
source
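A sketch of flattening an array column into one row per element (g/select assumed from this namespace; df and its columns are illustrative):

```clojure
(require '[zero-one.geni.core :as g])

;; A row {:id 1 :tags ["a" "b"]} becomes two rows,
;; with the exploded element in the default "col" column.
(-> df
    (g/select :id (g/explode :tags)))
```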

expm-1clj

(expm-1 expr)

Params: (e: Column)
Result: Column
Computes the exponential of the given value minus one.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.777Z
source

expm1clj

(expm1 expr)

Params: (e: Column)
Result: Column
Computes the exponential of the given value minus one.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.777Z
source

exprclj

(expr s)

Params: (expr: String)
Result: Column
Parses the expression string into the column that it represents, similar to
Dataset#selectExpr.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.779Z
source

factorialclj

(factorial expr)

Params: (e: Column)
Result: Column
Computes the factorial of the given value.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.780Z
source

fill-naclj

(fill-na dataframe value)
(fill-na dataframe value cols)
source
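The fill-na arities appear to mirror Spark's DataFrameNaFunctions.fill: a value alone fills nulls in every column of a matching type; adding a column vector restricts the fill. A sketch of the assumed semantics:

```clojure
(require '[zero-one.geni.core :as g])

(g/fill-na df 0)                ; replace nulls with 0 wherever the type matches
(g/fill-na df 0 [:age :score])  ; only in :age and :score
```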

filtercljmultimethod

source

firstcljmultimethod

source

first-valsclj

(first-vals dataframe)
source

flattenclj

(flatten expr)

Params: (e: Column)
Result: Column
Creates a single array from an array of arrays. If a structure of nested arrays is deeper than
two levels, only one level of nesting is removed.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.796Z
source

floatclj

(float expr)
source

floorclj

(floor expr)

Params: (e: Column)
Result: Column
Computes the floor of the given value.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.798Z
source

forallclj

(forall expr predicate)

Params: (column: Column, f: (Column) ⇒ Column)
Result: Column
Returns whether a predicate holds for every element in the array.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.800Z
source

format-numberclj

(format-number expr decimal-places)

Params: (x: Column, d: Int)
Result: Column
Formats numeric column x to a format like '#,###,###.##', rounded to d decimal places
with HALF_EVEN round mode, and returns the result as a string column.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.802Z
source

format-stringclj

(format-string fmt & exprs)

Params: (format: String, arguments: Column*)
Result: Column
Formats the arguments in printf-style and returns the result as a string column.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.803Z
source
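A sketch of printf-style column formatting, assuming a dataframe df with :name and :id columns (g/with-column assumed from Geni's dataset API):

```clojure
(require '[zero-one.geni.core :as g])

;; e.g. a name "abc" and an id 7 format as "abc-007"
(-> df
    (g/with-column :label (g/format-string "%s-%03d" :name :id)))
```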

freq-itemsclj

(freq-items dataframe col-names)
(freq-items dataframe col-names support)
source

from-csvclj

(from-csv expr schema)
(from-csv expr schema options)

Params: (e: Column, schema: StructType, options: Map[String, String])
Result: Column
Parses a column containing a CSV string into a StructType with the specified schema.
Returns null in the case of an unparseable string.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.807Z
source

from-jsonclj

(from-json expr schema)
(from-json expr schema options)

Params: (e: Column, schema: StructType, options: Map[String, String])
Result: Column
(Scala-specific) Parses a column containing a JSON string into a StructType with the
specified schema. Returns null in the case of an unparseable string.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.826Z
source

from-unixtimeclj

(from-unixtime expr)
(from-unixtime expr fmt)

Params: (ut: Column)
Result: Column
Converts the number of seconds from unix epoch (1970-01-01 00:00:00 UTC) to a string
representing the timestamp of that moment in the current system time zone in the
yyyy-MM-dd HH:mm:ss format.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.830Z
source
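The one-argument form uses the default yyyy-MM-dd HH:mm:ss pattern; the second argument overrides it. A sketch, assuming a dataframe df with an :epoch column of seconds (g/with-column assumed from Geni's dataset API):

```clojure
(require '[zero-one.geni.core :as g])

(-> df
    (g/with-column :ts   (g/from-unixtime :epoch))                ; default pattern
    (g/with-column :date (g/from-unixtime :epoch "yyyy-MM-dd")))  ; custom pattern
```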

get-fieldclj

(get-field expr field-name)
source

get-itemclj

(get-item expr k)
source

greatestclj

(greatest & exprs)

Params: (exprs: Column*)
Result: Column
Returns the greatest value of the list of values, skipping null values.
This function takes at least 2 parameters. It will return null iff all parameters are null.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.839Z
source
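A sketch of a row-wise, null-skipping maximum across columns (df and its columns are illustrative; g/with-column assumed from Geni's dataset API):

```clojure
(require '[zero-one.geni.core :as g])

(-> df
    (g/with-column :best-quarter (g/greatest :q1 :q2 :q3)))
```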

group-byclj

(group-by dataframe & exprs)
source

groupingclj

(grouping expr)

Params: (e: Column)
Result: Column
Aggregate function: indicates whether a specified column in a GROUP BY list is aggregated
or not, returns 1 for aggregated or 0 for not aggregated in the result set.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.845Z
source

grouping-idclj

(grouping-id & exprs)

Params: (cols: Column*)
Result: Column
Aggregate function: returns the level of grouping, equals to
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.848Z
source

hashclj

(hash & exprs)

Params: (cols: Column*)
Result: Column
Calculates the hash code of given columns, and returns the result as an int column.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.849Z
source

hash-codeclj

(hash-code expr)
source

headclj

(head dataframe)
(head dataframe n-rows)
source

head-valsclj

(head-vals dataframe)
(head-vals dataframe n-rows)
source

hexclj

(hex expr)

Params: (column: Column)
Result: Column
Computes hex value of the given column.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.851Z
source

hintclj

(hint dataframe hint-name & args)
source

hourclj

(hour expr)

Params: (e: Column)
Result: Column
Extracts the hours as an integer from a given date/timestamp/string.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.852Z
source

hypotclj

(hypot left-expr right-expr)

Params: (l: Column, r: Column)
Result: Column
Computes sqrt(a^2 + b^2) without intermediate overflow or underflow.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.865Z
source

ifclj

source

incclj

(inc expr)
source

initcapclj

(initcap expr)

Params: (e: Column)
Result: Column
Returns a new string column by converting the first letter of each word to uppercase.
Words are delimited by whitespace.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.866Z
source

input-file-nameclj

(input-file-name)

Params: ()
Result: Column
Creates a string column for the file name of the current Spark task.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.867Z
source

input-filesclj

(input-files dataframe)
source

instrclj

(instr expr substr)

Params: (str: Column, substring: String)
Result: Column
Locate the position of the first occurrence of substr column in the given string.
Returns null if either of the arguments are null.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.869Z
source

intclj

(int expr)
source

interquartile-rangecljmultimethod

source

intersectclj

(intersect dataframe other)
source

intersect-allclj

(intersect-all dataframe other)
source

iqrcljmultimethod

source

is-compatibleclj

(is-compatible bloom other)
source

is-emptyclj

(is-empty dataframe)
source

is-in-collectionclj

(is-in-collection expr coll)
source

is-localclj

(is-local dataframe)
source

is-nanclj

(is-nan expr)
source

is-not-nullclj

(is-not-null expr)
source

is-nullclj

(is-null expr)
source

is-streamingclj

(is-streaming dataframe)
source

isinclj

(isin expr coll)
source

jarsclj

(jars)
(jars spark)
source

java-spark-contextclj

(java-spark-context spark)
source

joinclj

(join left right expr)
(join left right expr join-type)
source

join-withclj

(join-with left right condition)
(join-with left right condition join-type)
source

keysclj

source

kurtosisclj

(kurtosis expr)

Params: (e: Column)
Result: Column
Aggregate function: returns the kurtosis of the values in a group.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.894Z
source

lagclj

(lag expr offset)
(lag expr offset default)

Params: (e: Column, offset: Int) Result: Column Window function: returns the value that is offset rows before the current row, and null if there is less than offset rows before the current row. For example, an offset of one will return the previous row at any given point in the window partition. Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html Timestamp: 2020-10-02T14:21:40.900Z

Params: (e: Column, offset: Int)
Result: Column
Window function: returns the value that is offset rows before the current row, and
null if there is less than offset rows before the current row. For example,
an offset of one will return the previous row at any given point in the window partition.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.900Z
sourceraw docstring
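The offset semantics can be sketched in plain Python, independent of Spark (the function name and list representation are illustrative; the real `lag` operates on a Column over a window partition):

```python
def lag(values, offset, default=None):
    """Shift an ordered partition's values down by `offset` rows."""
    return [values[i - offset] if i - offset >= 0 else default
            for i in range(len(values))]

lag([10, 20, 30], 1)     # rows with no predecessor become the default (None)
lag([10, 20, 30], 1, 0)  # or an explicit default
```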

lastcljmultimethod

source

last-dayclj

(last-day expr)

Params: (e: Column)
Result: Column
Returns the last day of the month which the given date belongs to.
For example, input "2015-07-27" returns "2015-07-31" since July 31 is the last day of the
month in July 2015.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.918Z
sourceraw docstring
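The behavior can be reproduced on a single date with Python's standard library (a sketch of the semantics, not the Spark implementation):

```python
import calendar
from datetime import date

def last_day(d):
    # monthrange returns (weekday of the 1st, number of days in the month)
    return d.replace(day=calendar.monthrange(d.year, d.month)[1])

last_day(date(2015, 7, 27))  # the docstring's example: 2015-07-31
```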

last-valsclj

(last-vals dataframe)
source

leadclj

(lead expr offset)
(lead expr offset default)

Params: (columnName: String, offset: Int)
Result: Column
Window function: returns the value that is offset rows after the current row, and
null if there are fewer than offset rows after the current row. For example,
an offset of one will return the next row at any given point in the window partition.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.924Z
sourceraw docstring

leastclj

(least & exprs)

Params: (exprs: Column*)
Result: Column
Returns the least value of the list of values, skipping null values.
This function takes at least 2 parameters. It will return null iff all parameters are null.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.927Z
sourceraw docstring
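The null-skipping rule can be sketched in plain Python, using `None` for SQL null (illustrative only):

```python
def least(*vals):
    """Smallest non-null value; None iff all inputs are null."""
    non_null = [v for v in vals if v is not None]
    return min(non_null) if non_null else None
```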

lengthclj

(length expr)

Params: (e: Column)
Result: Column
Computes the character length of a given string or number of bytes of a binary string.
The length of character strings includes the trailing spaces. The length of binary strings
includes binary zeros.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.928Z
sourceraw docstring

levenshteinclj

(levenshtein left-expr right-expr)

Params: (l: Column, r: Column)
Result: Column
Computes the Levenshtein distance of the two given string columns.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.929Z
sourceraw docstring
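The distance being computed is the classic edit distance (insertions, deletions, substitutions); a minimal dynamic-programming sketch in plain Python:

```python
def levenshtein(a, b):
    """Minimum number of single-character edits turning a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,          # deletion
                           cur[j - 1] + 1,       # insertion
                           prev[j - 1] + (ca != cb)))  # substitution
        prev = cur
    return prev[-1]
```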

likeclj

(like expr literal)
source

limitclj

(limit dataframe n-rows)
source

litclj

(lit arg)
source

local?clj

source

locateclj

(locate substr expr)

Params: (substr: String, str: Column)
Result: Column
Locate the position of the first occurrence of substr.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.933Z
sourceraw docstring

logclj

(log expr)

Params: (e: Column)
Result: Column
Computes the natural logarithm of the given value.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.937Z
sourceraw docstring

log-10clj

(log-10 expr)

Params: (e: Column)
Result: Column
Computes the logarithm of the given value in base 10.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.939Z
sourceraw docstring

log-1pclj

(log-1p expr)

Params: (e: Column)
Result: Column
Computes the natural logarithm of the given value plus one.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.941Z
sourceraw docstring

log-2clj

(log-2 expr)

Params: (expr: Column)
Result: Column
Computes the logarithm of the given column in base 2.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.943Z
sourceraw docstring

log10clj

(log10 expr)

Params: (e: Column)
Result: Column
Computes the logarithm of the given value in base 10.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.939Z
sourceraw docstring

log1pclj

(log1p expr)

Params: (e: Column)
Result: Column
Computes the natural logarithm of the given value plus one.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.941Z
sourceraw docstring

log2clj

(log2 expr)

Params: (expr: Column)
Result: Column
Computes the logarithm of the given column in base 2.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.943Z
sourceraw docstring

longclj

(long expr)
source

lowerclj

(lower expr)

Params: (e: Column)
Result: Column
Converts a string column to lower case.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.944Z
sourceraw docstring

lpadclj

(lpad expr length pad)

Params: (str: Column, len: Int, pad: String)
Result: Column
Left-pad the string column with pad to a length of len. If the string column is longer
than len, the return value is shortened to len characters.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.946Z
sourceraw docstring
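The pad-or-truncate behavior can be sketched in plain Python (illustrative of the documented semantics, not the Spark code path):

```python
def lpad(s, length, pad):
    """Left-pad s with pad to `length` chars; truncate if already longer."""
    if len(s) >= length:
        return s[:length]
    return (pad * length)[:length - len(s)] + s
```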

ltrimclj

(ltrim expr)

Params: (e: Column)
Result: Column
Trims spaces from the left end of the specified string value.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.948Z
sourceraw docstring

mapclj

(map & exprs)

Params: (cols: Column*)
Result: Column
Creates a new map column. The input columns must be grouped as key-value pairs, e.g.
(key1, value1, key2, value2, ...). The key columns must all have the same data type, and can't
be null. The value columns must all have the same data type.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.949Z
sourceraw docstring

map->datasetclj

(map->dataset map-of-values)
(map->dataset spark map-of-values)
source

map-concatclj

(map-concat & exprs)

Params: (cols: Column*)
Result: Column
Returns the union of all the given maps.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.950Z
sourceraw docstring

map-entriesclj

(map-entries expr)

Params: (e: Column)
Result: Column
Returns an unordered array of all entries in the given map.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.951Z
sourceraw docstring

map-filterclj

(map-filter expr predicate)

Params: (expr: Column, f: (Column, Column) ⇒ Column)
Result: Column
Returns a map whose key-value pairs satisfy a predicate.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.953Z
sourceraw docstring

map-from-arraysclj

(map-from-arrays key-expr val-expr)

Params: (keys: Column, values: Column)
Result: Column
Creates a new map column. The array in the first column is used for keys. The array in the
second column is used for values. All elements in the array for key should not be null.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.958Z
sourceraw docstring
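The pairing of the two arrays is essentially a zip into a map; a plain-Python sketch of the documented semantics:

```python
def map_from_arrays(keys, values):
    """Build a map from parallel key and value arrays; keys must be non-null."""
    if any(k is None for k in keys):
        raise ValueError("keys must not contain null")
    return dict(zip(keys, values))
```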

map-from-entriesclj

(map-from-entries expr)

Params: (e: Column)
Result: Column
Returns a map created from the given array of entries.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.959Z
sourceraw docstring

map-keysclj

(map-keys expr)

Params: (e: Column)
Result: Column
Returns an unordered array containing the keys of the map.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.960Z
sourceraw docstring

map-typeclj

(map-type key-type val-type)
source

map-valuesclj

(map-values expr)

Params: (e: Column)
Result: Column
Returns an unordered array containing the values of the map.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.961Z
sourceraw docstring

map-zip-withclj

(map-zip-with left right merge-fn)

Params: (left: Column, right: Column, f: (Column, Column, Column) ⇒ Column)
Result: Column
Merge two given maps, key-wise into a single map using a function.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.963Z
sourceraw docstring
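The key-wise merge can be sketched in plain Python, assuming (as in Spark) that the merge function sees a null for a key absent from one side — `None` stands in for null here:

```python
def map_zip_with(left, right, f):
    """Merge two dicts over the union of their keys with f(key, lval, rval)."""
    return {k: f(k, left.get(k), right.get(k))
            for k in left.keys() | right.keys()}

merged = map_zip_with({"a": 1, "b": 2}, {"b": 10},
                      lambda k, l, r: (l or 0) + (r or 0))
```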

masterclj

(master)
(master spark)
source

maxcljmultimethod

source

md-5clj

(md-5 expr)

Params: (e: Column)
Result: Column
Calculates the MD5 digest of a binary column and returns the value
as a 32 character hex string.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.966Z
sourceraw docstring

md5clj

(md5 expr)

Params: (e: Column)
Result: Column
Calculates the MD5 digest of a binary column and returns the value
as a 32 character hex string.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.966Z
sourceraw docstring

meancljmultimethod

source

mediancljmultimethod

source

memory-and-diskclj

source

memory-and-disk-2clj

source

memory-and-disk-serclj

source

memory-and-disk-ser-2clj

source

memory-onlyclj

source

memory-only-2clj

source

memory-only-serclj

source

memory-only-ser-2clj

source

mergeclj

(merge expr & ms)
source

merge-in-placeclj

(merge-in-place bloom-or-cms other)
source

merge-withclj

source

might-containclj

(might-contain bloom item)
source

mincljmultimethod

source

minuteclj

(minute expr)

Params: (e: Column)
Result: Column
Extracts the minutes as an integer from a given date/timestamp/string.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.971Z
sourceraw docstring

modclj

source

monotonically-increasing-idclj

(monotonically-increasing-id)

Params: ()
Result: Column
A column expression that generates monotonically increasing 64-bit integers.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.233Z
sourceraw docstring

monthclj

(month expr)

Params: (e: Column)
Result: Column
Extracts the month as an integer from a given date/timestamp/string.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.974Z
sourceraw docstring

months-betweenclj

(months-between l-expr r-expr)

Params: (end: Column, start: Column)
Result: Column
Returns number of months between dates start and end.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.979Z
sourceraw docstring

name-value-seq->datasetclj

source

nan?clj

source

nanvlclj

(nanvl left-expr right-expr)

Params: (col1: Column, col2: Column)
Result: Column
Returns col1 if it is not NaN, or col2 if col1 is NaN.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.980Z
sourceraw docstring
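The null-vs-NaN distinction matters here: `nanvl` falls back only on NaN, not on null. A plain-Python sketch of that rule:

```python
import math

def nanvl(col1, col2):
    """Return col1 unless it is NaN, in which case return col2."""
    if isinstance(col1, float) and math.isnan(col1):
        return col2
    return col1
```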

neg?clj

(neg? expr)
source

negateclj

(negate expr)

Params: (e: Column)
Result: Column
Unary minus, i.e. negate the expression.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.982Z
sourceraw docstring

next-dayclj

(next-day expr day-of-week)

Params: (date: Column, dayOfWeek: String)
Result: Column
Returns the first date which is later than the value of the date column that is on the
specified day of the week.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.984Z
sourceraw docstring
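"First date strictly later than the input on the given weekday" is a small piece of modular arithmetic; a plain-Python sketch (the three-letter day abbreviations mirror one of the spellings Spark accepts):

```python
from datetime import date, timedelta

DAYS = {"Mon": 0, "Tue": 1, "Wed": 2, "Thu": 3, "Fri": 4, "Sat": 5, "Sun": 6}

def next_day(d, day_of_week):
    """First date after d (never d itself) falling on day_of_week."""
    delta = (DAYS[day_of_week] - d.weekday() - 1) % 7 + 1  # always 1..7
    return d + timedelta(days=delta)
```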

nlargestclj

(nlargest dataframe n-rows expr)
source

noneclj

source

notclj

(not expr)

Params: (e: Column)
Result: Column
Inversion of boolean expression, i.e. NOT.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.985Z
sourceraw docstring

not-null?clj

source

nsmallestclj

(nsmallest dataframe n-rows expr)
source

ntileclj

(ntile n)

Params: (n: Int)
Result: Column
Window function: returns the ntile group id (from 1 to n inclusive) in an ordered window
partition. For example, if n is 4, the first quarter of the rows will get value 1, the second
quarter will get 2, the third quarter will get 3, and the last quarter will get 4.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.988Z
sourceraw docstring
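When the partition size is not divisible by n, the earlier tiles absorb the extra rows. A plain-Python sketch that assigns tile ids to an already-ordered partition (illustrative, not the Spark implementation):

```python
def ntile(n, n_rows):
    """Tile id (1..n) for each row of an ordered partition of n_rows rows."""
    base, extra = divmod(n_rows, n)
    ids = []
    for tile in range(1, n + 1):
        size = base + (1 if tile <= extra else 0)  # first `extra` tiles get one more row
        ids.extend([tile] * size)
    return ids

ntile(4, 8)  # the docstring's example: quarters 1, 2, 3, 4
```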

null-countclj

(null-count expr)
source

null-rateclj

(null-rate expr)
source

null?clj

source

nuniqueclj

(nunique dataframe)
source

odd?clj

(odd? expr)
source

off-heapclj

source

order-byclj

(order-by dataframe & exprs)
source

overclj

(over column window-spec)
source

overlayclj

(overlay src rep pos)
(overlay src rep pos len)

Params: (src: Column, replace: Column, pos: Column, len: Column)
Result: Column
Overlay the specified portion of src with replace, starting from byte position pos of src
and proceeding for len bytes.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.991Z
sourceraw docstring

partitionsclj

(partitions dataframe)
source

percent-rankclj

(percent-rank)

Params: ()
Result: Column
Window function: returns the relative rank (i.e. percentile) of rows within a window partition.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.992Z
sourceraw docstring

persistclj

(persist dataframe)
(persist dataframe new-level)
source

piclj

The double value that is closer than any other to pi, the ratio of the circumference of a circle to its diameter.

sourceraw docstring

pivotclj

(pivot grouped expr)
(pivot grouped expr values)
source

pmodclj

(pmod left-expr right-expr)

Params: (dividend: Column, divisor: Column)
Result: Column
Returns the positive value of dividend mod divisor.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.994Z
sourceraw docstring
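The "positive modulus" construction can be shown in one line of Python (note that Python's `%` already yields a non-negative result for a positive divisor; the double-mod form below is the general construction for languages whose `%` follows the dividend's sign):

```python
def pmod(dividend, divisor):
    """Modulus that is always non-negative for a positive divisor."""
    return ((dividend % divisor) + divisor) % divisor
```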

pos?clj

(pos? expr)
source

posexplodeclj

(posexplode expr)

Params: (e: Column)
Result: Column
Creates a new row for each element with position in the given array or map column.
Uses the default column name pos for position, and col for elements in the array
and key and value for elements in the map unless specified otherwise.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.995Z
sourceraw docstring

posexplode-outerclj

(posexplode-outer expr)

Params: (e: Column)
Result: Column
Creates a new row for each element with position in the given array or map column.
Uses the default column name pos for position, and col for elements in the array
and key and value for elements in the map unless specified otherwise.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:40.995Z
sourceraw docstring

powclj

(pow base exponent)

Params: (l: Column, r: Column)
Result: Column
Returns the value of the first argument raised to the power of the second argument.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.009Z
sourceraw docstring

print-schemaclj

(print-schema dataframe)
source

putclj

(put bloom item)
source

qcutclj

(qcut expr num-buckets-or-probs)
source

quantilecljmultimethod

source

quarterclj

(quarter expr)

Params: (e: Column)
Result: Column
Extracts the quarter as an integer from a given date/timestamp/string.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.010Z
sourceraw docstring

radiansclj

(radians expr)

Params: (e: Column)
Result: Column
Converts an angle measured in degrees to an approximately equivalent angle measured in radians.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.013Z
sourceraw docstring

randclj

(rand)
(rand seed)

Params: (seed: Long)
Result: Column
Generate a random column with independent and identically distributed (i.i.d.) samples
uniformly distributed in [0.0, 1.0).

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.015Z
sourceraw docstring

rand-nthclj

(rand-nth dataframe)
source

randnclj

(randn)
(randn seed)

Params: (seed: Long)
Result: Column
Generate a column with independent and identically distributed (i.i.d.) samples from
the standard normal distribution.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.017Z
sourceraw docstring

random-choiceclj

(random-choice choices)
(random-choice choices probs)
(random-choice choices probs seed)
source

random-expclj

(random-exp)
(random-exp rate)
(random-exp rate seed)
source

random-intclj

(random-int)
(random-int low high)
(random-int low high seed)
source

random-normclj

(random-norm)
(random-norm mu sigma)
(random-norm mu sigma seed)
source

random-splitclj

(random-split dataframe weights)
(random-split dataframe weights seed)
source

random-uniformclj

(random-uniform)
(random-uniform low high)
(random-uniform low high seed)
source

rankclj

(rank)

Params: ()
Result: Column
Window function: returns the rank of rows within a window partition.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.018Z
sourceraw docstring

rchoiceclj

source

rddclj

(rdd dataframe)
source

read-avro!cljmultimethod

source

read-csv!cljmultimethod

source

read-edn!cljmultimethod

source

read-jdbc!clj

(read-jdbc! options)
(read-jdbc! spark options)
source

read-json!cljmultimethod

source

read-libsvm!cljmultimethod

source

read-parquet!cljmultimethod

source

read-text!cljmultimethod

source

read-xlsx!cljmultimethod

source

records->datasetclj

(records->dataset records)
(records->dataset spark records)
source

regexp-extractclj

(regexp-extract expr regex idx)

Params: (e: Column, exp: String, groupIdx: Int)
Result: Column
Extract a specific group matched by a Java regex, from the specified string column.
If the regex did not match, or the specified group did not match, an empty string is returned.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.019Z
sourceraw docstring
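The "empty string on no match" behavior can be sketched with Python's `re` module (Java and Python regex flavors differ in details, so this is illustrative of the semantics only):

```python
import re

def regexp_extract(s, pattern, group_idx):
    """Extract the given capture group, or "" if the regex or group did not match."""
    m = re.search(pattern, s)
    if m is None:
        return ""
    return m.group(group_idx) or ""

regexp_extract("100-200", r"(\d+)-(\d+)", 1)  # first capture group
```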

regexp-replaceclj

(regexp-replace expr pattern-expr replacement-expr)

Params: (e: Column, pattern: String, replacement: String)
Result: Column
Replace all substrings of the specified string value that match regexp with rep.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.022Z
source
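A hedged sketch of `regexp-replace`; since the geni arity names its arguments `pattern-expr` and `replacement-expr`, the pattern and replacement are passed as literal columns here via `g/lit` (the `:phone` column is illustrative):

```clojure
(require '[zero-one.geni.core :as g])

;; Mask every digit in a phone-number column.
(-> df
    (g/select (g/regexp-replace :phone (g/lit "\\d") (g/lit "*"))))
```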

relative-errorclj

(relative-error cms)
source

removeclj

(remove dataframe expr)
source

rename-columnsclj

(rename-columns dataframe rename-map)
source

rename-keysclj

(rename-keys expr kmap)
source

repartitionclj

(repartition dataframe & args)
source

repartition-by-rangeclj

(repartition-by-range dataframe & args)
source

replace-naclj

(replace-na dataframe cols replacement)
source

resourcesclj

(resources)
(resources spark)
source

reverseclj

(reverse expr)

Params: (e: Column)
Result: Column
Returns a reversed string or an array with reverse order of elements.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.024Z
source

rexpclj

source

rintclj

(rint expr)

Params: (e: Column)
Result: Column
Returns the double value that is closest in value to the argument and
is equal to a mathematical integer.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.026Z
source

rlikeclj

(rlike expr literal)
source

rnormclj

source

rollupclj

(rollup dataframe & exprs)
source

roundclj

(round expr)

Params: (e: Column)
Result: Column
Returns the value of the column e rounded to 0 decimal places with HALF_UP round mode.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.028Z
source
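HALF_UP rounding rounds ties away from zero; a sketch assuming an illustrative numeric `:price` column:

```clojure
(require '[zero-one.geni.core :as g])

;; HALF_UP to 0 decimal places: 2.5 -> 3.0 and -2.5 -> -3.0.
(-> df
    (g/select (g/round :price)))
```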

rowclj

(row & values)
source

row-numberclj

(row-number)

Params: ()
Result: Column
Window function: returns a sequential number starting at 1 within a window partition.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.029Z
source

rpadclj

(rpad expr length pad)

Params: (str: Column, len: Int, pad: String)
Result: Column
Right-pad the string column with pad to a length of len. If the string column is longer
than len, the return value is shortened to len characters.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.030Z
source

rtrimclj

(rtrim expr)

Params: (e: Column)
Result: Column
Trim the spaces from right end for the specified string value.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.032Z
source

runifclj

source

runiformclj

source

sampleclj

(sample dataframe fraction)
(sample dataframe fraction with-replacement)
source

sample-byclj

(sample-by dataframe expr fractions seed)
source

scclj

source

schema-of-csvclj

(schema-of-csv expr)
(schema-of-csv expr options)

Params: (csv: String)
Result: Column
Parses a CSV string and infers its schema in DDL format.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.036Z
source

schema-of-jsonclj

(schema-of-json expr)
(schema-of-json expr options)

Params: (json: String)
Result: Column
Parses a JSON string and infers its schema in DDL format.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.043Z
source

secondclj

(second expr)

Params: (e: Column)
Result: Column
Extracts the seconds as an integer from a given date/timestamp/string.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.045Z
source

selectclj

(select dataframe & exprs)
source

select-columnsclj

source

select-exprclj

(select-expr dataframe & exprs)
source

select-keysclj

(select-keys expr ks)
source

sequenceclj

(sequence start stop step)

Params: (start: Column, stop: Column, step: Column)
Result: Column
Generate a sequence of integers from start to stop, incrementing by step.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.047Z
source
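A sketch of `sequence` with literal bounds, wrapped in `g/lit` on the assumption that the parameters are column expressions:

```clojure
(require '[zero-one.geni.core :as g])

;; Produces the array [1 3 5 7 9] in every row
;; (the stop value is inclusive when the step lands on it).
(-> df
    (g/select (g/sequence (g/lit 1) (g/lit 10) (g/lit 2))))
```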

sha-1clj

(sha-1 expr)

Params: (e: Column)
Result: Column
Calculates the SHA-1 digest of a binary column and returns the value
as a 40 character hex string.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.048Z
source

sha-2clj

(sha-2 expr n-bits)

Params: (e: Column, numBits: Int)
Result: Column
Calculates the SHA-2 family of hash functions of a binary column and
returns the value as a hex string.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.049Z
source

sha1clj

(sha1 expr)

Params: (e: Column)
Result: Column
Calculates the SHA-1 digest of a binary column and returns the value
as a 40 character hex string.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.048Z
source

sha2clj

(sha2 expr n-bits)

Params: (e: Column, numBits: Int)
Result: Column
Calculates the SHA-2 family of hash functions of a binary column and
returns the value as a hex string.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.049Z
source

shapeclj

(shape dataframe)
source

shift-leftclj

(shift-left expr num-bits)

Params: (e: Column, numBits: Int)
Result: Column
Shift the given value numBits left. If the given value is a long value, this function
will return a long value else it will return an integer value.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.050Z
source

shift-rightclj

(shift-right expr num-bits)

Params: (e: Column, numBits: Int)
Result: Column
(Signed) shift the given value numBits right. If the given value is a long value, it will
return a long value else it will return an integer value.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.052Z
source

shift-right-unsignedclj

(shift-right-unsigned expr num-bits)

Params: (e: Column, numBits: Int)
Result: Column
Unsigned shift the given value numBits right. If the given value is a long value,
it will return a long value else it will return an integer value.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.053Z
source

shortclj

(short expr)
source

showclj

(show dataframe)
(show dataframe options)
source

show-verticalclj

(show-vertical dataframe)
(show-vertical dataframe options)
source

shufflecljmultimethod

source

signumclj

(signum expr)

Params: (e: Column)
Result: Column
Computes the signum of the given value.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.056Z
source

sinclj

(sin expr)

Params: (e: Column)
Result: Column
angle in radians
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.058Z
source

sinhclj

(sinh expr)

Params: (e: Column)
Result: Column
hyperbolic angle
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.060Z
source

sizeclj

(size expr)

Params: (e: Column)
Result: Column
Returns length of array or map.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.062Z
source

skewnessclj

(skewness expr)

Params: (e: Column)
Result: Column
Aggregate function: returns the skewness of the values in a group.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.064Z
source

sliceclj

(slice expr start length)

Params: (x: Column, start: Int, length: Int)
Result: Column
Returns an array containing all the elements in x from index start (or starting from the
end if start is negative) with the specified length.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.065Z
source

sortclj

source

sort-arrayclj

(sort-array expr)
(sort-array expr asc)

Params: (e: Column)
Result: Column
Sorts the input array for the given column in ascending order,
according to the natural ordering of the array elements.
Null elements will be placed at the beginning of the returned array.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.068Z
source
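A sketch of both arities, assuming an illustrative array column `:scores`:

```clojure
(require '[zero-one.geni.core :as g])

;; Ascending by default; pass false for descending order.
(-> df
    (g/select (g/sort-array :scores)
              (g/sort-array :scores false)))
```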

sort-within-partitionsclj

(sort-within-partitions dataframe & exprs)
source

soundexclj

(soundex expr)

Params: (e: Column)
Result: Column
Returns the soundex code for the specified expression.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.069Z
source

spark-confclj

(spark-conf spark-session)
source

spark-contextclj

(spark-context)
(spark-context spark)
source

spark-homeclj

(spark-home)
(spark-home spark)
source

spark-partition-idclj

(spark-partition-id)

Params: ()
Result: Column
Partition ID.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.070Z
source

spark-sessionclj

(spark-session dataframe)
source

sparseclj

source

splitclj

(split expr pattern)

Params: (str: Column, pattern: String)
Result: Column
Splits str around matches of the given pattern.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.073Z
source
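Because the pattern is a Java regex, characters such as `.` or `|` must be escaped; a sketch assuming an illustrative `:csv-line` string column:

```clojure
(require '[zero-one.geni.core :as g])

;; "a,b,c" becomes the array ["a" "b" "c"].
(-> df
    (g/select (g/split :csv-line ",")))
```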

sql-contextclj

(sql-context dataframe)
source

sqrclj

(sqr expr)

Returns the value of the first argument raised to the power of two.
source

sqrtclj

(sqrt expr)

Params: (e: Column)
Result: Column
Computes the square root of the specified float value.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.075Z
source

starts-withclj

(starts-with expr literal)
source

stdclj

(std expr)

Params: (e: Column)
Result: Column
Aggregate function: alias for stddev_samp.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.077Z
source

stddevclj

(stddev expr)

Params: (e: Column)
Result: Column
Aggregate function: alias for stddev_samp.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.077Z
source

stddev-popclj

(stddev-pop expr)

Params: (e: Column)
Result: Column
Aggregate function: returns the population standard deviation of
the expression in a group.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.091Z
source

stddev-sampclj

(stddev-samp expr)

Params: (e: Column)
Result: Column
Aggregate function: alias for stddev_samp.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.077Z
source

storage-levelclj

(storage-level dataframe)
source

strclj

(str expr)
source

streaming?clj

source

structclj

(struct & exprs)

Params: (cols: Column*)
Result: Column
Creates a new struct column.
If the input column is a column in a DataFrame, or a derived column expression
that is named (i.e. aliased), its name would be retained as the StructField's name,
otherwise, the newly generated StructField's name would be auto generated as
col with a suffix index + 1, i.e. col1, col2, col3, ...

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.096Z
source

struct-fieldclj

(struct-field col-name data-type nullable)
source

struct-typeclj

(struct-type & fields)
source

substringclj

(substring expr pos len)

Params: (str: Column, pos: Int, len: Int)
Result: Column
Substring starts at pos and is of length len when str is String type or
returns the slice of byte array that starts at pos in byte and is of length len
when str is Binary type

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.097Z
source

substring-indexclj

(substring-index expr delim cnt)

Params: (str: Column, delim: String, count: Int)
Result: Column
Returns the substring from string str before count occurrences of the delimiter delim.
If count is positive, everything to the left of the final delimiter (counting from the left) is
returned. If count is negative, everything to the right of the final delimiter (counting from the
right) is returned. substring_index performs a case-sensitive match when searching for delim.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.098Z
source
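The sign of the count selects which side of the delimiter is kept; a sketch assuming an illustrative `:host` column:

```clojure
(require '[zero-one.geni.core :as g])

;; For "www.apache.org": count 2 keeps "www.apache",
;; count -2 keeps "apache.org".
(-> df
    (g/select (g/substring-index :host "." 2)
              (g/substring-index :host "." -2)))
```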

sumcljmultimethod

source

sum-distinctclj

(sum-distinct expr)

Params: (e: Column)
Result: Column
Aggregate function: returns the sum of distinct values in the expression.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.103Z
source

summaryclj

(summary dataframe & stat-names)
source

table->datasetclj

(table->dataset table col-names)
(table->dataset spark table col-names)
source

tailclj

(tail dataframe n-rows)
source

tail-valsclj

(tail-vals dataframe n-rows)
source

takeclj

(take dataframe n-rows)
source

take-valsclj

(take-vals dataframe n-rows)
source

tanclj

(tan expr)

Params: (e: Column)
Result: Column
angle in radians
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.107Z
source

tanhclj

(tanh expr)

Params: (e: Column)
Result: Column
hyperbolic angle
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.109Z
source

time-windowclj

(time-window time-expr duration)
(time-window time-expr duration slide)
(time-window time-expr duration slide start)

Params: (timeColumn: Column, windowDuration: String, slideDuration: String, startTime: String)
Result: Column
Bucketize rows into one or more time windows given a timestamp specifying column. Window
starts are inclusive but the window ends are exclusive, e.g. 12:05 will be in the window
[12:05,12:10) but not in [12:00,12:05). Windows can support microsecond precision. Windows in
the order of months are not supported. The following example takes the average stock price for
a one minute window every 10 seconds starting 5 seconds after the hour:
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.220Z
source
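A sketch of a tumbling-window aggregation; `g/group-by`, `g/agg`, `g/mean`, and the column names are assumptions about the surrounding geni API rather than verified calls:

```clojure
(require '[zero-one.geni.core :as g])

;; Average price per 10-minute event-time window.
(-> df
    (g/group-by (g/time-window :event-time "10 minutes"))
    (g/agg (g/mean :price)))
```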

to-byte-arrayclj

(to-byte-array cms)
source

to-csvclj

(to-csv expr)
(to-csv expr options)

Params: (e: Column, options: Map[String, String])
Result: Column
(Java-specific) Converts a column containing a StructType into a CSV string with
the specified schema. Throws an exception, in the case of an unsupported type.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.112Z
source

to-dateclj

(to-date expr)
(to-date expr date-format)

Params: (e: Column)
Result: Column
Converts the column into DateType by casting rules to DateType.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.115Z
source
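A sketch of both arities; the column name and format string are illustrative:

```clojure
(require '[zero-one.geni.core :as g])

;; First form casts by the default rules; the second
;; parses with an explicit Java date-time format.
(-> df
    (g/select (g/to-date :created-at)
              (g/to-date :created-at "dd/MM/yyyy")))
```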

to-debug-stringclj

source

to-dfcljmultimethod

source

to-jsoncljmultimethod

source

to-stringclj

source

to-timestampclj

(to-timestamp expr)
(to-timestamp expr date-format)

Params: (s: Column)
Result: Column
Converts to a timestamp by casting rules to TimestampType.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.123Z
source

to-utc-timestampclj

(to-utc-timestamp expr)

Params: (ts: Column, tz: String)
Result: Column
Given a timestamp like '2017-07-14 02:40:00.0', interprets it as a time in the given time
zone, and renders that time as a timestamp in UTC. For example, 'GMT+1' would yield
'2017-07-14 01:40:00.0'.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.125Z
source

total-countclj

(total-count cms)
source

transformclj

(transform expr xform-fn)

Params: (column: Column, f: (Column) ⇒ Column)
Result: Column
Returns an array of elements after applying a transformation to each element
in the input array.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.128Z
source
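A sketch assuming `xform-fn` may be an ordinary Clojure function of one Column returning a Column (column name illustrative):

```clojure
(require '[zero-one.geni.core :as g])

;; Add one to every element of an array column.
(-> df
    (g/select (g/transform :xs (fn [x] (g/+ x 1)))))
```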

transform-keysclj

(transform-keys expr key-fn)

Params: (expr: Column, f: (Column, Column) ⇒ Column)
Result: Column
Applies a function to every key-value pair in a map and returns
a map with the results of those applications as the new keys for the pairs.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.130Z
source

transform-valuesclj

(transform-values expr key-fn)

Params: (expr: Column, f: (Column, Column) ⇒ Column)
Result: Column
Applies a function to every key-value pair in a map and returns
a map with the results of those applications as the new values for the pairs.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.138Z
source

translateclj

(translate expr match replacement)

Params: (src: Column, matchingString: String, replaceString: String)
Result: Column
Translate any character in the src by a character in replaceString.
The characters in replaceString correspond to the characters in matchingString.
The translate will happen when any character in the string matches the character
in the matchingString.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.139Z
sourceraw docstring
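A sketch of the character-for-character translation (hypothetical dataframe `df` with a string column `:word`, namespace aliased as `g`):

```clojure
(require '[zero-one.geni.core :as g])

;; Every 'a' becomes '1' and every 'b' becomes '2' in :word.
(-> df
    (g/with-column :coded (g/translate :word "ab" "12"))
    g/show)
```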

trimclj

(trim expr trim-string)

Params: (e: Column, trimString: String)
Result: Column
Trim the specified character string from both ends for the specified string column.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.141Z
sourceraw docstring
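Note that the Geni wrapper takes an explicit trim string. A sketch (hypothetical `df` with a string column `:name`):

```clojure
(require '[zero-one.geni.core :as g])

;; Strip leading and trailing spaces from :name.
(-> df
    (g/with-column :trimmed (g/trim :name " "))
    g/show)
```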

unbase-64clj

(unbase-64 expr)

Params: (e: Column)
Result: Column
Decodes a BASE64 encoded string column and returns it as a binary column.
This is the reverse of base64.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.188Z
sourceraw docstring

unbase64clj

(unbase64 expr)

Params: (e: Column)
Result: Column
Decodes a BASE64 encoded string column and returns it as a binary column.
This is the reverse of base64.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.188Z
sourceraw docstring
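Since `unbase64` inverts `base64`, a round trip should recover the original bytes. A sketch (hypothetical `df` with a string column `:payload`):

```clojure
(require '[zero-one.geni.core :as g])

;; Encode to BASE64, then decode back to binary.
(-> df
    (g/with-column :encoded (g/base64 :payload))
    (g/with-column :decoded (g/unbase64 :encoded))
    g/show)
```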

unbounded-followingclj

source

unbounded-preceedingclj

source

unhexclj

(unhex expr)

Params: (column: Column)
Result: Column
Inverse of hex. Interprets each pair of characters as a hexadecimal number
and converts to the byte representation of number.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.189Z
sourceraw docstring

unionclj

(union & dataframes)
source

union-by-nameclj

(union-by-name & dataframes)
source
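Unlike `union`, which pairs columns by position, `union-by-name` aligns them by column name. A sketch (hypothetical dataframes `df-a` with columns `[:id :tag]` and `df-b` with `[:tag :id]`):

```clojure
(require '[zero-one.geni.core :as g])

;; Safe even though the two column orders differ.
(-> (g/union-by-name df-a df-b)
    g/show)
```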

unix-timestampclj

(unix-timestamp)
(unix-timestamp expr)
(unix-timestamp expr pattern)

Params: (), (s: Column), (s: Column, p: String)
Result: Column
With no argument, returns the current Unix timestamp (in seconds) as a long.
With a column argument, converts the given time string (in yyyy-MM-dd HH:mm:ss
format by default, or the supplied pattern) to a Unix timestamp in seconds.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.197Z
sourceraw docstring
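A sketch of the arities (hypothetical `df` with a string column `:event-time`):

```clojure
(require '[zero-one.geni.core :as g])

(-> df
    (g/with-column :now (g/unix-timestamp))                          ;; current epoch seconds
    (g/with-column :ts (g/unix-timestamp :event-time "yyyy-MM-dd"))  ;; parse with a pattern
    g/show)
```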

unpersistclj

(unpersist dataframe)
(unpersist dataframe blocking)
source

updatecljmultimethod

source

upperclj

(upper expr)

Params: (e: Column)
Result: Column
Converts a string column to upper case.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.198Z
sourceraw docstring

valsclj

source

value-countsclj

(value-counts dataframe)
source

var-popclj

(var-pop expr)

Params: (e: Column)
Result: Column
Aggregate function: returns the population variance of the values in a group.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.200Z
sourceraw docstring

var-sampclj

(var-samp expr)

Params: (e: Column)
Result: Column
Aggregate function: returns the unbiased variance of the values in a group.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.204Z
sourceraw docstring

varianceclj

(variance expr)

Params: (e: Column)
Result: Column
Aggregate function: alias for var_samp.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.204Z
sourceraw docstring

versionclj

(version)
(version spark)
source

week-of-yearclj

(week-of-year expr)

Params: (e: Column)
Result: Column
Extracts the week number as an integer from a given date/timestamp/string.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.209Z
sourceraw docstring

weekofyearclj

(weekofyear expr)

Params: (e: Column)
Result: Column
Extracts the week number as an integer from a given date/timestamp/string.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.209Z
sourceraw docstring

whenclj

(when condition if-expr)
(when condition if-expr else-expr)

Params: (condition: Column, value: Any)
Result: Column
Evaluates a list of conditions and returns one of multiple possible result expressions.
If otherwise is not defined at the end, null is returned for unmatched conditions.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.211Z
sourceraw docstring
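A sketch of conditional labelling — the three-argument form supplies the `otherwise` branch (hypothetical `df` with a numeric column `:age`):

```clojure
(require '[zero-one.geni.core :as g])

;; "minor" when :age < 18, "senior" when :age >= 65, "adult" otherwise.
(-> df
    (g/with-column :group
      (g/when (g/< :age 18) "minor"
        (g/when (g/>= :age 65) "senior" "adult")))
    g/show)
```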

wherecljmultimethod

source

widthclj

(width cms)
source

windowclj

(window {:keys [partition-by order-by range-between rows-between]})
source
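A sketch of building a window spec and applying a window function over it (hypothetical `df` with `:dept` and `:salary` columns; assumes `g/over` and `g/row-number`, also in this namespace):

```clojure
(require '[zero-one.geni.core :as g])

;; Rank rows within each department by descending salary.
(-> df
    (g/with-column :rank
      (g/over (g/row-number)
              (g/window {:partition-by :dept :order-by (g/desc :salary)})))
    g/show)
```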

windowedclj

(windowed options)
source

with-columnclj

(with-column dataframe col-name expr)
source
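A sketch — `with-column` returns a new dataframe with the named column added (or replaced if the name already exists):

```clojure
(require '[zero-one.geni.core :as g])

;; Hypothetical df with :weight and :height columns.
(-> df
    (g/with-column :bmi (g// :weight (g/* :height :height)))
    g/show)
```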

with-column-renamedclj

(with-column-renamed dataframe old-name new-name)
source

write-avro!clj

(write-avro! dataframe path)
(write-avro! dataframe path options)
source

write-csv!clj

(write-csv! dataframe path)
(write-csv! dataframe path options)
source
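A sketch; the option keys shown are assumptions based on Spark's CSV DataFrameWriter options, passed through the options map:

```clojure
(require '[zero-one.geni.core :as g])

;; Write df as CSV with a header row, replacing any existing output.
;; Option keys :header and :mode are assumed from Spark's writer options.
(g/write-csv! df "target/out-csv" {:header true :mode "overwrite"})
```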

write-edn!clj

(write-edn! dataframe path)
(write-edn! dataframe path options)
source

write-jdbc!clj

(write-jdbc! dataframe options)
source

write-json!clj

(write-json! dataframe path)
(write-json! dataframe path options)
source

write-libsvm!clj

(write-libsvm! dataframe path)
(write-libsvm! dataframe path options)
source

write-parquet!clj

(write-parquet! dataframe path)
(write-parquet! dataframe path options)
source

write-text!clj

(write-text! dataframe path)
(write-text! dataframe path options)
source

write-xlsx!clj

(write-xlsx! dataframe path)
(write-xlsx! dataframe path options)
source

xxhash-64clj

(xxhash-64 & exprs)

Params: (cols: Column*)
Result: Column
Calculates the hash code of given columns using the 64-bit
variant of the xxHash algorithm, and returns the result as a long
column.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.222Z
sourceraw docstring
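A sketch — a deterministic 64-bit hash over several columns, useful for bucketing or change detection (hypothetical `df` with `:id` and `:name` columns):

```clojure
(require '[zero-one.geni.core :as g])

;; One long-typed hash per row, computed from both columns.
(-> df
    (g/with-column :row-hash (g/xxhash-64 :id :name))
    g/show)
```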

xxhash64clj

(xxhash64 & exprs)

Params: (cols: Column*)
Result: Column
Calculates the hash code of given columns using the 64-bit
variant of the xxHash algorithm, and returns the result as a long
column.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.222Z
sourceraw docstring

yearclj

(year expr)

Params: (e: Column)
Result: Column
Extracts the year as an integer from a given date/timestamp/string.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.223Z
sourceraw docstring

zero?clj

(zero? expr)
source

zip-withclj

(zip-with left right merge-fn)

Params: (left: Column, right: Column, f: (Column, Column) ⇒ Column)
Result: Column
Merge two given arrays, element-wise, into a single array using a function.
If one array is shorter, nulls are appended at the end to match the length of the longer
array, before applying the function.
Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html
Timestamp: 2020-10-02T14:21:41.226Z
sourceraw docstring
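A sketch of element-wise merging (hypothetical `df` with array columns `:xs` and `:ys`):

```clojure
(require '[zero-one.geni.core :as g])

;; Element-wise sums; the shorter array is null-padded first.
(-> df
    (g/with-column :sums (g/zip-with :xs :ys (fn [x y] (g/+ x y))))
    g/show)
```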

zipmapclj

source

|clj

(| left-expr right-expr)
source

||clj

(|| & exprs)
source

cljdoc is a website building & hosting documentation for Clojure/Script libraries
