
zero-one.geni.rdd


aggregate (clj)

(aggregate rdd zero seq-op comb-op)

Params: (zeroValue: U)

(seqOp: Function2[U, T, U], combOp: Function2[U, U, U])

Result: U

Aggregate the elements of each partition, and then the results for all the partitions, using given combine functions and a neutral "zero value". This function can return a different result type, U, than the type of this RDD, T. Thus, we need one operation for merging a T into an U and one operation for merging two U's, as in scala.TraversableOnce. Both of these functions are allowed to modify and return their first argument instead of creating a new U to avoid memory allocation.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.803Z

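A minimal sketch of calling aggregate from Geni, assuming the one-argument parallelise uses Geni's default SparkSession and that plain Clojure functions are accepted for seq-op and comb-op:

(require '[zero-one.geni.rdd :as rdd])

;; sum the elements: 0 is the neutral zero value, + is both the
;; per-partition seq-op and the across-partition comb-op
(def nums (rdd/parallelise [1 2 3 4 5]))
(rdd/aggregate nums 0 + +)
;; expected to return 15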

aggregate-by-key (clj)

(aggregate-by-key rdd zero seq-fn comb-fn)
(aggregate-by-key rdd zero num-partitions seq-fn comb-fn)

Params: (zeroValue: U, partitioner: Partitioner, seqFunc: Function2[U, V, U], combFunc: Function2[U, U, U])

Result: JavaPairRDD[K, U]

Aggregate the values of each key, using given combine functions and a neutral "zero value". This function can return a different result type, U, than the type of the values in this RDD, V. Thus, we need one operation for merging a V into a U and one operation for merging two U's, as in scala.TraversableOnce. The former operation is used for merging values within a partition, and the latter is used for merging values between partitions. To avoid memory allocation, both of these functions are allowed to modify and return their first argument instead of creating a new U.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.007Z

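A hedged sketch that sums the values per key, assuming the rdd alias from the aggregate example and that key-by (documented further down) can be used to build a pair RDD:

;; assumes (require '[zero-one.geni.rdd :as rdd])
(def pairs (rdd/key-by (rdd/parallelise [1 2 3 4 5 6]) even?))
;; zero value 0; seq-fn + merges a value into the accumulator,
;; comb-fn + merges two accumulators
(-> (rdd/aggregate-by-key pairs 0 + +)
    rdd/collect)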

app-name (clj)

(app-name)
(app-name spark)

Params:

Result: String

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.487Z


binary-files (clj multimethod)

Params: (path: String, minPartitions: Int)

Result: JavaPairRDD[String, PortableDataStream]

Read a directory of binary files from HDFS, a local file system (available on all nodes), or any Hadoop-supported file system URI as a byte array. Each file is read as a single record and returned in a key-value pair, where the key is the path of each file, the value is the content of each file.

minPartitions is a suggested value for the minimal number of splits of the input data; see the linked Spark documentation for a worked example of the resulting key-value pairs.

Small files are preferred; very large files may cause bad performance.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.492Z


broadcast (clj)

(broadcast value)
(broadcast spark value)

Params: (value: T)

Result: Broadcast[T]

Broadcast a read-only variable to the cluster, returning a org.apache.spark.broadcast.Broadcast object for reading it in distributed functions. The variable will be sent to each cluster only once.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.495Z

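A small sketch of broadcasting a lookup table and reading it inside a distributed function; .value is the standard accessor on org.apache.spark.broadcast.Broadcast, and the one-argument arity is assumed to use Geni's default SparkSession:

;; assumes (require '[zero-one.geni.rdd :as rdd])
(def lookup-table (rdd/broadcast {"a" 1 "b" 2}))
(-> (rdd/parallelise ["a" "b" "a"])
    (rdd/map (fn [k] (get (.value lookup-table) k)))
    rdd/collect)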

cache (clj)

(cache rdd)

Params: ()

Result: JavaRDD[T]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.805Z


cartesian (clj)

(cartesian)
(cartesian rdd)
(cartesian left right)
(cartesian left right & rdds)

Params: (other: JavaRDDLike[U, _])

Result: JavaPairRDD[T, U]

Return the Cartesian product of this RDD and another one, that is, the RDD of all pairs of elements (a, b) where a is in this and b is in other.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.807Z

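For example (a sketch, assuming the rdd alias used above), the Cartesian product of two two-element RDDs yields four pairs:

;; assumes (require '[zero-one.geni.rdd :as rdd])
(-> (rdd/cartesian (rdd/parallelise [1 2])
                   (rdd/parallelise [:a :b]))
    rdd/collect)
;; four (number, keyword) pairs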

checkpoint-dir (clj)

(checkpoint-dir)
(checkpoint-dir spark)

Params:

Result: Optional[String]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.509Z


checkpointed? (clj)

(checkpointed? rdd)

Params:

Result: Boolean

Return whether this RDD has been checkpointed or not

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.861Z


coalesce (clj)

(coalesce rdd num-partitions)
(coalesce rdd num-partitions shuffle)

Params: (numPartitions: Int)

Result: JavaRDD[T]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.812Z


cogroup (clj)

(cogroup this other1)
(cogroup this other1 other2)
(cogroup this other1 other2 other3)

Params: (other: JavaPairRDD[K, W], partitioner: Partitioner)

Result: JavaPairRDD[K, (Iterable[V], Iterable[W])]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.034Z


collect (clj)

(collect rdd)

Params: ()

Result: List[T]

Return an array that contains all of the elements in this RDD.

Note: this method should only be used if the resulting array is expected to be small, as all the data is loaded into the driver's memory.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.813Z

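A minimal end-to-end sketch, assuming the one-argument parallelise uses Geni's default SparkSession:

;; assumes (require '[zero-one.geni.rdd :as rdd])
(-> (rdd/parallelise [1 2 3 4 5])
    (rdd/map inc)
    rdd/collect)
;; brings every (incremented) element back to the driver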

collect-async (clj)

(collect-async rdd)

Params: ()

Result: JavaFutureAction[List[T]]

The asynchronous version of collect, which returns a future for retrieving an array containing all of the elements in this RDD.

Note: this method should only be used if the resulting array is expected to be small, as all the data is loaded into the driver's memory.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.814Z


collect-partitions (clj)

(collect-partitions rdd partition-ids)

Params: (partitionIds: Array[Int])

Result: Array[List[T]]

Return an array that contains all of the elements in a specific partition of this RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.816Z


combine-by-key (clj)

(combine-by-key rdd create-fn merge-value-fn merge-combiner-fn)
(combine-by-key rdd
                create-fn
                merge-value-fn
                merge-combiner-fn
                partitions-or-partitioner)

Params: (createCombiner: Function[V, C], mergeValue: Function2[C, V, C], mergeCombiners: Function2[C, C, C], partitioner: Partitioner, mapSideCombine: Boolean, serializer: Serializer)

Result: JavaPairRDD[K, C]

Generic function to combine the elements for each key using a custom set of aggregation functions. Turns a JavaPairRDD[(K, V)] into a result of type JavaPairRDD[(K, C)], for a "combined type" C.

Users provide three functions: createCombiner, which turns a V into a C (e.g. creates a one-element list); mergeValue, to merge a V into a C (e.g. adds it to the end of a list); and mergeCombiners, to combine two C's into a single one.

In addition, users can control the partitioning of the output RDD, the serializer that is used for the shuffle, and whether to perform map-side aggregation (if a mapper can produce multiple items with the same key).

V and C can be different -- for example, one might group an RDD of type (Int, Int) into an RDD of type (Int, List[Int]).

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.051Z

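A hedged sketch of the classic "group values into lists" use of combine-by-key, assuming a pair RDD built with key-by and that plain Clojure functions are accepted for the three combiners:

;; assumes (require '[zero-one.geni.rdd :as rdd])
(def pairs (rdd/key-by (rdd/parallelise (range 6)) even?))
(-> (rdd/combine-by-key pairs
                        vector   ;; create-fn: start a list from a single value
                        conj     ;; merge-value-fn: add a value to a list
                        into)    ;; merge-combiner-fn: merge two lists
    rdd/collect)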

conf (clj)

(conf)
(conf spark)

Params:

Result: SparkConf

Return a copy of this JavaSparkContext's configuration. The configuration cannot be changed at runtime.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.511Z


context (clj)

(context rdd)

Params:

Result: SparkContext

The org.apache.spark.SparkContext that this RDD was created on.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.817Z


count (clj)

(count rdd)

Params: ()

Result: Long

Return the number of elements in the RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.818Z


count-approx (clj)

(count-approx rdd timeout)
(count-approx rdd timeout confidence)

Params: (timeout: Long, confidence: Double)

Result: PartialResult[BoundedDouble]

Approximate version of count() that returns a potentially incomplete result within a timeout, even if not all tasks have finished.

The confidence is the probability that the error bounds of the result will contain the true value. That is, if countApprox were called repeatedly with confidence 0.9, we would expect 90% of the results to contain the true count. The confidence must be in the range [0,1] or an exception will be thrown.

timeout: maximum time to wait for the job, in milliseconds.

confidence: the desired statistical confidence in the result.

Returns a potentially incomplete result, with error bounds.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.820Z

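count-approx returns a PartialResult; final-value (documented further down) blocks for the final value. A sketch, assuming the rdd alias used above:

;; assumes (require '[zero-one.geni.rdd :as rdd])
(-> (rdd/parallelise (range 1000))
    (rdd/count-approx 100)   ;; wait at most 100 ms for a partial answer
    rdd/final-value)         ;; blocks until the full result is available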

count-approx-distinct (clj)

(count-approx-distinct rdd relative-sd)

Params: (relativeSD: Double)

Result: Long

Return approximate number of distinct elements in the RDD.

The algorithm used is based on streamlib's implementation of "HyperLogLog in Practice: Algorithmic Engineering of a State of The Art Cardinality Estimation Algorithm".

relativeSD: relative accuracy. Smaller values create counters that require more space. It must be greater than 0.000017.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.822Z


count-approx-distinct-by-key (clj)

(count-approx-distinct-by-key rdd relative-sd)
(count-approx-distinct-by-key rdd relative-sd partitions-or-partitioner)

Params: (relativeSD: Double, partitioner: Partitioner)

Result: JavaPairRDD[K, Long]

Return approximate number of distinct values for each key in this RDD.

The algorithm used is based on streamlib's implementation of "HyperLogLog in Practice: Algorithmic Engineering of a State of The Art Cardinality Estimation Algorithm".

relativeSD: relative accuracy. Smaller values create counters that require more space. It must be greater than 0.000017.

partitioner: partitioner of the resulting RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.061Z


count-async (clj)

(count-async rdd)

Params: ()

Result: JavaFutureAction[Long]

The asynchronous version of count, which returns a future for counting the number of elements in this RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.823Z


count-by-key (clj)

(count-by-key rdd)

Params: ()

Result: Map[K, Long]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.063Z


count-by-key-approx (clj)

(count-by-key-approx rdd timeout)
(count-by-key-approx rdd timeout confidence)

Params: (timeout: Long)

Result: PartialResult[Map[K, BoundedDouble]]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.065Z


count-by-value (clj)

(count-by-value rdd)

Params: ()

Result: Map[T, Long]

Return the count of each unique value in this RDD as a map of (value, count) pairs. The final combine step happens locally on the master, equivalent to running a single reduce task.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.824Z

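For instance (a sketch, assuming the rdd alias used above):

;; assumes (require '[zero-one.geni.rdd :as rdd])
(rdd/count-by-value (rdd/parallelise ["a" "b" "a" "c" "a"]))
;; a map from value to count, e.g. "a" -> 3, "b" -> 1, "c" -> 1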

default-min-partitions (clj)

(default-min-partitions)
(default-min-partitions spark)

Params:

Result: Integer

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.503Z


default-parallelism (clj)

(default-parallelism)
(default-parallelism spark)

Params:

Result: Integer

Default level of parallelism to use when not given by user (e.g. parallelize and makeRDD).

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.504Z


disk-only (clj)

Flag for controlling the storage of an RDD.

The DataFrame is stored only on disk, and CPU computation time is high because of the I/O involved.


disk-only-2 (clj)

Flag for controlling the storage of an RDD.

Same as disk-only storage level but replicate each partition to two cluster nodes.


distinct (clj)

(distinct rdd)
(distinct rdd num-partitions)

Params: ()

Result: JavaRDD[T]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.829Z


empty-rdd (clj)

(empty-rdd)
(empty-rdd spark)

Params:

Result: JavaRDD[T]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.505Z


empty? (clj)

(empty? rdd)

Params: ()

Result: Boolean

Returns true if and only if the RDD contains no elements at all. Note that an RDD may be empty even when it has at least 1 partition.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.862Z


filter (clj)

(filter rdd f)

Params: (f: Function[T, Boolean])

Result: JavaRDD[T]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.832Z

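A sketch, assuming a plain Clojure predicate is accepted for f:

;; assumes (require '[zero-one.geni.rdd :as rdd])
(-> (rdd/parallelise (range 10))
    (rdd/filter even?)
    rdd/collect)
;; keeps only the even numbers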

final-value (clj)

(final-value result)

Params: ()

Result: R

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/partial/PartialResult.html

Timestamp: 2020-10-19T01:56:47.226Z


final? (clj)

(final? result)

Params:

Result: Boolean

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/partial/PartialResult.html

Timestamp: 2020-10-19T01:56:47.229Z


first (clj)

(first rdd)

Params: ()

Result: T

Return the first element in this RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.839Z


flat-map (clj)

(flat-map rdd f)

Params: (f: FlatMapFunction[T, U])

Result: JavaRDD[U]

Return a new RDD by first applying a function to all elements of this RDD, and then flattening the results.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.840Z

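A word-splitting sketch, assuming the mapped function may return any Clojure sequence to be flattened:

;; assumes (require '[zero-one.geni.rdd :as rdd]
;;                  '[clojure.string :as string])
(-> (rdd/parallelise ["hello world" "foo bar"])
    (rdd/flat-map #(string/split % #" "))
    rdd/collect)
;; one element per word: "hello" "world" "foo" "bar"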

flat-map-to-pair (clj)

(flat-map-to-pair rdd f)

Params: (f: PairFlatMapFunction[T, K2, V2])

Result: JavaPairRDD[K2, V2]

Return a new RDD by first applying a function to all elements of this RDD, and then flattening the results.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.842Z


flat-map-values (clj)

(flat-map-values rdd f)

Params: (f: FlatMapFunction[V, U])

Result: JavaPairRDD[K, U]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.082Z


fold (clj)

(fold rdd zero f)

Params: (zeroValue: T)

(f: Function2[T, T, T])

Result: T

Aggregate the elements of each partition, and then the results for all the partitions, using a given associative function and a neutral "zero value". The function op(t1, t2) is allowed to modify t1 and return it as its result value to avoid object allocation; however, it should not modify t2.

This behaves somewhat differently from fold operations implemented for non-distributed collections in functional languages like Scala. This fold operation may be applied to partitions individually, and then fold those results into the final result, rather than apply the fold to each element sequentially in some defined ordering. For functions that are not commutative, the result may differ from that of a fold applied to a non-distributed collection.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.844Z

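For example (a sketch, assuming the rdd alias used above), folding with a zero value of 0 and + sums the elements:

;; assumes (require '[zero-one.geni.rdd :as rdd])
(rdd/fold (rdd/parallelise [1 2 3 4]) 0 +)
;; + is associative and commutative, so the per-partition folds
;; combine into the same result as a sequential fold: 10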

fold-by-key (clj)

(fold-by-key rdd zero f)
(fold-by-key rdd zero partitions-or-partitioner f)

Params: (zeroValue: V, partitioner: Partitioner, func: Function2[V, V, V])

Result: JavaPairRDD[K, V]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.088Z


foreach (clj)

(foreach rdd f)

Params: (f: VoidFunction[T])

Result: Unit

Applies a function f to all elements of this RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.845Z


foreach-async (clj)

(foreach-async rdd f)

Params: (f: VoidFunction[T])

Result: JavaFutureAction[Void]

The asynchronous version of the foreach action, which applies a function f to all the elements of this RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.846Z


foreach-partition (clj)

(foreach-partition rdd f)

Params: (f: VoidFunction[Iterator[T]])

Result: Unit

Applies a function f to each partition of this RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.847Z


foreach-partition-async (clj)

(foreach-partition-async rdd f)

Params: (f: VoidFunction[Iterator[T]])

Result: JavaFutureAction[Void]

The asynchronous version of the foreachPartition action, which applies a function f to each partition of this RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.849Z


full-outer-join (clj)

(full-outer-join left right)
(full-outer-join left right partitions-or-partitioner)

Params: (other: JavaPairRDD[K, W], partitioner: Partitioner)

Result: JavaPairRDD[K, (Optional[V], Optional[W])]

Perform a full outer join of this and other. For each element (k, v) in this, the resulting RDD will either contain all pairs (k, (Some(v), Some(w))) for w in other, or the pair (k, (Some(v), None)) if no elements in other have key k. Similarly, for each element (k, w) in other, the resulting RDD will either contain all pairs (k, (Some(v), Some(w))) for v in this, or the pair (k, (None, Some(w))) if no elements in this have key k. Uses the given Partitioner to partition the output RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.102Z


get-num-partitions (clj)

(get-num-partitions rdd)

Params:

Result: Int

Return the number of partitions in this RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.852Z


get-storage-level (clj)

(get-storage-level rdd)

Params:

Result: StorageLevel

Get the RDD's current storage level, or StorageLevel.NONE if none is set.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.853Z


glom (clj)

(glom rdd)

Params: ()

Result: JavaRDD[List[T]]

Return an RDD created by coalescing all elements within each partition into an array.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.854Z


group-by (clj)

(group-by rdd f)
(group-by rdd f num-partitions)

Params: (f: Function[T, U])

Result: JavaPairRDD[U, Iterable[T]]

Return an RDD of grouped elements. Each group consists of a key and a sequence of elements mapping to that key.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.857Z


group-by-key (clj)

(group-by-key rdd)
(group-by-key rdd num-partitions)

Params: (partitioner: Partitioner)

Result: JavaPairRDD[K, Iterable[V]]

Group the values for each key in the RDD into a single sequence. Allows controlling the partitioning of the resulting key-value pair RDD by passing a Partitioner.

If you are grouping in order to perform an aggregation (such as a sum or average) over each key, using JavaPairRDD.reduceByKey or JavaPairRDD.combineByKey will provide much better performance.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.115Z

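A sketch, assuming the pair RDD is built with key-by as in the earlier examples:

;; assumes (require '[zero-one.geni.rdd :as rdd])
(def pairs (rdd/key-by (rdd/parallelise (range 6)) even?))
(-> pairs
    rdd/group-by-key
    rdd/collect)
;; one entry per key (true/false), each with the grouped values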

id (clj)

(id rdd)

Params:

Result: Int

A unique ID for this RDD (within its SparkContext).

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.859Z


initial-value (clj)

(initial-value result)

Params:

Result: R

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/partial/PartialResult.html

Timestamp: 2020-10-19T01:56:47.228Z


intersection (clj)

(intersection)
(intersection rdd)
(intersection left right)
(intersection left right & rdds)

Params: (other: JavaRDD[T])

Result: JavaRDD[T]

Return the intersection of this RDD and another one. The output will not contain any duplicate elements, even if the input RDDs did.

This method performs a shuffle internally.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.860Z


is-checkpointed (clj)

(is-checkpointed rdd)

Params:

Result: Boolean

Return whether this RDD has been checkpointed or not

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.861Z


is-empty (clj)

(is-empty rdd)

Params: ()

Result: Boolean

Returns true if and only if the RDD contains no elements at all. Note that an RDD may be empty even when it has at least 1 partition.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.862Z


is-initial-value-final (clj)

(is-initial-value-final result)

Params:

Result: Boolean

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/partial/PartialResult.html

Timestamp: 2020-10-19T01:56:47.229Z


is-local (clj)

(is-local)
(is-local spark)

Params:

Result: Boolean

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.531Z


jars (clj)

(jars)
(jars spark)

Params:

Result: List[String]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.532Z


java-spark-context (clj)

(java-spark-context spark)

Converts a SparkSession to a JavaSparkContext.


join (clj)

(join left right)
(join left right partitions-or-partitioner)

Params: (other: JavaPairRDD[K, W], partitioner: Partitioner)

Result: JavaPairRDD[K, (V, W)]

Return an RDD containing all pairs of elements with matching keys in this and other. Each pair of elements will be returned as a (k, (v1, v2)) tuple, where (k, v1) is in this and (k, v2) is in other. Uses the given Partitioner to partition the output RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.131Z

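A sketch joining two pair RDDs keyed by their first character (key-by is documented just below; the rdd alias is assumed as above):

;; assumes (require '[zero-one.geni.rdd :as rdd])
(def left  (rdd/key-by (rdd/parallelise ["apple" "avocado" "banana"]) first))
(def right (rdd/key-by (rdd/parallelise ["apricot" "blueberry"]) first))
(-> (rdd/join left right)
    rdd/collect)
;; only keys present on both sides survive: \a and \b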

key-by (clj)

(key-by rdd f)

Params: (f: Function[T, U])

Result: JavaPairRDD[U, T]

Creates tuples of the elements in this RDD by applying f.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.865Z

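For instance (a sketch, assuming the rdd alias used above), keying strings by their length:

;; assumes (require '[zero-one.geni.rdd :as rdd])
(-> (rdd/parallelise ["a" "bb" "cc" "ddd"])
    (rdd/key-by count)
    rdd/collect)
;; pairs of (length, string), e.g. (2, "bb")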

keys (clj)

(keys rdd)

Params: ()

Result: JavaRDD[K]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.139Z


left-outer-join (clj)

(left-outer-join left right)
(left-outer-join left right partitions-or-partitioner)

Params: (other: JavaPairRDD[K, W], partitioner: Partitioner)

Result: JavaPairRDD[K, (V, Optional[W])]

Perform a left outer join of this and other. For each element (k, v) in this, the resulting RDD will either contain all pairs (k, (v, Some(w))) for w in other, or the pair (k, (v, None)) if no elements in other have key k. Uses the given Partitioner to partition the output RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.144Z


local-property (clj)

(local-property k)
(local-property spark k)

Params: (key: String)

Result: String

Get a local property set in this thread, or null if it is missing. See org.apache.spark.api.java.JavaSparkContext.setLocalProperty.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.512Z


local? (clj)

(local?)
(local? spark)

Params:

Result: Boolean

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.531Z


lookup (clj)

(lookup rdd k)

Params: (key: K)

Result: List[V]

Return the list of values in the RDD for key key. This operation is done efficiently if the RDD has a known partitioner by only searching the partition that the key maps to.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.145Z


map (clj multimethod)

Params: (f: Function[T, R])

Result: JavaRDD[R]

Return a new RDD by applying a function to all elements of this RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.867Z


map-partitions (clj)

(map-partitions rdd f)
(map-partitions rdd f preserves-partitioning)

Params: (f: FlatMapFunction[Iterator[T], U])

Result: JavaRDD[U]

Return a new RDD by applying a function to each partition of this RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.870Z

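A hedged sketch; this assumes each partition is handed to the Clojure function as a Java iterator (so iterator-seq is needed) and that a Clojure sequence may be returned:

;; assumes (require '[zero-one.geni.rdd :as rdd])
(-> (rdd/parallelise (range 10))
    (rdd/map-partitions
      (fn [part]
        ;; assumption: part is a java.util.Iterator over the partition
        (map inc (iterator-seq part))))
    rdd/collect)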

map-partitions-to-pair (clj)

(map-partitions-to-pair rdd f)
(map-partitions-to-pair rdd f preserves-partitioning)

Params: (f: PairFlatMapFunction[Iterator[T], K2, V2])

Result: JavaPairRDD[K2, V2]

Return a new RDD by applying a function to each partition of this RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.875Z


map-partitions-with-index (clj)

(map-partitions-with-index rdd f)
(map-partitions-with-index rdd f preserves-partitioning)

Params: (f: Function2[Integer, Iterator[T], Iterator[R]], preservesPartitioning: Boolean = false)

Result: JavaRDD[R]

Return a new RDD by applying a function to each partition of this RDD, while tracking the index of the original partition.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.877Z


map-to-pair (clj)

(map-to-pair rdd f)

Params: (f: PairFunction[T, K2, V2])

Result: JavaPairRDD[K2, V2]

Return a new RDD by applying a function to all elements of this RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.883Z

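A hedged sketch of a word-count-style pairing. It assumes the function must return a scala.Tuple2 (the type the underlying PairFunction expects); the Geni wrapper may also accept a plain Clojure pair:

;; assumes (require '[zero-one.geni.rdd :as rdd])
(-> (rdd/parallelise ["a" "b" "a"])
    (rdd/map-to-pair (fn [w] (scala.Tuple2. w 1)))  ;; assumption: Tuple2 pairs
    rdd/count-by-key)
;; counts per key: "a" -> 2, "b" -> 1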

map-values (clj)

(map-values rdd f)

Params: (f: Function[V, U])

Result: JavaPairRDD[K, U]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.160Z


mapcat (clj)

(mapcat rdd f)

Params: (f: FlatMapFunction[T, U])

Result: JavaRDD[U]

Return a new RDD by first applying a function to all elements of this RDD, and then flattening the results.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.840Z


mapcat-to-pair (clj)

(mapcat-to-pair rdd f)

Params: (f: PairFlatMapFunction[T, K2, V2])

Result: JavaPairRDD[K2, V2]

Return a new RDD by first applying a function to all elements of this RDD, and then flattening the results.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.842Z


master (clj)

(master)
(master spark)

Params:

Result: String

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.532Z


max (clj)

(max rdd cmp)

Params: (comp: Comparator[T])

Result: T

Returns the maximum element from this RDD as defined by the specified Comparator[T].

comp: the comparator that defines ordering.

Returns the maximum of the RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.884Z


memory-and-disk (clj)

Flag for controlling the storage of an RDD.

The default behaviour for a DataFrame or Dataset. At this storage level, the DataFrame is stored in JVM memory as deserialized objects. When the required storage is greater than the available memory, the excess partitions are stored on disk and read back from disk when needed. This is slower because of the I/O involved.


memory-and-disk-2 (clj)

Flag for controlling the storage of an RDD.

Same as memory-and-disk storage level but replicate each partition to two cluster nodes.


memory-and-disk-ser (clj)

Flag for controlling the storage of an RDD.

Same as the memory-and-disk storage level, the difference being that it serializes the DataFrame objects in memory and spills to disk when space is not available.


memory-and-disk-ser-2 (clj)

Flag for controlling the storage of an RDD.

Same as memory-and-disk-ser storage level but replicate each partition to two cluster nodes.


memory-only (clj)

Flag for controlling the storage of an RDD.


memory-only-2 (clj)

Flag for controlling the storage of an RDD.

Same as memory-only storage level but replicate each partition to two cluster nodes.


memory-only-ser (clj)

Flag for controlling the storage of an RDD.

Same as memory-only, the difference being that the RDD is stored in JVM memory as serialized objects. It takes less memory (more space-efficient) than memory-only, since objects are kept serialized, at the cost of a few extra CPU cycles to deserialize them.


memory-only-ser-2 (clj)

Flag for controlling the storage of an RDD.

Same as memory-only-ser storage level but replicate each partition to two cluster nodes.

Flag for controlling the storage of an RDD.

Same as `memory-only-ser` storage level but replicate each partition to two cluster nodes.
sourceraw docstring

minclj

(min rdd cmp)

Params: (comp: Comparator[T])

Result: T

Returns the minimum element from this RDD as defined by the specified Comparator[T].

the comparator that defines ordering

the minimum of the RDD

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.885Z

Params: (comp: Comparator[T])

Result: T

Returns the minimum element from this RDD as defined by the specified
Comparator[T].


the comparator that defines ordering

the minimum of the RDD

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.885Z
sourceraw docstring
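
A minimal usage sketch for min (and the analogous max above), assuming the namespace is required as rdd and that Clojure's compare can stand in for the Comparator[T] (Clojure functions implement java.util.Comparator); on a real cluster the comparator must also be serializable:

  (require '[zero-one.geni.rdd :as rdd])

  (rdd/min (rdd/parallelise [5 1 4 2 3]) compare) ;; the smallest element, 1
  (rdd/max (rdd/parallelise [5 1 4 2 3]) compare) ;; the largest element, 5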

nameclj

(name rdd)

Params: ()

Result: String

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.886Z

Params: ()

Result: String



Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.886Z
sourceraw docstring

noneclj

Flag for controlling the storage of an RDD.

No caching.

Flag for controlling the storage of an RDD.

No caching.
sourceraw docstring

num-partitionsclj

(num-partitions rdd)

Params:

Result: Int

Return the number of partitions in this RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.852Z

Params: 

Result: Int

Return the number of partitions in this RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.852Z
sourceraw docstring

off-heapclj

Flag for controlling the storage of an RDD.

Off-heap refers to objects (serialised to a byte array) that are managed by the operating system but stored outside the process heap in native memory (therefore, they are not processed by the garbage collector). Accessing this data is slightly slower than accessing the on-heap storage but still faster than reading/writing from a disk. The downside is that the user has to manually manage the allocated memory.

Flag for controlling the storage of an RDD.

Off-heap refers to objects (serialised to a byte array) that are managed by the operating system but stored outside the process heap in native memory (therefore, they are not processed by the garbage collector). Accessing this data is slightly slower than accessing the on-heap storage but still faster than reading/writing from a disk. The downside is that the user has to manually manage the allocated memory.
sourceraw docstring

paralleliseclj

(parallelise data)
(parallelise spark data)

Params: (list: List[T], numSlices: Int)

Result: JavaRDD[T]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.544Z

Params: (list: List[T], numSlices: Int)

Result: JavaRDD[T]



Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.544Z
sourceraw docstring
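
A minimal sketch of building an RDD from a local collection with the namespace aliased as rdd; with the single-argument arity the default SparkSession is assumed:

  (def xs (rdd/parallelise [1 2 3 4 5]))

  (rdd/num-partitions xs) ;; partition count chosen by Spark
  (rdd/take xs 3)         ;; the first three elements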

parallelise-doublesclj

(parallelise-doubles data)
(parallelise-doubles spark data)

Params: (list: List[Double], numSlices: Int)

Result: JavaDoubleRDD

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.546Z

Params: (list: List[Double], numSlices: Int)

Result: JavaDoubleRDD



Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.546Z
sourceraw docstring

parallelise-pairsclj

(parallelise-pairs data)
(parallelise-pairs spark data)

Params: (list: List[(K, V)], numSlices: Int)

Result: JavaPairRDD[K, V]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.549Z

Params: (list: List[(K, V)], numSlices: Int)

Result: JavaPairRDD[K, V]



Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.549Z
sourceraw docstring
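
A sketch of creating a pair RDD, assuming key-value pairs are passed as two-element vectors:

  (def kvs (rdd/parallelise-pairs [["a" 1] ["b" 2] ["a" 3]]))

  (rdd/take (rdd/vals kvs) 3) ;; the values 1, 2 and 3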

parallelizeclj

(parallelize data)
(parallelize spark data)

Params: (list: List[T], numSlices: Int)

Result: JavaRDD[T]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.544Z

Params: (list: List[T], numSlices: Int)

Result: JavaRDD[T]



Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.544Z
sourceraw docstring

parallelize-doublesclj

(parallelize-doubles data)
(parallelize-doubles spark data)

Params: (list: List[Double], numSlices: Int)

Result: JavaDoubleRDD

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.546Z

Params: (list: List[Double], numSlices: Int)

Result: JavaDoubleRDD



Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.546Z
sourceraw docstring

parallelize-pairsclj

(parallelize-pairs data)
(parallelize-pairs spark data)

Params: (list: List[(K, V)], numSlices: Int)

Result: JavaPairRDD[K, V]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.549Z

Params: (list: List[(K, V)], numSlices: Int)

Result: JavaPairRDD[K, V]



Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.549Z
sourceraw docstring

partition-byclj

(partition-by rdd partitioner)

Params: (partitioner: Partitioner)

Result: JavaPairRDD[K, V]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.168Z

Params: (partitioner: Partitioner)

Result: JavaPairRDD[K, V]



Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.168Z
sourceraw docstring

partitionerclj

(partitioner rdd)

Params:

Result: Optional[Partitioner]

The partitioner of this RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.890Z

Params: 

Result: Optional[Partitioner]

The partitioner of this RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.890Z
sourceraw docstring

partitionsclj

(partitions rdd)

Params:

Result: List[Partition]

Set of partitions in this RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.891Z

Params: 

Result: List[Partition]

Set of partitions in this RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.891Z
sourceraw docstring

persistclj

(persist rdd storage)

Params: (newLevel: StorageLevel)

Result: JavaRDD[T]

Set this RDD's storage level to persist its values across operations after the first time it is computed. This can only be used to assign a new storage level if the RDD does not have a storage level set yet.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.892Z

Params: (newLevel: StorageLevel)

Result: JavaRDD[T]

Set this RDD's storage level to persist its values across operations after the first time
it is computed. This can only be used to assign a new storage level if the RDD does not
have a storage level set yet.


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.892Z
sourceraw docstring
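
A sketch of persisting an RDD with one of the storage-level flags documented above, checking the level, and releasing the blocks again:

  (def cached (rdd/persist (rdd/parallelise (range 100)) rdd/memory-and-disk))

  (rdd/storage-level cached) ;; the StorageLevel that was just set
  (rdd/unpersist cached)     ;; mark non-persistent and drop the blocks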

persistent-rddsclj

(persistent-rdds)
(persistent-rdds spark)

Params:

Result: Map[Integer, JavaRDD[_]]

Returns a Java map of JavaRDDs that have marked themselves as persistent via cache() call.

This does not necessarily mean the caching or computation was successful.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.513Z

Params: 

Result: Map[Integer, JavaRDD[_]]

Returns a Java map of JavaRDDs that have marked themselves as persistent via cache() call.


This does not necessarily mean the caching or computation was successful.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.513Z
sourceraw docstring

random-splitclj

(random-split rdd weights)
(random-split rdd weights seed)

Params: (weights: Array[Double])

Result: Array[JavaRDD[T]]

Randomly splits this RDD with the provided weights.

weights for splits, will be normalized if they don't sum to 1

split RDDs in an array

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.902Z

Params: (weights: Array[Double])

Result: Array[JavaRDD[T]]

Randomly splits this RDD with the provided weights.


weights for splits, will be normalized if they don't sum to 1

split RDDs in an array

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.902Z
sourceraw docstring
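
A sketch of splitting an RDD into train/test subsets; the weights are normalised if they do not sum to 1, and the optional seed makes the split reproducible:

  (let [[train test] (rdd/random-split (rdd/parallelise (range 100)) [0.8 0.2] 42)]
    [(rdd/take train 3) (rdd/take test 3)]) ;; a peek at each split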

rdd?clj

(rdd? value)

Tests if value is an instance of JavaRDD.

Tests if `value` is an instance of `JavaRDD`.
sourceraw docstring

reduceclj

(reduce rdd f)

Params: (f: Function2[T, T, T])

Result: T

Reduces the elements of this RDD using the specified commutative and associative binary operator.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.904Z

Params: (f: Function2[T, T, T])

Result: T

Reduces the elements of this RDD using the specified commutative and associative binary
operator.


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.904Z
sourceraw docstring
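
A minimal sketch of reduce with a commutative and associative Clojure function; on a cluster the function must be serializable:

  (rdd/reduce (rdd/parallelise [1 2 3 4]) +) ;; sums to 10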

reduce-by-keyclj

(reduce-by-key rdd f)
(reduce-by-key rdd f partitions-or-partitioner)

Params: (partitioner: Partitioner, func: Function2[V, V, V])

Result: JavaPairRDD[K, V]

Merge the values for each key using an associative and commutative reduce function. This will also perform the merging locally on each mapper before sending results to a reducer, similarly to a "combiner" in MapReduce.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.188Z

Params: (partitioner: Partitioner, func: Function2[V, V, V])

Result: JavaPairRDD[K, V]

Merge the values for each key using an associative and commutative reduce function. This will
also perform the merging locally on each mapper before sending results to a reducer, similarly
to a "combiner" in MapReduce.


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.188Z
sourceraw docstring
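
A sketch of a per-key sum with reduce-by-key on a small pair RDD:

  (def sums
    (rdd/reduce-by-key (rdd/parallelise-pairs [["a" 1] ["b" 2] ["a" 3]]) +))

  (rdd/take sums 2) ;; "a" maps to 4 and "b" to 2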

reduce-by-key-locallyclj

(reduce-by-key-locally rdd f)

Params: (func: Function2[V, V, V])

Result: Map[K, V]

Merge the values for each key using an associative and commutative reduce function, but return the result immediately to the master as a Map. This will also perform the merging locally on each mapper before sending results to a reducer, similarly to a "combiner" in MapReduce.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.190Z

Params: (func: Function2[V, V, V])

Result: Map[K, V]

Merge the values for each key using an associative and commutative reduce function, but return
the result immediately to the master as a Map. This will also perform the merging locally on
each mapper before sending results to a reducer, similarly to a "combiner" in MapReduce.


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.190Z
sourceraw docstring

repartitionclj

(repartition rdd num-partitions)

Params: (numPartitions: Int)

Result: JavaRDD[T]

Return a new RDD that has exactly numPartitions partitions.

Can increase or decrease the level of parallelism in this RDD. Internally, this uses a shuffle to redistribute data.

If you are decreasing the number of partitions in this RDD, consider using coalesce, which can avoid performing a shuffle.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.905Z

Params: (numPartitions: Int)

Result: JavaRDD[T]

Return a new RDD that has exactly numPartitions partitions.

Can increase or decrease the level of parallelism in this RDD. Internally, this uses
a shuffle to redistribute data.

If you are decreasing the number of partitions in this RDD, consider using coalesce,
which can avoid performing a shuffle.


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.905Z
sourceraw docstring
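
A sketch pairing repartition with num-partitions to confirm the new partition count; when decreasing partitions, coalesce may avoid the shuffle, as noted above:

  (def repartitioned (rdd/repartition (rdd/parallelise (range 100)) 8))

  (rdd/num-partitions repartitioned) ;; => 8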

repartition-and-sort-within-partitionsclj

(repartition-and-sort-within-partitions rdd partitioner)
(repartition-and-sort-within-partitions rdd partitioner cmp)

Params: (partitioner: Partitioner)

Result: JavaPairRDD[K, V]

Repartition the RDD according to the given partitioner and, within each resulting partition, sort records by their keys.

This is more efficient than calling repartition and then sorting within each partition because it can push the sorting down into the shuffle machinery.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.193Z

Params: (partitioner: Partitioner)

Result: JavaPairRDD[K, V]

Repartition the RDD according to the given partitioner and, within each resulting partition,
sort records by their keys.

This is more efficient than calling repartition and then sorting within each partition
because it can push the sorting down into the shuffle machinery.


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.193Z
sourceraw docstring

resourcesclj

(resources)
(resources spark)

Params:

Result: Map[String, ResourceInformation]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.550Z

Params: 

Result: Map[String, ResourceInformation]



Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.550Z
sourceraw docstring

right-outer-joinclj

(right-outer-join left right)
(right-outer-join left right partitions-or-partitioner)

Params: (other: JavaPairRDD[K, W], partitioner: Partitioner)

Result: JavaPairRDD[K, (Optional[V], W)]

Perform a right outer join of this and other. For each element (k, w) in other, the resulting RDD will either contain all pairs (k, (Some(v), w)) for v in this, or the pair (k, (None, w)) if no elements in this have key k. Uses the given Partitioner to partition the output RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.198Z

Params: (other: JavaPairRDD[K, W], partitioner: Partitioner)

Result: JavaPairRDD[K, (Optional[V], W)]

Perform a right outer join of this and other. For each element (k, w) in other, the
resulting RDD will either contain all pairs (k, (Some(v), w)) for v in this, or the
pair (k, (None, w)) if no elements in this have key k. Uses the given Partitioner to
partition the output RDD.


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.198Z
sourceraw docstring
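
A sketch of a right outer join of two pair RDDs; a key present only on the right comes back with an absent left value:

  (def left  (rdd/parallelise-pairs [["a" 1]]))
  (def right (rdd/parallelise-pairs [["a" 10] ["b" 20]]))

  (rdd/take (rdd/right-outer-join left right) 2)
  ;; "a" joins to (1, 10); "b" joins to (absent, 20), the left side being an Optional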

sampleclj

(sample rdd with-replacement fraction)
(sample rdd with-replacement fraction seed)

Params: (withReplacement: Boolean, fraction: Double)

Result: JavaRDD[T]

Return a sampled subset of this RDD with a random seed.

can elements be sampled multiple times (replaced when sampled out)

expected size of the sample as a fraction of this RDD's size. Without replacement: probability that each element is chosen; fraction must be in [0, 1]. With replacement: expected number of times each element is chosen; fraction must be greater than or equal to 0.

This is NOT guaranteed to provide exactly the fraction of the count of the given RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.908Z

Params: (withReplacement: Boolean, fraction: Double)

Result: JavaRDD[T]

Return a sampled subset of this RDD with a random seed.


can elements be sampled multiple times (replaced when sampled out)

expected size of the sample as a fraction of this RDD's size
 without replacement: probability that each element is chosen; fraction must be [0, 1]
 with replacement: expected number of times each element is chosen; fraction must be greater
 than or equal to 0

This is NOT guaranteed to provide exactly the fraction of the count
of the given RDD.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.908Z
sourceraw docstring
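
A sketch of sampling roughly 10% of an RDD without replacement, with a fixed seed for reproducibility; as noted above, the exact fraction is not guaranteed:

  (rdd/take (rdd/sample (rdd/parallelise (range 100)) false 0.1 42) 5)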

sample-by-keyclj

(sample-by-key rdd with-replacement fractions)
(sample-by-key rdd with-replacement fractions seed)

Params: (withReplacement: Boolean, fractions: Map[K, Double], seed: Long)

Result: JavaPairRDD[K, V]

Return a subset of this RDD sampled by key (via stratified sampling).

Create a sample of this RDD using variable sampling rates for different keys as specified by fractions, a key to sampling rate map, via simple random sampling with one pass over the RDD, to produce a sample of size that's approximately equal to the sum of math.ceil(numItems * samplingRate) over all key values.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.203Z

Params: (withReplacement: Boolean, fractions: Map[K, Double], seed: Long)

Result: JavaPairRDD[K, V]

Return a subset of this RDD sampled by key (via stratified sampling).

Create a sample of this RDD using variable sampling rates for different keys as specified by
fractions, a key to sampling rate map, via simple random sampling with one pass over the
RDD, to produce a sample of size that's approximately equal to the sum of
math.ceil(numItems * samplingRate) over all key values.


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.203Z
sourceraw docstring
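
A sketch of stratified sampling, assuming the per-key fractions are passed as an ordinary Clojure map:

  (rdd/sample-by-key
    (rdd/parallelise-pairs [["a" 1] ["a" 2] ["b" 3]])
    false
    {"a" 0.5 "b" 1.0}
    42)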

sample-by-key-exactclj

(sample-by-key-exact rdd with-replacement fractions)
(sample-by-key-exact rdd with-replacement fractions seed)

Params: (withReplacement: Boolean, fractions: Map[K, Double], seed: Long)

Result: JavaPairRDD[K, V]

Return a subset of this RDD sampled by key (via stratified sampling) containing exactly math.ceil(numItems * samplingRate) for each stratum (group of pairs with the same key).

This method differs from sampleByKey in that we make additional passes over the RDD to create a sample size that's exactly equal to the sum of math.ceil(numItems * samplingRate) over all key values with a 99.99% confidence. When sampling without replacement, we need one additional pass over the RDD to guarantee sample size; when sampling with replacement, we need two additional passes.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.206Z

Params: (withReplacement: Boolean, fractions: Map[K, Double], seed: Long)

Result: JavaPairRDD[K, V]

Return a subset of this RDD sampled by key (via stratified sampling) containing exactly
math.ceil(numItems * samplingRate) for each stratum (group of pairs with the same key).

This method differs from sampleByKey in that we make additional passes over the RDD to
create a sample size that's exactly equal to the sum of math.ceil(numItems * samplingRate)
over all key values with a 99.99% confidence. When sampling without replacement, we need one
additional pass over the RDD to guarantee sample size; when sampling with replacement, we need
two additional passes.


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.206Z
sourceraw docstring

save-as-text-fileclj

(save-as-text-file rdd path)

Params: (path: String)

Result: Unit

Save this RDD as a text file, using string representations of elements.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.911Z

Params: (path: String)

Result: Unit

Save this RDD as a text file, using string representations of elements.


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.911Z
sourceraw docstring

scclj

(sc)
(sc spark)

Params:

Result: SparkContext

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.550Z

Params: 

Result: SparkContext



Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.550Z
sourceraw docstring

sort-by-keyclj

(sort-by-key rdd)
(sort-by-key rdd asc)

Params: ()

Result: JavaPairRDD[K, V]

Sort the RDD by key, so that each partition contains a sorted range of the elements in ascending order. Calling collect or save on the resulting RDD will return or output an ordered list of records (in the save case, they will be written to multiple part-X files in the filesystem, in order of the keys).

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.231Z

Params: ()

Result: JavaPairRDD[K, V]

Sort the RDD by key, so that each partition contains a sorted range of the elements in
ascending order. Calling collect or save on the resulting RDD will return or output an
ordered list of records (in the save case, they will be written to multiple part-X files
in the filesystem, in order of the keys).


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.231Z
sourceraw docstring
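
A sketch of sorting a pair RDD by key; the optional second argument selects ascending (true, the default) or descending (false) order:

  (def pairs (rdd/parallelise-pairs [["b" 2] ["a" 1] ["c" 3]]))

  (rdd/take (rdd/sort-by-key pairs) 3)       ;; keys in order a, b, c
  (rdd/take (rdd/sort-by-key pairs false) 3) ;; keys in order c, b, a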

spark-contextclj

(spark-context)
(spark-context spark)

Params:

Result: SparkContext

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.550Z

Params: 

Result: SparkContext



Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.550Z
sourceraw docstring

spark-homeclj

(spark-home)
(spark-home spark)

Params: ()

Result: Optional[String]

Get Spark's home location from either a value set through the constructor, or the spark.home Java property, or the SPARK_HOME environment variable (in that order of preference). If neither of these is set, return None.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.518Z

Params: ()

Result: Optional[String]

Get Spark's home location from either a value set through the constructor,
or the spark.home Java property, or the SPARK_HOME environment variable
(in that order of preference). If neither of these is set, return None.


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.518Z
sourceraw docstring

storage-levelclj

(storage-level rdd)

Params:

Result: StorageLevel

Get the RDD's current storage level, or StorageLevel.NONE if none is set.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.853Z

Params: 

Result: StorageLevel

Get the RDD's current storage level, or StorageLevel.NONE if none is set.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.853Z
sourceraw docstring

subtractclj

(subtract)
(subtract rdd)
(subtract left right)
(subtract left right arg)
(subtract left right arg & rdds)

Params: (other: JavaRDD[T])

Result: JavaRDD[T]

Return an RDD with the elements from this that are not in other.

Uses this partitioner/partition size, because even if other is huge, the resulting RDD will be less than or equal to us.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.917Z

Params: (other: JavaRDD[T])

Result: JavaRDD[T]

Return an RDD with the elements from this that are not in other.

Uses this partitioner/partition size, because even if other is huge, the resulting
RDD will be less than or equal to us.


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.917Z
sourceraw docstring
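
A sketch of subtract, which removes from one RDD the elements present in another:

  (rdd/take
    (rdd/subtract (rdd/parallelise [1 2 3 4 5]) (rdd/parallelise [2 4]))
    5) ;; 1, 3 and 5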

subtract-by-keyclj

(subtract-by-key left right)
(subtract-by-key left right partitions-or-partitioner)

Params: (other: JavaPairRDD[K, W])

Result: JavaPairRDD[K, V]

Return an RDD with the pairs from this whose keys are not in other.

Uses this partitioner/partition size, because even if other is huge, the resulting RDD will be <= us.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.240Z

Params: (other: JavaPairRDD[K, W])

Result: JavaPairRDD[K, V]

Return an RDD with the pairs from this whose keys are not in other.

Uses this partitioner/partition size, because even if other is huge, the resulting
RDD will be <= us.


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.240Z
sourceraw docstring

takeclj

(take rdd n)

Params: (num: Int)

Result: List[T]

Take the first num elements of the RDD. This currently scans the partitions one by one, so it will be slow if a lot of partitions are required. In that case, use collect() to get the whole RDD instead.

this method should only be used if the resulting array is expected to be small, as all the data is loaded into the driver's memory.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.923Z

Params: (num: Int)

Result: List[T]

Take the first num elements of the RDD. This currently scans the partitions *one by one*, so
it will be slow if a lot of partitions are required. In that case, use collect() to get the
whole RDD instead.


this method should only be used if the resulting array is expected to be small, as
all the data is loaded into the driver's memory.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.923Z
sourceraw docstring

take-asyncclj

(take-async rdd n)

Params: (num: Int)

Result: JavaFutureAction[List[T]]

The asynchronous version of the take action, which returns a future for retrieving the first num elements of this RDD.

this method should only be used if the resulting array is expected to be small, as all the data is loaded into the driver's memory.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.924Z

Params: (num: Int)

Result: JavaFutureAction[List[T]]

The asynchronous version of the take action, which returns a
future for retrieving the first num elements of this RDD.


this method should only be used if the resulting array is expected to be small, as
all the data is loaded into the driver's memory.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.924Z
sourceraw docstring

take-orderedclj

(take-ordered rdd n)
(take-ordered rdd n cmp)

Params: (num: Int, comp: Comparator[T])

Result: List[T]

Returns the first k (smallest) elements from this RDD as defined by the specified Comparator[T] and maintains the order.

k, the number of elements to return

the comparator that defines the order

an array of top elements

this method should only be used if the resulting array is expected to be small, as all the data is loaded into the driver's memory.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.927Z

Params: (num: Int, comp: Comparator[T])

Result: List[T]

Returns the first k (smallest) elements from this RDD as defined by
the specified Comparator[T] and maintains the order.


k, the number of elements to return

the comparator that defines the order

an array of top elements

this method should only be used if the resulting array is expected to be small, as
all the data is loaded into the driver's memory.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.927Z
sourceraw docstring

take-sampleclj

(take-sample rdd with-replacement n)
(take-sample rdd with-replacement n seed)

Params: (withReplacement: Boolean, num: Int)

Result: List[T]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.929Z

Params: (withReplacement: Boolean, num: Int)

Result: List[T]



Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.929Z
sourceraw docstring

text-filecljmultimethod

Params: (path: String)

Result: JavaRDD[String]

Read a text file from HDFS, a local file system (available on all nodes), or any Hadoop-supported file system URI, and return it as an RDD of Strings. The text files must be encoded as UTF-8.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.570Z

Params: (path: String)

Result: JavaRDD[String]

Read a text file from HDFS, a local file system (available on all nodes), or any
Hadoop-supported file system URI, and return it as an RDD of Strings.
The text files must be encoded as UTF-8.


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.570Z
sourceraw docstring
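
A sketch of reading a text file into an RDD of lines and writing it back out; the paths here are placeholders, and both local paths and hdfs:// URIs are valid:

  (def lines (rdd/text-file "data/input.txt"))

  (rdd/take lines 5)                               ;; the first five lines
  (rdd/save-as-text-file lines "data/output-dir")  ;; writes one part-* file per partition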

topclj

(top rdd n)
(top rdd n cmp)

Params: (num: Int, comp: Comparator[T])

Result: List[T]

Returns the top k (largest) elements from this RDD as defined by the specified Comparator[T] and maintains the order.

k, the number of top elements to return

the comparator that defines the order

an array of top elements

this method should only be used if the resulting array is expected to be small, as all the data is loaded into the driver's memory.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.935Z

Params: (num: Int, comp: Comparator[T])

Result: List[T]

Returns the top k (largest) elements from this RDD as defined by
the specified Comparator[T] and maintains the order.


k, the number of top elements to return

the comparator that defines the order

an array of top elements

this method should only be used if the resulting array is expected to be small, as
all the data is loaded into the driver's memory.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.935Z
sourceraw docstring
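
A sketch contrasting take-ordered and top on a numeric RDD; both load their results into the driver, so keep n small:

  (def nums (rdd/parallelise [5 1 4 2 3]))

  (rdd/take-ordered nums 2) ;; the two smallest elements, 1 and 2
  (rdd/top nums 2)          ;; the two largest elements, 5 and 4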

unionclj

(union)
(union rdd)
(union left right)
(union left right & rdds)

Params: (other: JavaRDD[T])

Result: JavaRDD[T]

Return the union of this RDD and another one. Any identical elements will appear multiple times (use .distinct() to eliminate them).

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.942Z

Params: (other: JavaRDD[T])

Result: JavaRDD[T]

Return the union of this RDD and another one. Any identical elements will appear multiple
times (use .distinct() to eliminate them).


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.942Z
sourceraw docstring
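
A sketch of union, which keeps duplicates unless distinct is applied afterwards:

  (rdd/take
    (rdd/union (rdd/parallelise [1 2]) (rdd/parallelise [2 3]))
    4) ;; the four elements 1, 2, 2 and 3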

unpersistclj

(unpersist rdd)
(unpersist rdd blocking)

Params: ()

Result: JavaRDD[T]

Mark the RDD as non-persistent, and remove all blocks for it from memory and disk. This method blocks until all blocks are deleted.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.944Z

Params: ()

Result: JavaRDD[T]

Mark the RDD as non-persistent, and remove all blocks for it from memory and disk.
This method blocks until all blocks are deleted.


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.944Z
sourceraw docstring

valsclj

(vals rdd)

Params: ()

Result: JavaRDD[V]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.266Z

Params: ()

Result: JavaRDD[V]



Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.266Z
sourceraw docstring

valueclj

memfn of value

memfn of value
sourceraw docstring

valuesclj

(values rdd)

Params: ()

Result: JavaRDD[V]

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.266Z

Params: ()

Result: JavaRDD[V]



Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaPairRDD.html

Timestamp: 2020-10-19T01:56:48.266Z
sourceraw docstring

versionclj

(version)
(version spark)

Params:

Result: String

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.576Z

Params: 

Result: String



Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.576Z
sourceraw docstring

whole-text-filescljmultimethod

Params: (path: String, minPartitions: Int)

Result: JavaPairRDD[String, String]

Read a directory of text files from HDFS, a local file system (available on all nodes), or any Hadoop-supported file system URI. Each file is read as a single record and returned in a key-value pair, where the key is the path of each file, the value is the content of each file. The text files must be encoded as UTF-8.

For example, given a directory of files, the resulting rdd contains one (file path, file content) pair per file.

A suggestion value of the minimal splitting number for input data.

Small files are preferred; large files are also allowable, but they may cause bad performance.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.582Z

Params: (path: String, minPartitions: Int)

Result: JavaPairRDD[String, String]

Read a directory of text files from HDFS, a local file system (available on all nodes), or any
Hadoop-supported file system URI. Each file is read as a single record and returned in a
key-value pair, where the key is the path of each file, the value is the content of each file.
The text files must be encoded as UTF-8.

For example, given a directory of files, the resulting rdd contains one (file path, file content) pair per file.

A suggestion value of the minimal splitting number for input data.

Small files are preferred; large files are also allowable, but they may cause bad performance.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaSparkContext.html

Timestamp: 2020-10-19T01:56:49.582Z
sourceraw docstring

zipclj

(zip left right)

Params: (other: JavaRDDLike[U, _])

Result: JavaPairRDD[T, U]

Zips this RDD with another one, returning key-value pairs with the first element in each RDD, second element in each RDD, etc. Assumes that the two RDDs have the same number of partitions and the same number of elements in each partition (e.g. one was made through a map on the other).

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.950Z

Params: (other: JavaRDDLike[U, _])

Result: JavaPairRDD[T, U]

Zips this RDD with another one, returning key-value pairs with the first element in each RDD,
second element in each RDD, etc. Assumes that the two RDDs have the *same number of
partitions* and the *same number of elements in each partition* (e.g. one was made through
a map on the other).


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.950Z
sourceraw docstring
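
A sketch of zipping two RDDs built from equally sized collections, so they have the same number of partitions and elements per partition:

  (def letters (rdd/parallelise ["a" "b" "c"]))
  (def numbers (rdd/parallelise [1 2 3]))

  (rdd/take (rdd/zip letters numbers) 3) ;; the pairs ("a" 1), ("b" 2) and ("c" 3)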

zip-partitionsclj

(zip-partitions left right f)

Params: (other: JavaRDDLike[U, _], f: FlatMapFunction2[Iterator[T], Iterator[U], V])

Result: JavaRDD[V]

Zip this RDD's partitions with one (or more) RDD(s) and return a new RDD by applying a function to the zipped partitions. Assumes that all the RDDs have the same number of partitions, but does not require them to have the same number of elements in each partition.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.952Z

Params: (other: JavaRDDLike[U, _], f: FlatMapFunction2[Iterator[T], Iterator[U], V])

Result: JavaRDD[V]

Zip this RDD's partitions with one (or more) RDD(s) and return a new RDD by
applying a function to the zipped partitions. Assumes that all the RDDs have the
*same number of partitions*, but does *not* require them to have the same number
of elements in each partition.


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.952Z
sourceraw docstring

zip-with-indexclj

(zip-with-index rdd)

Params: ()

Result: JavaPairRDD[T, Long]

Zips this RDD with its element indices. The ordering is first based on the partition index and then the ordering of items within each partition. So the first item in the first partition gets index 0, and the last item in the last partition receives the largest index. This is similar to Scala's zipWithIndex but it uses Long instead of Int as the index type. This method needs to trigger a spark job when this RDD contains more than one partition.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.953Z

Params: ()

Result: JavaPairRDD[T, Long]

Zips this RDD with its element indices. The ordering is first based on the partition index
and then the ordering of items within each partition. So the first item in the first
partition gets index 0, and the last item in the last partition receives the largest index.
This is similar to Scala's zipWithIndex but it uses Long instead of Int as the index type.
This method needs to trigger a spark job when this RDD contains more than one partition.


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.953Z
sourceraw docstring

zip-with-unique-idclj

(zip-with-unique-id rdd)

Params: ()

Result: JavaPairRDD[T, Long]

Zips this RDD with generated unique Long ids. Items in the kth partition will get ids k, n+k, 2*n+k, ..., where n is the number of partitions. So there may exist gaps, but this method won't trigger a spark job, which is different from org.apache.spark.rdd.RDD#zipWithIndex.

Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.954Z

Params: ()

Result: JavaPairRDD[T, Long]

Zips this RDD with generated unique Long ids. Items in the kth partition will get ids k, n+k,
2*n+k, ..., where n is the number of partitions. So there may exist gaps, but this method
won't trigger a spark job, which is different from org.apache.spark.rdd.RDD#zipWithIndex.


Source: https://spark.apache.org/docs/3.0.1/api/scala/org/apache/spark/api/java/JavaRDD.html

Timestamp: 2020-10-19T01:56:48.954Z
sourceraw docstring
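
A sketch contrasting zip-with-index and zip-with-unique-id; the former assigns consecutive indices (and may trigger a job), the latter assigns unique but possibly non-consecutive ids without one:

  (def items (rdd/parallelise ["a" "b" "c"]))

  (rdd/take (rdd/zip-with-index items) 3)     ;; ("a" 0), ("b" 1), ("c" 2)
  (rdd/take (rdd/zip-with-unique-id items) 3) ;; unique Long ids, gaps possible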
