jepsen.random


Clojure only.

Pluggable generation of random values.

## Pluggable

First, randomness should be pluggable. In normal test runs, standard Clojure
`(rand-int)` and friends are just fine. But in our own tests, it's nice if
those can be replaced by a deterministically seeded generator. When running in
a hypervisor like Antithesis, you want to draw entropy from a special SDK, so
it can intentionally send you down interesting paths.
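As a minimal sketch of what "pluggable" can mean (not necessarily how this namespace does it), a var can hold the current entropy source and every drawing function can consult it. The names `*source*`, `draw-long`, and `seeded-draws` are hypothetical, and the sketch assumes a plain `java.util.Random` and dynamic binding:

```clojure
(import '(java.util Random))

;; Hypothetical: the currently plugged-in entropy source.
(def ^:dynamic *source* (Random.))

(defn draw-long
  "Draws a long in [0, upper) from whatever source is currently plugged in."
  [upper]
  (long (.nextInt ^Random *source* (int upper))))

(defn seeded-draws
  "Draws n values below upper from a source seeded with seed."
  [seed n upper]
  (binding [*source* (Random. (long seed))]
    ;; vec forces the lazy seq while the binding is still in effect.
    (vec (repeatedly n #(draw-long upper)))))
```

Calling `(seeded-draws 42 5 10)` twice yields the same vector, while plain `(draw-long 10)` keeps using the unseeded default source.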

## Fast

Second, it should be *reasonably* fast. Ideally, we'd like to generate ~100 K
ops/sec, and each operation might need to draw, say, 10 random values, which
is 1 M values/sec. Basic Clojure `(rand-int 5)` on my Threadripper takes ~37
ns. Clojure's data.generators takes ~35 ns. Our thread-local implementation,
backed by an LXM splittable random, takes just ~33 ns.
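The arithmetic behind that budget can be checked directly. This is a back-of-the-envelope sketch; `draw-budget` is a hypothetical helper, and it assumes all draws happen on a single core:

```clojure
(defn draw-budget
  "Fraction of one core's time spent drawing random values, given an
  operation rate, draws per operation, and nanoseconds per draw."
  [ops-per-sec draws-per-op ns-per-draw]
  (/ (* ops-per-sec draws-per-op ns-per-draw) 1e9))

;; 100 K ops/sec * 10 draws/op * 33 ns/draw: about 3.3% of one core.
(draw-budget 100000 10 33)
```

At these figures, all three implementations fit comfortably within the target rate; the differences between them are small.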

## Thread-safe

We want everyone in the Jepsen universe to be able to draw random values from
this namespace without coordinating. This implies generating values should be
thread-safe.
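One common way to get coordination-free, thread-safe draws is to give each thread its own generator. The sketch below uses the JDK's `java.util.concurrent.ThreadLocalRandom` purely as an illustration; it is not this namespace's implementation, and `tl-long` and `draw-concurrently` are hypothetical names:

```clojure
(import '(java.util.concurrent ThreadLocalRandom))

(defn tl-long
  "Draws a long in [0, upper) from the calling thread's own generator."
  [upper]
  (.nextLong (ThreadLocalRandom/current) (long upper)))

(defn draw-concurrently
  "Starts `threads` threads, each drawing n values below upper, and
  returns every value drawn."
  [threads n upper]
  (let [results (atom [])
        workers (mapv (fn [_]
                        (Thread.
                          (fn []
                            ;; Realize the draws before touching shared state.
                            (let [xs (vec (repeatedly n #(tl-long upper)))]
                              (swap! results into xs)))))
                      (range threads))]
    (run! #(.start ^Thread %) workers)
    (run! #(.join ^Thread %) workers)
    @results))
```

No locks are taken while drawing; the only shared state here is the atom used to collect results for inspection.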

## Stateful

This namespace must be stateful. We'd like callers to simply be able to call
`(rand/long 5)` and get back a number like 2.

Pure, splittable random seeds à la `test.check` are nice, but they a.) come
with a performance penalty, and b.) require threading that random state
through essentially every function call and return. This is not only complex,
but also adds destructuring overhead at each call boundary.
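To make the cost of the pure style concrete, here is a toy sketch in which every draw returns both a value and the next state, and callers must thread that state by hand. `pure-long` and `pure-op` are hypothetical, and the generator is a simple linear congruential step chosen for brevity, not quality:

```clojure
(defn pure-long
  "Pure draw: takes a state and returns [value next-state], where value
  is in [0, upper). A toy LCG, for illustration only."
  [^long state ^long upper]
  (let [next-state (unchecked-add
                     (unchecked-multiply state 6364136223846793005)
                     1442695040888963407)
        ;; Keep the high bits, which are non-negative after the shift.
        value      (mod (unsigned-bit-shift-right next-state 33) upper)]
    [value next-state]))

(defn pure-op
  "Builds an operation from two draws. Note the destructuring and
  re-threading of state at every call."
  [state]
  (let [[f s1] (pure-long state 2)
        [k s2] (pure-long s1 10)]
    [{:f (if (zero? f) :read :write), :key k} s2]))
```

Every function that needs randomness has to accept a state and return a new one, which is exactly the plumbing a stateful namespace avoids.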

The main advantage of pure, splittable random generators is determinism across
threads, but this is not a major concern in Jepsen. In normal test runs, we
don't care about reproducibility. In Antithesis, the entire thread schedule
is deterministic, so we're free to share state across threads and trust that
Antithesis Will Take Care Of It. In tests, we're generally drawing entropy
from a single thread. It'd be *nice* to have random generators that are
deterministic across threads, but it's not critical.

## Determinism

In single-threaded contexts, we want to be able to seed randomness and have
reproducible tests. Doing this across threads is not really important--if we
were being rigorous we could thread a splittable random down through every
thread spawned, but there's a LOT of threaded code in Jepsen and it doesn't
all know about us. More to the point, our multithreaded code is usually a.)
non-random, or b.) doing IO, which we can't control. Having determinism for a
single thread gets us a reasonable 'bang for our buck'.
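For comparison, the "rigorous" alternative of handing every spawned thread its own split generator might look like the sketch below. It uses the JDK's `java.util.SplittableRandom` as a stand-in; `split-draws` is a hypothetical name, and nothing here is claimed to be part of Jepsen:

```clojure
(import '(java.util SplittableRandom))

(defn split-draws
  "Gives each of `workers` threads its own generator, split from one
  seeded root in a fixed order, and returns each worker's n draws
  below upper as a vector of vectors."
  [seed workers n upper]
  (let [root    (SplittableRandom. (long seed))
        ;; Splitting happens eagerly on this thread, so the order is fixed.
        rngs    (mapv (fn [_] (.split root)) (range workers))
        outs    (mapv (fn [_] (promise)) rngs)
        threads (mapv (fn [^SplittableRandom rng out]
                        (Thread.
                          (fn []
                            (deliver out
                                     (vec (repeatedly
                                            n
                                            #(.nextLong rng (long upper))))))))
                      rngs outs)]
    (run! #(.start ^Thread %) threads)
    (mapv deref outs)))
```

Because each worker only touches its own generator, the result does not depend on thread scheduling. The catch is that every piece of code that spawns a thread must cooperate, which is the cost the paragraph above declines to pay.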

## Special Distributions

Jepsen needs some random things that aren't well supported by the normal
java.util.Random, clojure.core, or data.generators functions. In particular,
we like to use:

1. Zipfian distributions: lots of small things, but sometimes very
   large things.
2. Weighted choices: 90% reads, 5% writes, 5% deletes.
3. Special values: over-represent maxima, minima, and zero, to stress
   codepaths that might treat them differently.
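The first two items can be sketched with a cumulative-weight lookup. These are illustrative helpers with hypothetical names (`weighted-choice`, `zipf-rank`), not the signatures of this namespace's `weighted` or `zipf`; each takes a uniform value `u` in [0, 1) as an argument so the mapping is easy to inspect:

```clojure
(defn weighted-choice
  "Given [value weight] pairs and a uniform u in [0, 1), returns the
  value whose cumulative weight range contains u."
  [pairs u]
  (let [total  (reduce + 0.0 (map second pairs))
        target (* (double u) total)]
    (loop [[[v w] & more] (seq pairs)
           acc            0.0]
      (let [acc (+ acc w)]
        ;; Fall back to the last value if rounding leaves target >= total.
        (if (or (< target acc) (empty? more))
          v
          (recur more acc))))))

(defn zipf-rank
  "Rank in [1, n] under a finite Zipfian distribution, where rank k has
  weight 1 / k^skew."
  [n skew u]
  (weighted-choice
    (map (fn [k] [k (/ 1.0 (Math/pow (double k) (double skew)))])
         (range 1 (inc n)))
    u))
```

With `[[:read 90] [:write 5] [:delete 5]]`, values of `u` below 0.9 select `:read`, and the remaining tenth is split between `:write` and `:delete`. This linear scan costs O(n) per draw; a production implementation would likely precompute or invert the CDF.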

## Usage

Here are common Clojure functions and their equivalents in this namespace:

  rand        rand/double
  rand-int    rand/long
  rand-nth    rand/nth
  shuffle     rand/shuffle

You can also generate values from common distributions:

  rand/bool       Returns true or false, optionally with a probability
  rand/exp        Exponential distribution
  rand/geometric  Geometric distribution
  rand/zipf       Zipfian distribution
  rand/weighted   Discrete values with given weights

You can take random permutations and subsets (really, ordered prefixes of
permutations) of collections with:

  rand/shuffle
  rand/nonempty-subset

There are two macros for randomly branching control flow:

  rand/branch
  rand/weighted-branch
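Branching has to be a macro rather than a function because a function would evaluate every argument before choosing one. The sketch below shows the idea with a hypothetical `pick-branch` macro built on `clojure.core/rand-int`; it is not the definition of `rand/branch`, whose exact syntax is not shown here:

```clojure
(defmacro pick-branch
  "Evaluates exactly one of the given forms, chosen uniformly at random.
  Expects at least one form."
  [& forms]
  `(case (long (rand-int ~(count forms)))
     ;; Expands to index/form pairs: 0 form0, 1 form1, ...
     ~@(mapcat (fn [i form] [i form]) (range) forms)))

;; Only one of these side effects happens per call.
(let [hits (atom 0)]
  (pick-branch (swap! hits inc)
               (swap! hits inc)
               (swap! hits inc))
  @hits)
```

The final expression returns 1: the unchosen branches are never evaluated.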

To re-bind randomness to a specifically seeded RNG, use:

(jepsen.random/with-seed 5
  (jepsen.random/long)                   ; Returns the same value every time
  (call-stuff-using-jepsen.random ...))  ; This holds for the whole body

This changes a global variable `jepsen.random/rng` and is NOT THREAD SAFE. Do
not use `with-seed` concurrently. It's fine to spawn threads within the body,
but if those threads are spawned in a nondeterministic order, then their
calls to jepsen.random will also be nondeterministic.


Vars in this namespace:

  all-zero-doubles?
  bool
  branch (macro)
  clojure-random
  data-generators-random
  double
  double-weighted-index
  exp
  geometric
  long
  nonempty-subset
  nth
  nth-empty
  rng
  shuffle
  thread-local-random
  weighted
  weighted-branch (macro)
  weighted-fn
  with-rng (macro)
  with-seed (macro)
  zipf
  zipf-b-inverse-cdf
  zipf-default-skew
