Liking cljdoc? Tell your friends :D

sisyphus

SISYPHUS

"Forever, I shall be a stranger to myself."

― Albert Camus

concept

Sisyphus is a task execution worker designed for decentralization, running commands in containers, and pulling inputs from and pushing outputs to a data store. There is no leader so you can add new nodes on the fly to help out with load and scale them back down again when you are done.

setup

In order to use Google Cloud Storage, Compute Engine, and Container Registry you must install the SDK, then log in. Follow the instructions in Install and log in to the Google Cloud SDK. (See Cloud Storage Client Libraries for more background on programming with the SDK to access cloud resources.)

Sisyphus uses RabbitMQ to receive run-task messages and Kafka to communicate with the workflow engine Gaia, so you need those servers running somewhere you can connect to.

storage

There are three types of pathnames in use:

  • remote - files in the object store.
  • local - files on the system that is running a Sisyphus node.
  • internal - files inside the docker container.

Sisyphus pulls inputs down from the object store to the local system, then maps a volume from that local path into a specified path internal to the container that is actually going to run the code. Once the code finishes, a reverse mapping takes outputs from internal paths and maps them to local paths, which are then uploaded back into the remote object store. This is the full round trip of a single task being executed and has the ultimate effect of operating on objects in an object store and adding the results back as new objects into that same store.

In general, clients of Sisyphus only really need to be aware of remote and internal paths. Local paths are managed entirely by Sisyphus, but it is good to know that this intermediate layer exists.

task documents

Each task to run is represented by a task document in JSON format. This document details the command that will be run, the image it will be run with, any inputs the command needs to run, and any outputs we want to record later. Here is an example document:

{"image": "alpine:latest",
 "inputs": {
   "sisyphus:data/clear": "/mnt/clear"},
 "outputs": {
   "sisyphus:data/crypt": "/mnt/crypt"},
 "command": [
     "md5sum",
     "/mnt/clear"],
 "stdout": "/mnt/crypt"}

The map keys are:

  • image - what docker image to use for this command.
  • inputs - map of bucketname:remote/path to /internal/container/path for copying down files from the object store and knowing where to bind them inside the container.
  • outputs - same as inputs, but opposite, so a map of object store paths bucket:remote/path where any files at the original internal path /internal/container/path will be uploaded to once execution is complete.
  • commands - a command to run, expressed as an array of token strings.
  • stdout, stdin and stderr with internal paths, if required. <== [TODO] Update this.

triggering execution

Any running Sisyphus nodes will be listening on a rabbitmq channel and will consume any available task message when they are ready to execute something.

running

To start a new Sisyphus node that is ready to accept new tasks:

$ lein run

Once this is up, you can put tasks on the queue and the new node will accept them and begin executing. In a new terminal execute the following commands:

$ lein repl
> (def rabbit (rabbit/connect! {}))
> (def task (task/load-task "test/tasks/md5sum.json")
> (rabbit/publish! rabbit task)

You will see the worker node pick up this task and begin executing.

Can you improve this documentation? These fine people already did:
Ryan Spangler & Jerry Morrison
Edit on GitHub

cljdoc is a website building & hosting documentation for Clojure/Script libraries

× close