(alternate-swap! atom f & args)
This version returns the swapped out value as well
This version returns the swapped out value as well
(crawl config)
Main entry point. Right now we have two ways of specifying seed URLs: :seed-list [seed1, seed2, ...] :seed-file /path/to/txt/file/with/seeds. Use just 1 :D.
If you specify a destination (at :destination), all records are finally written there. Otherwise, we pprint to stdout.
Config keys that we understand: :user-agent <user agent>
Main entry point. Right now we have two ways of specifying seed URLs: :seed-list [seed1, seed2, ...] :seed-file /path/to/txt/file/with/seeds. Use just 1 :D. If you specify a destination (at :destination), all records are finally written there. Otherwise, we pprint to stdout. Config keys that we understand: :user-agent <user agent>
(filter-uris uris config)
Given a list of URIs pick out an unseen distinct set. This is a helper for the enqueue op.
Given a list of URIs pick out an unseen distinct set. This is a helper for the enqueue op.
(get-time-ticks n min-delay)
Generates a set of intervals (jodatime objects) separated by min-delay
Generates a set of intervals (jodatime objects) separated by min-delay
cljdoc is a website building & hosting documentation for Clojure/Script libraries
× close