When you query
or execute
something, unless you specified a custom
reducer, you get a vector of maps:
(def rows
(pg/query conn "select x from generate_series(1, 3) as x"))
rows
[{:x 1} {:x 2} {:x 3}]
These maps are not pure Clojure maps but rather special objects that mimic
Clojure maps. Namely, these are instances of the org.pg.clojure.RowMap
class
that implements APersistentMap
and some other interfaces. But at first glance,
it's a map indeed: get
, assoc
, dissoc
and other things work as expected:
(-> rows first map?)
true
(-> rows first type)
org.pg.clojure.RowMap
(-> rows first (assoc :y 2))
{:y 2, :x 1}
(-> rows first :x)
1
The RowMap
class is special in terms of laziness. Initially, it stores only an
array of keys and unparsed byte array that came from a Postgres server. When you
touch a certain key, a corresponding fragment of the array gets parsed and
cached. When you reach the same key the second time, it's taken from a cache
without parsing:
{
_keys ["Ivan" "test@test.com" ...]
_payload [....................................]
|---name---|-----email-----|...
_cache {:name "Ivan"}
}
(:x row) ;; cache miss, parse, cache set
(:x row) ;; cache hit
This feature brings two positive points. First, you don't spend time on parsing rows when receiving data from a server. The throughput is faster, and the connection is now busy for less time. You borrow a connection, get the data without parsing and let other threads use it again.
Second, we often select more fields that we actually need. For example, querying
select *
from a table with 20 fields while only two of three of them are
needed. With laziness, all 20 fields stay unparsed until you forcibly trigger
evaluation.
The RowMap
class stays itself only while you're reading. If you assoc
or
dissoc
a key, it gets transformed into an ordinary Clojure map. Adding or
removing a key triggers a process of parsing all keys and making a Clojure map:
(-> rows first type)
org.pg.clojure.RowMap
(-> rows first (assoc :y 2) type)
clojure.lang.PersistentHashMap
A RowMap
instance knows the order of keys, and it gives a couple of
features. First, it can be passed into the nth
function like a vector:
(nth row 0) ;; the first column
(nth row 1) ;; the second column
(nth row 99) ;; IndexOutOfBoundsException: the row map misses index 99
You can destruct a row map as a vector in let
binding:
(let [[id email created-at] row]
...)
Under the hood, it expands into something like this:
(let [id (nth row 0)
email (nth row 1)
created-at (nth row 2)]
...)
The second feature is, it always preserves the order of columns when reading keys, values, or processing a row as a sequence of key-value pairs. Here is a small demo:
(def row
(pg/query conn
"select 1 a, 2 b, 3 c, 4 d, 5 e, 6 f, 7 g, 8 h, 9 i,
10 j, 11 k, 12 l, 13 m, 14 n, 15 o, 16 p"
{:first? true}))
row
{:a 1, :b 2, :c 3, :d 4, :e 5, :f 6, :g 7, :h 8, :i 9,
:j 10, :k 11, :l 12, :m 13, :n 14, :o 15, :p 16}
(keys row) ;; follows the order of columns
(:a :b :c :d :e :f :g :h :i :j :k :l :m :n :o :p)
(vals row) ;; the same
(1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16)
Whereas pure Clojure maps don't preserve the order or keys:
(def row
{:a 1, :b 2, :c 3, ... :n 14, :o 15, :p 16})
(keys row) ;; different order
(:o :n :m :e :l :k :g :c :j :h :b :d :f :p :i :a)
All these features make data processing a bit easier and predictable.
Can you improve this documentation?Edit on GitHub
cljdoc is a website building & hosting documentation for Clojure/Script libraries
× close