Liking cljdoc? Tell your friends :D

All platforms.

fr.jeremyschoffen.textp.alpha.reader.grammar

Textp's grammar.

We construct here textp's grammar using instaparse. Our grammar is then constructed here in two parts:

a lexical part or lexer made of regular expressions.
a set of grammatical rules tyring the lexer together into the grammar.

The lexer.

Our lexer is made of regular expression constructed with the [[textp.reader.alpha.grammar/defregex]] macro which uses the Regal library under the covers. We then assemble a lexer from these regular expressions with the [[textp.reader.alpha.grammar/make-lexer]] macro.

For instance we could construct the following 2 rules lexer:

(def-regex number [:* :digit])

(def-regex word [:* [:class ["a" "z"]]])

(def lexer (make-lexer number word))

lexer
;=> {:number {:tag :regexp
              :regexp #"\d*"}
     :word {:tag :regexp
            :regexp #"[a-z]*"}}

The grammatical rules

We use the [[instaparse.combinators/ebnf]] function to produce grammatical rules. This allows use to write these rules in the ebnf format.

For instance we could write the following:

(def rules
  (instac/ebnf
     "
     doc = (token <':'>)*
     token = (number | word)
     "))

rules
;=>{:doc {:tag :star
          :parser {:tag :cat
                   :parsers ({:tag :nt :keyword :token}
                            {:tag :string :string ":" :hide true})}}
    :token {:tag :alt
            :parsers ({:tag :nt :keyword :number}
                      {:tag :nt :keyword :word})}}

This way of writing the grammatical rules is way easier than using function combinators and still gives us these rules in map form.

The combining trick

Now that we have both a lexer and and grammatical rules, we can simply merge them to have the full grammar.

We actually get a instparse parser this way:

(def parser
  (insta/parser (merge lexer rules)
                :start :doc))

(parser "abc:1:def:2:3:")
;=> [:doc
     [:token [:word "abc"]]
     [:token [:number "1"]]
     [:token [:word "def"]]
     [:token [:number "2"]]
     [:token [:number "3"]]]
```

With the exception of some details, this is how this namespace is made.

# Textp's grammar.

We construct here textp's grammar using instaparse. Our grammar is then constructed here in two parts:
- a lexical part or lexer made of regular expressions.
- a set of grammatical rules tyring the lexer together into the grammar.

## The lexer.
Our lexer is made of regular expression constructed with the [[textp.reader.alpha.grammar/defregex]] macro
which uses the Regal library under the covers. We then assemble a lexer from these regular expressions
with the [[textp.reader.alpha.grammar/make-lexer]] macro.

For instance we could construct the following 2 rules lexer:

```clojure
(def-regex number [:* :digit])

(def-regex word [:* [:class ["a" "z"]]])

(def lexer (make-lexer number word))

lexer
;=> {:number {:tag :regexp
              :regexp #"\d*"}
     :word {:tag :regexp
            :regexp #"[a-z]*"}}
```

## The grammatical rules
We use the [[instaparse.combinators/ebnf]] function to produce grammatical rules. This allows use
to write these rules in the ebnf format.

For instance we could write the following:
```clojure
(def rules
  (instac/ebnf
     "
     doc = (token <':'>)*
     token = (number | word)
     "))

rules
;=>{:doc {:tag :star
          :parser {:tag :cat
                   :parsers ({:tag :nt :keyword :token}
                            {:tag :string :string ":" :hide true})}}
    :token {:tag :alt
            :parsers ({:tag :nt :keyword :number}
                      {:tag :nt :keyword :word})}}
```

This way of writing the grammatical rules is way easier than using function combinators and still gives us
these rules in map form.

## The combining trick
Now that we have both a lexer and and grammatical rules, we can simply merge them to have the full grammar.

We actually get a instparse parser this way:

````clojure
(def parser
  (insta/parser (merge lexer rules)
                :start :doc))

(parser "abc:1:def:2:3:")
;=> [:doc
     [:token [:word "abc"]]
     [:token [:number "1"]]
     [:token [:word "def"]]
     [:token [:number "2"]]
     [:token [:number "3"]]]
```

With the exception of some details, this is how this namespace is made.

raw docstring

all-delimitors^clj/s

source

all-grammatical-rules^clj/s

Merging of the lexer rules and the grammatical rules.

Merging of the lexer rules and the grammatical rules.

fr.jeremyschoffen.textp.alpha.reader.grammar

Textp's grammar.

The lexer.

The grammatical rules

The combining trick

all-delimitorsclj/s

all-grammatical-rulesclj/s

any-charclj/s

anythingclj/s

bracesclj/s

bracketsclj/s

comment-gclj/s

def-regexclj/smacro

diamondclj/s

double-quoteclj/s

embedded-gclj/s

embedded-g-maskedclj/s

end-commentclj/s

end-embedded-valueclj/s

end-embeded-codeclj/s

end-verbatimclj/s

escaperclj/s

escaping-charclj/s

general-gclj/s

general-g-maskedclj/s

grammarclj/s

grammar-maskedclj/s

hide-allclj/s

hide-rulesclj/s

lexerclj/s

lexer*clj/s

macro-reader-charclj/s

make-complex-symbol-regexclj/s

make-lexerclj/smacro

make-simple-symbol-regexclj/s

non-specialclj/s

normal-textclj/s

ns-endclj/s

parensclj/s

parserclj/s

plain-textclj/s

symbol-first-charclj/s

symbol-ns-partclj/s

symbol-regular-charclj/s

symbol-regular-char-setclj/s

tag-gclj/s

tag-g-maskedclj/s

tag-plain-textclj/s

text-commentclj/s

text-e-codeclj/s

text-e-valueclj/s

text-escaped-charclj/s

text-gclj/s

text-g-maskedclj/s

text-spacesclj/s

text-symbolclj/s

text-t-cljclj/s

text-t-clj-non-specialclj/s

text-t-clj-strclj/s

text-t-clj-str-non-specialclj/s

text-t-txt-non-specialclj/s

text-verbatimclj/s

verbatim-gclj/s

all-delimitors^clj/s

all-grammatical-rules^clj/s

any-char^clj/s

anything^clj/s

braces^clj/s

brackets^clj/s

comment-g^clj/s

def-regex^clj/smacro

diamond^clj/s

double-quote^clj/s

embedded-g^clj/s

embedded-g-masked^clj/s

end-comment^clj/s

end-embedded-value^clj/s

end-embeded-code^clj/s

end-verbatim^clj/s

escaper^clj/s

escaping-char^clj/s

general-g^clj/s

general-g-masked^clj/s

grammar^clj/s

grammar-masked^clj/s

hide-all^clj/s

hide-rules^clj/s

lexer^clj/s

lexer*^clj/s

macro-reader-char^clj/s

make-complex-symbol-regex^clj/s

make-lexer^clj/smacro

make-simple-symbol-regex^clj/s

non-special^clj/s

normal-text^clj/s

ns-end^clj/s

parens^clj/s

parser^clj/s

plain-text^clj/s

symbol-first-char^clj/s

symbol-ns-part^clj/s

symbol-regular-char^clj/s

symbol-regular-char-set^clj/s

tag-g^clj/s

tag-g-masked^clj/s

tag-plain-text^clj/s

text-comment^clj/s

text-e-code^clj/s

text-e-value^clj/s

text-escaped-char^clj/s

text-g^clj/s

text-g-masked^clj/s

text-spaces^clj/s

text-symbol^clj/s

text-t-clj^clj/s

text-t-clj-non-special^clj/s

text-t-clj-str^clj/s

text-t-clj-str-non-special^clj/s

text-t-txt-non-special^clj/s

text-verbatim^clj/s

verbatim-g^clj/s