Liking cljdoc? Tell your friends :D

lucene.custom.text-analysis


text->graphclj

(text->graph text)
(text->graph text analyzer)
(text->graph text analyzer field-name)

Given a text (and an optional analyzer) turns the text into a TokenStream that will be converted to the dot language program as a string, e.g.: `digraph tokens { graph [ fontsize=30 labelloc="t" label="" splines=true overlap=false rankdir = "LR" ]; // A2 paper size size = "34.4,16.5"; edge [ fontname="Helvetica" fontcolor="red" color="#606060" ] node [ style="filled" fillcolor="#e8e8f0" shape="Mrecord" fontname="Helvetica" ]

0 [label="0"] -1 [shape=point color=white] -1 -> 0 [] 0 -> 1 [ label="foobarbazs / fooBarBazs"] -2 [shape=point color=white] 1 -> -2 [] }`

Given a text (and an optional analyzer) turns the text into a TokenStream
that will be converted to the `dot` language program as a string, e.g.:
`digraph tokens {
   graph [ fontsize=30 labelloc=\"t\" label=\"\" splines=true overlap=false rankdir = \"LR\" ];
   // A2 paper size
   size = \"34.4,16.5\";
   edge [ fontname=\"Helvetica\" fontcolor=\"red\" color=\"#606060\" ]
   node [ style=\"filled\" fillcolor=\"#e8e8f0\" shape=\"Mrecord\" fontname=\"Helvetica\" ]

   0 [label=\"0\"]
   -1 [shape=point color=white]
   -1 -> 0 []
   0 -> 1 [ label=\"foobarbazs / fooBarBazs\"]
   -2 [shape=point color=white]
   1 -> -2 []
 }`
sourceraw docstring

text->token-stringsclj

(text->token-strings text)
(text->token-strings text analyzer)
(text->token-strings text analyzer field-name)

Given a text (and an optional analyzer) returns a vector of tokens as strings.

Given a text (and an optional analyzer) returns a vector of tokens as strings.
sourceraw docstring

text->tokensclj

(text->tokens text)
(text->tokens text analyzer)
(text->tokens text analyzer field-name)

Given a text (and an optional analyzer) returns a list of tokens as maps of shape: {:token "pre", :type "<ALPHANUM>", :start_offset 0, :end_offset 3, :position 0, :positionLength 1}

Given a text (and an optional analyzer) returns a list of tokens as maps of shape:
{:token "pre",
 :type "<ALPHANUM>",
 :start_offset 0,
 :end_offset 3,
 :position 0,
 :positionLength 1}
sourceraw docstring

cljdoc is a website building & hosting documentation for Clojure/Script libraries

× close