Tools for lazily separating, tokenizing and tagging sentences.
Tools for lazily separating, tokenizing and tagging sentences.
(lazy-chunk sentences tokenizer pos-tagger chunker)
Given a sequence of sentences, a tokenizer, a pos-tagger and a chunker, return a lazy sequence of treebank-chunked sentences.
Given a sequence of sentences, a tokenizer, a pos-tagger and a chunker, return a lazy sequence of treebank-chunked sentences.
(lazy-get-sentences text sentence-finder)
Given a sequence of texts and a sentence-finder, return a lazy sequence of sentences for each text.
Given a sequence of texts and a sentence-finder, return a lazy sequence of sentences for each text.
(lazy-tag sentences tokenizer pos-tagger)
Given a sequence of sentences, a tokenizer and a pos-tagger, return a lazy sequence of pos-tagged sentences.
Given a sequence of sentences, a tokenizer and a pos-tagger, return a lazy sequence of pos-tagged sentences.
(lazy-tokenize sentences tokenizer)
Given a sequence of sentences, and a tokenizer, return a lazy sequence of tokenized sentences.
Given a sequence of sentences, and a tokenizer, return a lazy sequence of tokenized sentences.
(sentence-seq rdr sentence-finder)
lazily read sentences from rdr as a lazy sequence of strings using the given sentence-finder. rdr must implement java.io.BufferedReader.
lazily read sentences from rdr as a lazy sequence of strings using the given sentence-finder. rdr must implement java.io.BufferedReader.
cljdoc is a website building & hosting documentation for Clojure/Script libraries
× close