(get-images-from-pdf
pdf-input-stream
&
{:keys [start-page end-page dpi ext]
:or {start-page 0 end-page Integer/MAX_VALUE dpi 300 ext "png"}})
Return the image's byte-array list of a documnet.PDF supported. Args:
:start-page
: A start page of the pdf file. (default 0):end-page
: A last page of the pdf file. (default Integer/MAX_VALUE)dpi
: A number of dots that fit horizontally and vertically into a one-inch length. (default 300)ext
: A string with the extension of the image.(default 'png')Return the image's byte-array list of a documnet.PDF supported. Args: - pdf-input-stream: An InputStream of a PDF file, e.g., If you have a file `(clojure.java.io/input-stream "/path/to/your/file.pdf") - options: A map with the following keys - `:start-page`: A start page of the pdf file. (default 0) - `:end-page`: A last page of the pdf file. (default Integer/MAX_VALUE) - `dpi`: A number of dots that fit horizontally and vertically into a one-inch length. (default 300) - `ext`: A string with the extension of the image.(default 'png')
(get-text path-or-input-stream)
Return the text of a document. PDF, DOCX, and other formats are supported.
Returns:
:id
: A string with the document id.:text
: A string with the document text.:metadata
: A map with the document metadata.Return the text of a document. PDF, DOCX, and other formats are supported. Returns: - A map with the following keys: - `:id`: A string with the document id. - `:text`: A string with the document text. - `:metadata`: A map with the document metadata.
cljdoc is a website building & hosting documentation for Clojure/Script libraries
× close