Changelog

All notable changes to this project are documented here. The format follows Keep a Changelog, and the project adheres to Semantic Versioning.

0.1.0 - 2026-06-22

Initial release.

Added

  • from-file, from-pretrained, from-stream constructors over DJL's HuggingFaceTokenizer (native Rust tokenizers via JNI).
  • encode returning a Clojure map (:ids :tokens :type-ids :word-ids :attention-mask :special-tokens-mask), with :add-special-tokens? / :with-overflowing-tokens? options.
  • ids, tokens, count-tokens, decode, and batch-encode helpers.
  • Tokenizers are Closeable, so with-open frees the native handle.
