zero-one.geni.ml


aft-survival-regressionclj

(aft-survival-regression params)

Fit a parametric survival regression model named accelerated failure time (AFT) model
(see Accelerated failure time model (Wikipedia))
based on the Weibull distribution of the survival time.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/ml/regression/AFTSurvivalRegression.html
Timestamp: 2020-10-02T14:21:20.345Z
source
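
A minimal usage sketch, not taken from the library's documentation: it assumes kebab-case parameter keys mirroring Spark's AFTSurvivalRegression params (e.g. :censor-col for censorCol) and a hypothetical dataframe training-df with "features", "label" and "censor" columns; fit and transform are documented further down this page.

  (require '[zero-one.geni.ml :as ml])

  ;; Build the estimator from a params map, fit it, and score the training data.
  (def aft       (ml/aft-survival-regression {:censor-col "censor"}))
  (def aft-model (ml/fit training-df aft))
  (ml/transform training-df aft-model)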

alsclj

(als params)
source

alternating-least-squaresclj

source

approx-nearest-neighboursclj

(approx-nearest-neighbours dataset model key-v n-nearest)
(approx-nearest-neighbours dataset model key-v n-nearest dist-col)
source

approx-similarity-joinclj

(approx-similarity-join dataset-a dataset-b model threshold)
(approx-similarity-join dataset-a dataset-b model threshold dist-col)
source

association-rulesclj

(association-rules model)
source

best-modelclj

(best-model model)
source

binariserclj

(binariser params)
source

binarizerclj

source

binary-classification-evaluatorclj

(binary-classification-evaluator params)
source

binary-summaryclj

(binary-summary model)
source

bisecting-k-meansclj

(bisecting-k-means params)
source

boundariesclj

(boundaries model)
source

bucketed-random-projection-lshclj

(bucketed-random-projection-lsh params)
source

bucketiserclj

(bucketiser params)
source

bucketizerclj

source

category-mapsclj

(category-maps model)
source

category-sizesclj

(category-sizes model)
source

chi-sq-selectorclj

(chi-sq-selector params)
source

chi-square-testclj

(chi-square-test dataframe features-col label-col)
source

cluster-centersclj

(cluster-centers model)
source

clustering-evaluatorclj

(clustering-evaluator params)
source

coefficient-matrixclj

(coefficient-matrix model)
source

coefficientsclj

(coefficients model)
source

corrcljmultimethod

source

count-vectoriserclj

(count-vectoriser params)
source

count-vectorizerclj

source

cross-validatorclj

(cross-validator {:keys [estimator evaluator estimator-param-maps num-folds seed
                         parallelism]})
source
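
A hedged tuning sketch built from the keys shown above; grid-spec is a hypothetical placeholder because the exact grid format accepted by param-grid (documented later on this page) is not shown here, and the empty params maps are assumed to fall back to Spark defaults.

  (require '[zero-one.geni.ml :as ml])

  (def cv
    (ml/cross-validator
     {:estimator            (ml/logistic-regression {})
      :evaluator            (ml/binary-classification-evaluator {})
      :estimator-param-maps (ml/param-grid grid-spec) ;; grid-spec: hypothetical grid definition
      :num-folds            3}))

  ;; Fitting runs k-fold cross-validation; best-model extracts the winning model.
  (def cv-model (ml/fit training-df cv))
  (ml/best-model cv-model)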

dctclj

source

decision-tree-classifierclj

(decision-tree-classifier params)
source

decision-tree-regressorclj

(decision-tree-regressor params)

Decision tree learning algorithm for regression.
It supports both continuous and categorical features.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/ml/regression/DecisionTreeRegressor.html
Timestamp: 2020-10-02T14:21:20.720Z
source
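
A short sketch under the same assumptions as the earlier examples (kebab-case keys such as :max-depth for Spark's maxDepth, a hypothetical training-df); depth and feature-importances are accessors documented elsewhere on this page and are assumed to apply to the fitted model.

  (require '[zero-one.geni.ml :as ml])

  (def dt       (ml/decision-tree-regressor {:max-depth 5}))
  (def dt-model (ml/fit training-df dt))

  (ml/depth dt-model)                ;; depth of the learned tree
  (ml/feature-importances dt-model)  ;; per-feature importance vector
  (ml/transform training-df dt-model)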

depthclj

(depth model)
source

describe-topicsclj

source

discrete-cosine-transformclj

(discrete-cosine-transform params)
source

distributed?clj

source

elementwise-productclj

(elementwise-product params)
source

estimated-doc-concentrationclj

(estimated-doc-concentration model)
source

evaluateclj

(evaluate dataframe evaluator)
source

feature-hasherclj

(feature-hasher params)
source

feature-importancesclj

(feature-importances model)
source

features-colclj

source

find-frequent-sequential-patternsclj

(find-frequent-sequential-patterns dataset prefix-span)
source

find-patternsclj

source

fitclj

(fit dataframe estimator)
source

fm-classifierclj

(fm-classifier params)
source

fm-regressorclj

(fm-regressor params)

Factorization Machines learning algorithm for regression.
It supports normal gradient descent and the AdamW solver. The implementation is based upon:
S. Rendle. "Factorization machines." 2010.

FM is able to estimate interactions even in problems with huge sparsity
(like advertising and recommendation systems). The FM formula is:

  $$
  \begin{align}
  y = w_0 + \sum\limits^n_{i=1} w_i x_i +
    \sum\limits^n_{i=1} \sum\limits^n_{j=i+1} \langle v_i, v_j \rangle x_i x_j
  \end{align}
  $$

The first two terms denote the global bias and the linear term (the same as in linear regression),
and the last term denotes the pairwise interactions term. v_i describes the i-th variable
with k factors. The FM regression model uses MSE loss, which can be solved by gradient descent,
and regularization terms like L2 are usually added to the loss function to prevent overfitting.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/ml/regression/FMRegressor.html
Timestamp: 2020-10-02T14:21:21.102Z
source
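
A hedged sketch; :factor-size and :step-size are assumed kebab-case counterparts of Spark's factorSize and stepSize, and training-df is a hypothetical dataframe with "features" and "label" columns.

  (require '[zero-one.geni.ml :as ml])

  ;; The factor size k corresponds to the length of the v_i vectors in the formula above.
  (def fm       (ml/fm-regressor {:factor-size 8 :step-size 0.01}))
  (def fm-model (ml/fit training-df fm))
  (ml/transform training-df fm-model)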

fp-growthclj

(fp-growth params)
source

freq-itemsetsclj

source

frequent-item-setsclj

(frequent-item-sets model)
source

frequent-pattern-growthclj

source

gaussian-mixtureclj

(gaussian-mixture params)
source

gaussians-dfclj

(gaussians-df model)
source

gbt-classifierclj

(gbt-classifier params)
source

gbt-regressorclj

(gbt-regressor params)

Gradient-Boosted Trees (GBTs) learning algorithm for regression.
It supports both continuous and categorical features.
The implementation is based upon: J.H. Friedman. "Stochastic Gradient Boosting." 1999.

Notes on Gradient Boosting vs. TreeBoost:
This implementation is for Stochastic Gradient Boosting, not for TreeBoost.
Both algorithms learn tree ensembles by minimizing loss functions.
TreeBoost (Friedman, 1999) additionally modifies the outputs at tree leaf nodes
based on the loss function, whereas the original gradient boosting method does not.
When the loss is SquaredError, these methods give the same result, but they could differ
for other loss functions.
We expect to implement TreeBoost in the future:
https://issues.apache.org/jira/browse/SPARK-4240

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/ml/regression/GBTRegressor.html
Timestamp: 2020-10-02T14:21:21.485Z
source
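
A hedged sketch with assumed kebab-case keys (:max-iter for maxIter, :max-depth for maxDepth) and a hypothetical training-df; trees and tree-weights are accessors documented elsewhere on this page and are assumed to work on the fitted ensemble.

  (require '[zero-one.geni.ml :as ml])

  (def gbt       (ml/gbt-regressor {:max-iter 20 :max-depth 4}))
  (def gbt-model (ml/fit training-df gbt))

  (ml/tree-weights gbt-model)  ;; weight of each boosting stage
  (ml/trees gbt-model)         ;; the individual decision trees
  (ml/transform training-df gbt-model)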

generalised-linear-regressionclj

(generalised-linear-regression params)

Fit a Generalized Linear Model
(see Generalized linear model (Wikipedia))
specified by giving a symbolic description of the linear
predictor (link function) and a description of the error distribution (family).
It supports "gaussian", "binomial", "poisson", "gamma" and "tweedie" as family.
Valid link functions for each family are listed below. The first link function of each family
is the default one.

"gaussian" : "identity", "log", "inverse"
"binomial" : "logit", "probit", "cloglog"
"poisson"  : "log", "identity", "sqrt"
"gamma"    : "inverse", "identity", "log"
"tweedie"  : power link function specified through "linkPower". The default link power in
             the tweedie family is 1 - variancePower.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/ml/regression/GeneralizedLinearRegression.html
Timestamp: 2020-10-02T14:21:21.946Z
source
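
A hedged sketch using the families and links listed above; the keys :family, :link and :max-iter are assumed kebab-case counterparts of Spark's params, and training-df is hypothetical. The same shape applies to the generalized-linear-regression and glm aliases below.

  (require '[zero-one.geni.ml :as ml])

  (def glr       (ml/generalised-linear-regression
                  {:family "gamma" :link "log" :max-iter 25}))
  (def glr-model (ml/fit training-df glr))

  (ml/coefficients glr-model)  ;; fitted coefficient vector
  (ml/intercept glr-model)
  (ml/transform training-df glr-model)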

generalized-linear-regressionclj

(generalized-linear-regression params)

Fit a Generalized Linear Model
(see Generalized linear model (Wikipedia))
specified by giving a symbolic description of the linear
predictor (link function) and a description of the error distribution (family).
It supports "gaussian", "binomial", "poisson", "gamma" and "tweedie" as family.
Valid link functions for each family are listed below. The first link function of each family
is the default one.

"gaussian" : "identity", "log", "inverse"
"binomial" : "logit", "probit", "cloglog"
"poisson"  : "log", "identity", "sqrt"
"gamma"    : "inverse", "identity", "log"
"tweedie"  : power link function specified through "linkPower". The default link power in
             the tweedie family is 1 - variancePower.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/ml/regression/GeneralizedLinearRegression.html
Timestamp: 2020-10-02T14:21:21.946Z
source

get-features-colclj

(get-features-col model)
source

get-input-colclj

(get-input-col model)
source

get-input-colsclj

(get-input-cols model)
source

get-label-colclj

(get-label-col model)
source

get-num-treesclj

(get-num-trees model)
source

get-output-colclj

(get-output-col model)
source

get-output-colsclj

(get-output-cols model)
source

get-prediction-colclj

(get-prediction-col model)
source

get-probability-colclj

(get-probability-col model)
source

get-raw-prediction-colclj

(get-raw-prediction-col model)
source

get-sizeclj

(get-size model)
source

get-thresholdsclj

(get-thresholds model)
source

glmclj

(glm params)

Fit a Generalized Linear Model
(see Generalized linear model (Wikipedia))
specified by giving a symbolic description of the linear
predictor (link function) and a description of the error distribution (family).
It supports "gaussian", "binomial", "poisson", "gamma" and "tweedie" as family.
Valid link functions for each family are listed below. The first link function of each family
is the default one.

"gaussian" : "identity", "log", "inverse"
"binomial" : "logit", "probit", "cloglog"
"poisson"  : "log", "identity", "sqrt"
"gamma"    : "inverse", "identity", "log"
"tweedie"  : power link function specified through "linkPower". The default link power in
             the tweedie family is 1 - variancePower.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/ml/regression/GeneralizedLinearRegression.html
Timestamp: 2020-10-02T14:21:21.946Z
source

gmmclj

source

hashing-tfclj

(hashing-tf params)
source

idfclj

(idf params)
source

idf-vectorclj

(idf-vector model)
source

imputerclj

(imputer params)
source

index-to-stringclj

(index-to-string params)
source

input-colclj

source

input-colsclj

source

interactionclj

(interaction params)
source

interceptclj

(intercept model)
source

intercept-vectorclj

(intercept-vector model)
source

is-distributedclj

(is-distributed model)
source

isotonic-regressionclj

(isotonic-regression params)

Isotonic regression.
Currently implemented using parallelized pool adjacent violators algorithm.
Only univariate (single feature) algorithm supported.
Uses org.apache.spark.mllib.regression.IsotonicRegression.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/ml/regression/IsotonicRegression.html
Timestamp: 2020-10-02T14:21:22.305Z
source
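
A minimal sketch with a hypothetical training-df; the empty params map is assumed to fall back to Spark defaults, and boundaries is documented elsewhere on this page and is assumed to return the fitted model's boundary points.

  (require '[zero-one.geni.ml :as ml])

  (def iso       (ml/isotonic-regression {}))
  (def iso-model (ml/fit training-df iso))

  (ml/boundaries iso-model)  ;; boundaries of the fitted piecewise-linear function
  (ml/transform training-df iso-model)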

item-factorsclj

(item-factors model)
source

k-meansclj

(k-means params)
source

kolmogorov-smirnov-testclj

(kolmogorov-smirnov-test dataframe sample-col dist-name params)
source

label-colclj

source

labelsclj

(labels model)
source

latent-dirichlet-allocationclj

source

ldaclj

(lda params)
source

linear-regressionclj

(linear-regression params)

Linear regression.

The learning objective is to minimize the specified loss function, with regularization.
This supports two kinds of loss:
  squaredError (a.k.a. squared loss)
  huber (a hybrid of squared error for relatively small errors and absolute error for
    relatively large ones, and we estimate the scale parameter from training data)

This supports multiple types of regularization:
  none (a.k.a. ordinary least squares)
  L2 (ridge regression)
  L1 (Lasso)
  L2 + L1 (elastic net)

The squared error objective function is:

  $$
  \begin{align}
  \min_{w}\frac{1}{2n}{\sum_{i=1}^n(X_{i}w - y_{i})^{2} +
  \lambda\left[\frac{1-\alpha}{2}{||w||_{2}}^{2} + \alpha{||w||_{1}}\right]}
  \end{align}
  $$

The huber objective function is:

  $$
  \begin{align}
  \min_{w, \sigma}\frac{1}{2n}{\sum_{i=1}^n\left(\sigma +
  H_m\left(\frac{X_{i}w - y_{i}}{\sigma}\right)\sigma\right) + \frac{1}{2}\lambda {||w||_2}^2}
  \end{align}
  $$

where

  $$
  \begin{align}
  H_m(z) = \begin{cases}
           z^2, & \text{if } |z| < \epsilon, \\
           2\epsilon|z| - \epsilon^2, & \text{otherwise}
           \end{cases}
  \end{align}
  $$

Note: Fitting with huber loss only supports none and L2 regularization.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/ml/regression/LinearRegression.html
Timestamp: 2020-10-02T14:21:22.713Z
source
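
A hedged sketch; :max-iter, :reg-param, :elastic-net-param and :loss are assumed kebab-case counterparts of Spark's maxIter, regParam, elasticNetParam and loss, the loss value comes from the docstring above, and training-df is hypothetical.

  (require '[zero-one.geni.ml :as ml])

  ;; Elastic net: alpha = 0.5 mixes the L1 and L2 penalties in the objective above.
  (def lr       (ml/linear-regression {:max-iter          50
                                       :reg-param         0.1
                                       :elastic-net-param 0.5
                                       :loss              "squaredError"}))
  (def lr-model (ml/fit training-df lr))

  (ml/coefficients lr-model)  ;; w in the objective above
  (ml/intercept lr-model)
  (ml/transform training-df lr-model)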

linear-svcclj

(linear-svc params)
source

load-methodclj

(load-method cls)
source

load-method?clj

(load-method? method)
source

log-likelihoodclj

(log-likelihood dataset model)
source

log-perplexityclj

(log-perplexity dataset model)
source

logistic-regressionclj

(logistic-regression params)
source

max-absclj

(max-abs model)
source

max-abs-scalerclj

(max-abs-scaler params)
source

meanclj

(mean model)
source

min-hash-lshclj

(min-hash-lsh params)
source

min-max-scalerclj

(min-max-scaler params)
source

mlp-classifierclj

(mlp-classifier params)
source

multiclass-classification-evaluatorclj

(multiclass-classification-evaluator params)
source

multilabel-classification-evaluatorclj

(multilabel-classification-evaluator params)
source

multilayer-perceptron-classifierclj

source

n-gramclj

(n-gram params)
source

naive-bayesclj

(naive-bayes params)
source

normaliserclj

(normaliser params)
source

normalizerclj

source

num-classesclj

(num-classes model)
source

num-featuresclj

(num-features model)
source

num-nodesclj

(num-nodes model)
source

one-hot-encoderclj

(one-hot-encoder params)
source

one-vs-restclj

(one-vs-rest params)
source

original-maxclj

(original-max model)
source

original-minclj

(original-min model)
source

output-colclj

source

output-colsclj

source

param-gridclj

(param-grid grids)
source

paramsclj

(params stage)
source

pcclj

(pc model)
source

pcaclj

(pca params)
source

piclj

(pi model)
source

pipelineclj

(pipeline & stages)
source
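
A hedged sketch of composing stages documented on this page into a single estimator; the column names, kebab-case parameter keys (:input-col, :output-col, :input-cols, :features-col, :label-col) and the dataframes training-df / test-df are assumptions, not part of the documented arglist.

  (require '[zero-one.geni.ml :as ml])

  (def pipe
    (ml/pipeline
     (ml/string-indexer   {:input-col "category" :output-col "category-index"})
     (ml/vector-assembler {:input-cols ["x1" "x2" "category-index"]
                           :output-col "features"})
     (ml/logistic-regression {:features-col "features" :label-col "label"})))

  ;; Fitting a pipeline fits each stage in order; transform applies the whole chain.
  (def pipe-model (ml/fit training-df pipe))
  (ml/transform test-df pipe-model)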

polynomial-expansionclj

(polynomial-expansion params)
source

power-iteration-clusteringclj

(power-iteration-clustering params)
source

prediction-colclj

source

prefix-spanclj

(prefix-span params)
source

principal-componentsclj

source

probability-colclj

source

quantile-discretiserclj

(quantile-discretiser params)
source

quantile-discretizerclj

source

random-forest-classifierclj

(random-forest-classifier params)
source

random-forest-regressorclj

(random-forest-regressor params)

Random Forest learning algorithm for regression.
It supports both continuous and categorical features.

Source: https://spark.apache.org/docs/latest/api/scala/org/apache/spark/ml/regression/RandomForestRegressor.html
Timestamp: 2020-10-02T14:21:23.298Z
source
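
A hedged sketch with assumed kebab-case keys (:num-trees for numTrees, :max-depth for maxDepth) and a hypothetical training-df; get-num-trees and feature-importances are documented elsewhere on this page and are assumed to apply to the fitted model.

  (require '[zero-one.geni.ml :as ml])

  (def rf       (ml/random-forest-regressor {:num-trees 100 :max-depth 6}))
  (def rf-model (ml/fit training-df rf))

  (ml/get-num-trees rf-model)        ;; number of trees in the ensemble
  (ml/feature-importances rf-model)  ;; per-feature importance vector
  (ml/transform training-df rf-model)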

ranking-evaluatorclj

(ranking-evaluator params)
source

raw-prediction-colclj

source

read-stage!clj

(read-stage! model-cls path)
source

recommend-for-all-itemsclj

(recommend-for-all-items model num-users)
source

recommend-for-all-usersclj

(recommend-for-all-users model num-items)
source

recommend-for-item-subsetclj

(recommend-for-item-subset model items-df num-users)
source

recommend-for-user-subsetclj

(recommend-for-user-subset model users-df num-items)
source

recommend-itemsclj

(recommend-items model num-items)
(recommend-items model users-df num-items)
source

recommend-usersclj

(recommend-users model num-users)
(recommend-users model items-df num-users)
source

regex-tokeniserclj

(regex-tokeniser params)
source

regex-tokenizerclj

source

regression-evaluatorclj

(regression-evaluator params)
source

robust-scalerclj

(robust-scaler params)
source

root-nodeclj

(root-node model)
source

scaleclj

(scale model)
source

sql-transformerclj

(sql-transformer params)
source

stagesclj

(stages model)
source

standard-scalerclj

(standard-scaler params)
source

stdclj

(std model)
source

stop-words-removerclj

(stop-words-remover params)
source

string-indexerclj

(string-indexer params)
source

summaryclj

(summary model)
source

supported-optimisersclj

source

supported-optimizersclj

(supported-optimizers model)
source

surrogate-dfclj

(surrogate-df model)
source

thetaclj

(theta model)
source

thresholdsclj

source

tokeniserclj

(tokeniser params)
source

tokenizerclj

source

total-num-nodesclj

(total-num-nodes model)
source

train-validation-splitclj

(train-validation-split {:keys [estimator evaluator estimator-param-maps seed
                                parallelism]})
source

transformclj

(transform dataframe transformer)
source

tree-weightsclj

(tree-weights model)
source

treesclj

(trees model)
source

uidclj

(uid model)
source

user-factorsclj

(user-factors model)
source

vector->arrayclj

source

vector-assemblerclj

(vector-assembler params)
source

vector-indexerclj

(vector-indexer params)
source

vector-size-hintclj

(vector-size-hint params)
source

vector-to-arrayclj

(vector-to-array expr)
(vector-to-array expr dtype)
source

vocab-sizeclj

(vocab-size model)
source

vocabularyclj

(vocabulary model)
source

weightsclj

(weights model)
source

word2vecclj

(word2vec params)
source

write-native-model!clj

(write-native-model! model path)
source

write-stage!clj

(write-stage! stage path)
(write-stage! stage path options)
source

xgboost-classifierclj

(xgboost-classifier params)
source

xgboost-regressorclj

(xgboost-regressor params)
source
