Classification H2O GLM Learner

H2O Connection

If no running H2O connection is found, the learner will automatically start a local H2O server on 127.0.0.1 via h2o::h2o.init(). If you want to connect to a remote H2O cluster, call h2o::h2o.init() with the appropriate arguments before training or predicting.

Dictionary

This Learner can be instantiated via lrn():

lrn("classif.h2o.glm")

Meta Information

Task type: “classif”
Predict Types: “response”, “prob”
Feature Types: “integer”, “numeric”, “factor”
Required Packages: mlr3, mlr3extralearners, h2o

Parameters

Id	Type	Default	Levels	Range
alpha	numeric	0.5		$[0, 1]$
balance_classes	logical	FALSE	TRUE, FALSE	-
beta_constraints	untyped	NULL		-
beta_epsilon	numeric	1e-04		$[0, \infty)$
build_null_model	logical	FALSE	TRUE, FALSE	-
calc_like	logical	FALSE	TRUE, FALSE	-
checkpoint	untyped	NULL		-
class_sampling_factors	untyped	NULL		-
cold_start	logical	FALSE	TRUE, FALSE	-
compute_p_values	logical	FALSE	TRUE, FALSE	-
early_stopping	logical	TRUE	TRUE, FALSE	-
export_checkpoints_dir	untyped	NULL		-
gainslift_bins	integer	-1		$[-1, \infty)$
generate_scoring_history	logical	FALSE	TRUE, FALSE	-
generate_variable_inflation_factors	logical	FALSE	TRUE, FALSE	-
gradient_epsilon	numeric	-1		$[0, \infty)$
HGLM	logical	FALSE	TRUE, FALSE	-
ignore_const_cols	logical	TRUE	TRUE, FALSE	-
interactions	untyped	NULL		-
interaction_pairs	untyped	NULL		-
intercept	logical	TRUE	TRUE, FALSE	-
lambda	numeric	1e-05		$[0, \infty)$
lambda_min_ratio	numeric	-1		$[0, 1]$
lambda_search	logical	FALSE	TRUE, FALSE	-
link	character	logit	family_default, logit	-
max_active_predictors	integer	-1		$[1, \infty)$
max_after_balance_size	numeric	5		$[0, \infty)$
max_iterations	integer	-1		$[0, \infty)$
max_runtime_secs	numeric	0		$[0, \infty)$
missing_values_handling	character	MeanImputation	MeanImputation, Skip, PlugValues	-
nlambdas	integer	-1		$[1, \infty)$
non_negative	logical	FALSE	TRUE, FALSE	-
objective_epsilon	numeric	-1		$[0, \infty)$
obj_reg	numeric	-1		$[0, \infty)$
plug_values	untyped	NULL		-
prior	numeric	-1		$[0, \infty)$
random_columns	untyped	NULL		-
remove_collinear_columns	logical	FALSE	TRUE, FALSE	-
score_each_iteration	logical	FALSE	TRUE, FALSE	-
score_iteration_interval	integer	-1		$(-\infty, \infty)$
seed	integer	-1		$(-\infty, \infty)$
solver	character	AUTO	AUTO, IRLSM, L_BFGS, COORDINATE_DESCENT, COORDINATE_DESCENT_NAIVE	-
standardize	logical	TRUE	TRUE, FALSE	-
startval	untyped	NULL		-
stopping_metric	character	AUTO	AUTO, logloss, AUC, AUCPR, lift_top_group, misclassification, mean_per_class_error	-
stopping_rounds	integer	0		$[0, \infty)$
stopping_tolerance	numeric	0.001		$[0, \infty)$

References

Fryda T, LeDell E, Gill N, Aiello S, Fu A, Candel A, Click C, Kraljevic T, Nykodym T, Aboyoun P, Kurka M, Malohlava M, Poirier S, Wong W (2025). h2o: R Interface for the 'H2O' Scalable Machine Learning Platform. R package version 3.46.0.9, https://github.com/h2oai/h2o-3.

Author

awinterstetter

Super classes

mlr3::Learner -> mlr3::LearnerClassif -> LearnerClassifH2OGLM

Methods

Inherited methods

`LearnerClassifH2OGLM$new()`

Creates a new instance of this R6 class.

Usage

LearnerClassifH2OGLM$new()

`LearnerClassifH2OGLM$clone()`

The objects of this class are cloneable with this method.

Usage

LearnerClassifH2OGLM$clone(deep = FALSE)

Arguments

deep: Whether to make a deep clone.

Examples

# Define the Learner
learner = lrn("classif.h2o.glm")
print(learner)
#> 
#> ── <LearnerClassifH2OGLM> (classif.h2o.glm): H2O GLM ───────────────────────────
#> • Model: -
#> • Parameters: list()
#> • Packages: mlr3, mlr3extralearners, and h2o
#> • Predict Types: [response] and prob
#> • Feature Types: integer, numeric, and factor
#> • Encapsulation: none (fallback: -)
#> • Properties: missings, twoclass, and weights
#> • Other settings: use_weights = 'use', predict_raw = 'FALSE'

# Define a Task
task = tsk("sonar")

# Create train and test set
ids = partition(task)

# Train the learner on the training ids
learner$train(task, row_ids = ids$train)

print(learner$model)
#> Model Details:
#> ==============
#> 
#> H2OBinomialModel: glm
#> Model ID:  GLM_model_R_1784104713728_56 
#> GLM Model: summary
#>     family  link                               regularization
#> 1 binomial logit Elastic Net (alpha = 0.5, lambda = 0.04285 )
#>   number_of_predictors_total number_of_active_predictors number_of_iterations
#> 1                         60                          32                    8
#>     training_frame
#> 1 data_sid_a9e8_11
#> 
#> Coefficients: glm coefficients
#>       names coefficients standardized_coefficients
#> 1 Intercept     3.534901                 -0.404818
#> 2        V1   -10.436413                 -0.253885
#> 3       V10    -0.741746                 -0.105123
#> 4       V11    -2.972127                 -0.415669
#> 5       V12    -0.917467                 -0.129855
#> 
#> ---
#>    names coefficients standardized_coefficients
#> 56   V59    -7.635727                 -0.048336
#> 57    V6     0.000000                  0.000000
#> 58   V60    -0.979867                 -0.005266
#> 59    V7     0.000000                  0.000000
#> 60    V8     0.000000                  0.000000
#> 61    V9    -0.331270                 -0.040973
#> 
#> H2OBinomialMetrics: glm
#> ** Reported on training data. **
#> 
#> MSE:  0.1084073
#> RMSE:  0.3292527
#> LogLoss:  0.3543595
#> Mean Per-Class Error:  0.1370092
#> AUC:  0.9419382
#> AUCPR:  0.93679
#> Gini:  0.8838764
#> R^2:  0.5625443
#> Residual Deviance:  98.51195
#> AIC:  164.512
#> 
#> Confusion Matrix (vertical: actual; across: predicted) for F1-optimal threshold:
#>         M  R    Error     Rate
#> M      60 16 0.210526   =16/76
#> R       4 59 0.063492    =4/63
#> Totals 64 75 0.143885  =20/139
#> 
#> Maximum Metrics: Maximum metrics at their respective thresholds
#>                         metric threshold     value idx
#> 1                       max f1  0.378034  0.855072  74
#> 2                       max f2  0.293861  0.906433  89
#> 3                 max f0point5  0.571462  0.885609  51
#> 4                 max accuracy  0.571462  0.863309  51
#> 5                max precision  0.972056  1.000000   0
#> 6                   max recall  0.232271  1.000000  96
#> 7              max specificity  0.972056  1.000000   0
#> 8             max absolute_mcc  0.571462  0.729675  51
#> 9   max min_per_class_accuracy  0.473346  0.842105  65
#> 10 max mean_per_class_accuracy  0.378034  0.862991  74
#> 11                     max tns  0.972056 76.000000   0
#> 12                     max fns  0.972056 62.000000   0
#> 13                     max fps  0.000273 76.000000 138
#> 14                     max tps  0.232271 63.000000  96
#> 15                     max tnr  0.972056  1.000000   0
#> 16                     max fnr  0.972056  0.984127   0
#> 17                     max fpr  0.000273  1.000000 138
#> 18                     max tpr  0.232271  1.000000  96
#> 
#> Gains/Lift Table: Extract with `h2o.gainsLift(<model>, <data>)` or `h2o.gainsLift(<model>, valid=<T/F>, xval=<T/F>)`
#> 
#> 


# Make predictions for the test rows
predictions = learner$predict(task, row_ids = ids$test)

# Score the predictions
predictions$score()
#> classif.ce 
#>  0.2898551

H2O Connection

Dictionary

Meta Information

Parameters

References

See also

Author

Super classes

Methods

Public methods

LearnerClassifH2OGLM$new()

Usage

LearnerClassifH2OGLM$clone()

Usage

Arguments

Examples

`LearnerClassifH2OGLM$new()`

`LearnerClassifH2OGLM$clone()`