Classification H2O Random Forest Learner

H2O Connection

If no running H2O connection is found, the learner will automatically start a local H2O server on 127.0.0.1 via h2o::h2o.init(). If you want to connect to a remote H2O cluster, call h2o::h2o.init() with the appropriate arguments before training or predicting.

Dictionary

This Learner can be instantiated via lrn():

lrn("classif.h2o.randomForest")

Meta Information

Task type: “classif”
Predict Types: “response”, “prob”
Feature Types: “integer”, “numeric”, “factor”
Required Packages: mlr3, mlr3extralearners, h2o

Parameters

Id	Type	Default	Levels	Range
auc_type	character	AUTO	AUTO, NONE, MACRO_OVR, WEIGHTED_OVR, MACRO_OVO, WEIGHTED_OVO	-
balance_classes	logical	FALSE	TRUE, FALSE	-
binomial_double_trees	logical	FALSE	TRUE, FALSE	-
build_tree_one_node	logical	FALSE	TRUE, FALSE	-
categorical_encoding	character	AUTO	AUTO, Enum, OneHotInternal, OneHotExplicit, Binary, Eigen, LabelEncoder, SortByResponse, EnumLimited	-
check_constant_response	logical	TRUE	TRUE, FALSE	-
checkpoint	untyped	NULL		-
class_sampling_factors	untyped	NULL		-
col_sample_rate_change_per_level	numeric	1		$[0, 2]$
col_sample_rate_per_tree	numeric	1		$[0, 1]$
export_checkpoints_dir	untyped	NULL		-
gainslift_bins	integer	-1		$[-1, \infty)$
histogram_type	character	AUTO	AUTO, UniformAdaptive, Random, QuantilesGlobal, RoundRobin, UniformRobust	-
ignore_const_cols	logical	TRUE	TRUE, FALSE	-
max_after_balance_size	numeric	5		$[0, \infty)$
max_depth	integer	20		$[0, \infty)$
max_runtime_secs	numeric	0		$[0, \infty)$
min_rows	numeric	1		$[1, \infty)$
min_split_improvement	numeric	1e-05		$[0, \infty)$
mtries	integer	-1		$[1, \infty)$
nbins	integer	20		$[1, \infty)$
nbins_cats	integer	1024		$[1, \infty)$
nbins_top_level	integer	1024		$[1, \infty)$
ntrees	integer	50		$[1, \infty)$
sample_rate	numeric	0.632		$[0, 1]$
sample_rate_per_class	untyped	NULL		-
score_each_iteration	logical	FALSE	TRUE, FALSE	-
score_tree_interval	integer	0		$[0, \infty)$
seed	integer	-1		$(-\infty, \infty)$
stopping_metric	character	AUTO	AUTO, logloss, AUC, AUCPR, lift_top_group, misclassification, mean_per_class_error	-
stopping_rounds	integer	0		$[0, \infty)$
stopping_tolerance	numeric	0.001		$[0, \infty)$
verbose	logical	FALSE	TRUE, FALSE	-

References

Fryda T, LeDell E, Gill N, Aiello S, Fu A, Candel A, Click C, Kraljevic T, Nykodym T, Aboyoun P, Kurka M, Malohlava M, Poirier S, Wong W (2025). h2o: R Interface for the 'H2O' Scalable Machine Learning Platform. R package version 3.46.0.9, https://github.com/h2oai/h2o-3.

Author

awinterstetter

Super classes

mlr3::Learner -> mlr3::LearnerClassif -> LearnerClassifH2ORandomForest

Methods

Inherited methods

`LearnerClassifH2ORandomForest$new()`

Creates a new instance of this R6 class.

Usage

LearnerClassifH2ORandomForest$new()

`LearnerClassifH2ORandomForest$clone()`

The objects of this class are cloneable with this method.

Usage

LearnerClassifH2ORandomForest$clone(deep = FALSE)

Arguments

deep: Whether to make a deep clone.

Examples

# Define the Learner
learner = lrn("classif.h2o.randomForest")
print(learner)
#> 
#> ── <LearnerClassifH2ORandomForest> (classif.h2o.randomForest): H2O Random Forest
#> • Model: -
#> • Parameters: list()
#> • Packages: mlr3, mlr3extralearners, and h2o
#> • Predict Types: [response] and prob
#> • Feature Types: integer, numeric, and factor
#> • Encapsulation: none (fallback: -)
#> • Properties: missings, multiclass, twoclass, and weights
#> • Other settings: use_weights = 'use', predict_raw = 'FALSE'

# Define a Task
task = tsk("sonar")

# Create train and test set
ids = partition(task)

# Train the learner on the training ids
learner$train(task, row_ids = ids$train)

print(learner$model)
#> Model Details:
#> ==============
#> 
#> H2OBinomialModel: drf
#> Model ID:  DRF_model_R_1784104713728_58 
#> Model Summary: 
#>   number_of_trees number_of_internal_trees model_size_in_bytes min_depth
#> 1              50                       50               12629         5
#>   max_depth mean_depth min_leaves max_leaves mean_leaves
#> 1         9    6.44000         11         21    15.52000
#> 
#> 
#> H2OBinomialMetrics: drf
#> ** Reported on training data. **
#> ** Metrics reported on Out-Of-Bag training samples **
#> 
#> MSE:  0.1346601
#> RMSE:  0.3669607
#> LogLoss:  0.411964
#> Mean Per-Class Error:  0.2043651
#> AUC:  0.8924394
#> AUCPR:  0.8915914
#> Gini:  0.7848789
#> R^2:  0.4566065
#> 
#> Confusion Matrix (vertical: actual; across: predicted) for F1-optimal threshold:
#>         M  R    Error     Rate
#> M      57 19 0.250000   =19/76
#> R      10 53 0.158730   =10/63
#> Totals 67 72 0.208633  =29/139
#> 
#> Maximum Metrics: Maximum metrics at their respective thresholds
#>                         metric threshold     value idx
#> 1                       max f1  0.409091  0.785185  44
#> 2                       max f2  0.176471  0.879888  68
#> 3                 max f0point5  0.666667  0.862069  22
#> 4                 max accuracy  0.666667  0.798561  22
#> 5                max precision  1.000000  1.000000   0
#> 6                   max recall  0.176471  1.000000  68
#> 7              max specificity  1.000000  1.000000   0
#> 8             max absolute_mcc  0.666667  0.637168  22
#> 9   max min_per_class_accuracy  0.454545  0.777778  38
#> 10 max mean_per_class_accuracy  0.409091  0.795635  44
#> 11                     max tns  1.000000 76.000000   0
#> 12                     max fns  1.000000 58.000000   0
#> 13                     max fps  0.000000 76.000000  85
#> 14                     max tps  0.176471 63.000000  68
#> 15                     max tnr  1.000000  1.000000   0
#> 16                     max fnr  1.000000  0.920635   0
#> 17                     max fpr  0.000000  1.000000  85
#> 18                     max tpr  0.176471  1.000000  68
#> 
#> Gains/Lift Table: Extract with `h2o.gainsLift(<model>, <data>)` or `h2o.gainsLift(<model>, valid=<T/F>, xval=<T/F>)`
#> 
#> 


# Make predictions for the test rows
predictions = learner$predict(task, row_ids = ids$test)

# Score the predictions
predictions$score()
#> classif.ce 
#>  0.2173913

H2O Connection

Dictionary

Meta Information

Parameters

References

See also

Author

Super classes

Methods

Public methods

LearnerClassifH2ORandomForest$new()

Usage

LearnerClassifH2ORandomForest$clone()

Usage

Arguments

Examples

`LearnerClassifH2ORandomForest$new()`

`LearnerClassifH2ORandomForest$clone()`