BlockForest Regression Learner
Source: R/learner_blockForest_regr_blockforest.R
mlr_learners_regr.blockforest.Rd
Random forests for blocks of clinical and omics covariate data.
Calls blockForest::blockfor()
from package blockForest.
The training model includes only the $forest slot, excluding paramvalues and biased_oob_error_donotuse.
In this learner, only the trained forest object ($forest) is retained. The optimized block-specific tuning parameters (paramvalues) and the biased OOB error estimate (biased_oob_error_donotuse) are discarded, as they are either not needed for downstream use or not reliable for performance estimation.
Initial parameter values
num.threads is initialized to 1 to avoid conflicts with parallelization via future.
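If resampling is not being parallelized via future, the thread count can be raised after construction. A minimal sketch; the block indices and thread count are illustrative, not defaults:

```r
library(mlr3)
library(mlr3extralearners)

# Illustrative block definition: task features 1-3 vs. 4-10
learner = lrn("regr.blockforest", blocks = list(bl1 = 1:3, bl2 = 4:10))

# num.threads is initialized to 1; override it only when no
# future-based parallelization is active on top of the learner
learner$param_set$values$num.threads = 4
```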
Meta Information
Task type: “regr”
Predict Types: “response”, “se”
Feature Types: “logical”, “integer”, “numeric”, “factor”, “ordered”
Required Packages: mlr3, mlr3extralearners, blockForest
Parameters
Id | Type | Default | Levels | Range |
blocks | untyped | - | - | - |
block.method | character | BlockForest | BlockForest, RandomBlock, BlockVarSel, VarProb, SplitWeights | - |
num.trees | integer | 2000 | - | \([1, \infty)\) |
mtry | untyped | NULL | - | - |
nsets | integer | 300 | - | \([1, \infty)\) |
num.trees.pre | integer | 1500 | - | \([1, \infty)\) |
splitrule | character | extratrees | extratrees, variance, maxstat | - |
always.select.block | integer | 0 | - | \([0, 1]\) |
importance | character | - | none, impurity, impurity_corrected, permutation | - |
num.threads | integer | - | - | \([1, \infty)\) |
seed | integer | NULL | - | \((-\infty, \infty)\) |
verbose | logical | TRUE | TRUE, FALSE | - |
se.method | character | infjack | jack, infjack | - |
References
Hornung, R., Wright, M. N. (2019). “Block Forests: Random forests for blocks of clinical and omics covariate data.” BMC Bioinformatics, 20(1), 1–17. doi:10.1186/s12859-019-2942-y.
See also
as.data.table(mlr_learners) for a table of available Learners in the running session (depending on the loaded packages).
Chapter in the mlr3book: https://mlr3book.mlr-org.com/basics.html#learners
mlr3learners for a selection of recommended learners.
mlr3cluster for unsupervised clustering learners.
mlr3pipelines to combine learners with pre- and postprocessing steps.
mlr3tuning for tuning of hyperparameters, mlr3tuningspaces for established default tuning spaces.
Super classes
mlr3::Learner
-> mlr3::LearnerRegr
-> LearnerRegrBlockForest
Methods
Inherited methods
mlr3::Learner$base_learner()
mlr3::Learner$configure()
mlr3::Learner$encapsulate()
mlr3::Learner$format()
mlr3::Learner$help()
mlr3::Learner$predict()
mlr3::Learner$predict_newdata()
mlr3::Learner$print()
mlr3::Learner$reset()
mlr3::Learner$selected_features()
mlr3::Learner$train()
mlr3::LearnerRegr$predict_newdata_fast()
Method importance()
The importance scores are extracted from the model slot variable.importance.
Returns
Named numeric().
Examples
# Define a Task
task = tsk("mtcars")
# Create train and test set
ids = partition(task)
# check task's features
task$feature_names
#> [1] "am" "carb" "cyl" "disp" "drat" "gear" "hp" "qsec" "vs" "wt"
# partition features to 2 blocks
blocks = list(bl1 = 1:3, bl2 = 4:10)
# define learner
learner = lrn("regr.blockforest", blocks = blocks,
importance = "permutation", nsets = 10,
num.trees = 50, num.trees.pre = 10, splitrule = "variance")
# Train the learner on the training ids
learner$train(task, row_ids = ids$train)
# feature importance
learner$importance()
#> disp cyl hp wt carb am qsec
#> 10.0497655 7.5301460 5.6354238 5.5179702 4.3995059 2.3392581 2.1031796
#> vs gear drat
#> 0.4786200 0.2007357 0.1794943
# Make predictions for the test observations
pred = learner$predict(task, row_ids = ids$test)
pred
#>
#> ── <PredictionRegr> for 11 observations: ───────────────────────────────────────
#> row_ids truth response
#> 2 21.0 21.40552
#> 3 22.8 27.20727
#> 6 18.1 21.20352
#> --- --- ---
#> 15 10.4 16.09210
#> 16 10.4 15.39179
#> 29 15.8 17.78929
# Score the predictions
pred$score()
#> regr.mse
#> 8.68556
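Since “se” is listed among the predict types (with se.method defaulting to “infjack”), standard errors can also be requested. A minimal sketch reusing the task, blocks, and ids from the example above; the small num.trees/nsets values just keep the run fast:

```r
# Request standard errors by setting predict_type = "se"
learner_se = lrn("regr.blockforest", blocks = blocks,
  num.trees = 50, nsets = 10, num.trees.pre = 10,
  splitrule = "variance", predict_type = "se")
learner_se$train(task, row_ids = ids$train)
pred_se = learner_se$predict(task, row_ids = ids$test)
# pred_se$se holds one standard error per test observation
```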