Classification Imbalanced Random Forest Src Learner

Imbalanced Random forest for classification between two classes. Calls randomForestSRC::imbalanced.rfsrc() from from randomForestSRC.

Dictionary

This Learner can be instantiated via lrn():

lrn("classif.imbalanced_rfsrc")

Meta Information

Task type: “classif”
Predict Types: “response”, “prob”
Feature Types: “logical”, “integer”, “numeric”, “factor”, “ordered”
Required Packages: mlr3, randomForestSRC

Parameters

Id	Type	Default	Levels	Range
ntree	integer	500		$[1, \infty)$
method	character	rfq	rfq, brf, standard	-
block.size	integer	10		$[1, \infty)$
fast	logical	FALSE	TRUE, FALSE	-
ratio	numeric	-		$[0, 1]$
mtry	integer	-		$[1, \infty)$
mtry.ratio	numeric	-		$[0, 1]$
nodesize	integer	15		$[1, \infty)$
nodedepth	integer	-		$[1, \infty)$
splitrule	character	gini	gini, auc, entropy	-
nsplit	integer	10		$[0, \infty)$
importance	character	FALSE	FALSE, TRUE, none, permute, random, anti	-
bootstrap	character	by.root	by.root, by.node, none, by.user	-
samptype	character	swor	swor, swr	-
samp	untyped	-		-
membership	logical	FALSE	TRUE, FALSE	-
sampsize	untyped	-		-
sampsize.ratio	numeric	-		$[0, 1]$
na.action	character	na.omit	na.omit, na.impute	-
nimpute	integer	1		$[1, \infty)$
ntime	integer	-		$[1, \infty)$
cause	integer	-		$[1, \infty)$
proximity	character	FALSE	FALSE, TRUE, inbag, oob, all	-
distance	character	FALSE	FALSE, TRUE, inbag, oob, all	-
forest.wt	character	FALSE	FALSE, TRUE, inbag, oob, all	-
xvar.wt	untyped	-		-
split.wt	untyped	-		-
forest	logical	TRUE	TRUE, FALSE	-
var.used	character	FALSE	FALSE, all.trees, by.tree	-
split.depth	character	FALSE	FALSE, all.trees, by.tree	-
seed	integer	-		$(-\infty, -1]$
do.trace	logical	FALSE	TRUE, FALSE	-
statistics	logical	FALSE	TRUE, FALSE	-
get.tree	untyped	-		-
outcome	character	train	train, test	-
ptn.count	integer	0		$[0, \infty)$
cores	integer	1		$[1, \infty)$
save.memory	logical	FALSE	TRUE, FALSE	-
perf.type	character	-	gmean, misclass, brier, none	-
case.depth	logical	FALSE	TRUE, FALSE	-
marginal.xvar	untyped	NULL		-

Custom mlr3 parameters

mtry: This hyperparameter can alternatively be set via the added hyperparameter mtry.ratio as mtry = max(ceiling(mtry.ratio * n_features), 1). Note that mtry and mtry.ratio are mutually exclusive.
sampsize: This hyperparameter can alternatively be set via the added hyperparameter sampsize.ratio as sampsize = max(ceiling(sampsize.ratio * n_obs), 1). Note that sampsize and sampsize.ratio are mutually exclusive.
cores: This value is set as the option rf.cores during training and is set to 1 by default.

References

O’Brien R, Ishwaran H (2019). “A random forests quantile classifier for class imbalanced data.” Pattern Recognition, 90, 232–249. doi:10.1016/j.patcog.2019.01.036 .

Chao C, Leo B (2004). “Using Random Forest to Learn Imbalanced Data.” University of California, Berkeley.

Author

HarutyunyanLiana

Super classes

mlr3::Learner -> mlr3::LearnerClassif -> LearnerClassifImbalancedRandomForestSRC

Methods

Public methods

LearnerClassifImbalancedRandomForestSRC$new()
LearnerClassifImbalancedRandomForestSRC$importance()
LearnerClassifImbalancedRandomForestSRC$selected_features()
LearnerClassifImbalancedRandomForestSRC$oob_error()
LearnerClassifImbalancedRandomForestSRC$clone()

Inherited methods

Method `new()`

Creates a new instance of this R6 class.

Usage

LearnerClassifImbalancedRandomForestSRC$new()

Method `importance()`

The importance scores are extracted from the slot importance.

Usage

LearnerClassifImbalancedRandomForestSRC$importance()

Returns

Named numeric().

Method `selected_features()`

Selected features are extracted from the model slot var.used.

Usage

LearnerClassifImbalancedRandomForestSRC$selected_features()

Returns

character().

Method `oob_error()`

OOB error extracted from the model slot err.rate.

Usage

LearnerClassifImbalancedRandomForestSRC$oob_error()

Returns

numeric().

Method `clone()`

The objects of this class are cloneable with this method.

Usage

LearnerClassifImbalancedRandomForestSRC$clone(deep = FALSE)

Arguments

deep: Whether to make a deep clone.

Examples

# Define the Learner
learner = mlr3::lrn("classif.imbalanced_rfsrc", importance = "TRUE")
print(learner)
#> 
#> ── <LearnerClassifImbalancedRandomForestSRC> (classif.imbalanced_rfsrc): Imbalan
#> • Model: -
#> • Parameters: importance=TRUE
#> • Packages: mlr3 and randomForestSRC
#> • Predict Types: [response] and prob
#> • Feature Types: logical, integer, numeric, factor, and ordered
#> • Encapsulation: none (fallback: -)
#> • Properties: importance, missings, oob_error, twoclass, and weights
#> • Other settings: use_weights = 'use'

# Define a Task
task = mlr3::tsk("sonar")
# Create train and test set
ids = mlr3::partition(task)

# Train the learner on the training ids
learner$train(task, row_ids = ids$train)

print(learner$model)
#>                          Sample size: 139
#>            Frequency of class labels: 72, 67
#>                      Number of trees: 3000
#>            Forest terminal node size: 1
#>        Average no. of terminal nodes: 16.4877
#> No. of variables tried at each split: 8
#>               Total no. of variables: 60
#>        Resampling used to grow trees: swor
#>     Resample size used to grow trees: 88
#>                             Analysis: RFQ
#>                               Family: class
#>                       Splitting rule: auc *random*
#>        Number of random split points: 10
#>                     Imbalanced ratio: 1.0746
#>                    (OOB) Brier score: 0.1264914
#>         (OOB) Normalized Brier score: 0.50596561
#>                            (OOB) AUC: 0.93884743
#>                       (OOB) Log-loss: 0.40861012
#>                         (OOB) PR-AUC: 0.93603323
#>                         (OOB) G-mean: 0.84788294
#>    (OOB) Requested performance error: 0.15211706
#> 
#> Confusion matrix:
#> 
#>           predicted
#>   observed  M  R class.error
#>          M 68  4      0.0556
#>          R 16 51      0.2388
#> 
#>       (OOB) Misclassification rate: 0.1438849
print(learner$importance())
#>          V21           V4          V12          V13          V59          V11 
#>  0.035331113  0.029104571  0.020791888  0.020791888  0.020791888  0.014549602 
#>          V23          V34          V39          V42          V44          V53 
#>  0.014549602  0.014549602  0.014549602  0.014549602  0.014549602  0.014549602 
#>          V56          V20          V26          V31          V38          V52 
#>  0.014549602  0.012561923  0.012561923  0.012561923  0.012561923  0.012561923 
#>          V10          V17          V24          V29          V43          V45 
#>  0.008353730  0.008353730  0.008353730  0.008353730  0.008353730  0.008353730 
#>          V46          V48           V6           V8           V9          V14 
#>  0.008353730  0.008353730  0.008353730  0.008353730  0.008353730  0.006257524 
#>          V15          V19          V22           V3          V30          V32 
#>  0.006257524  0.006257524  0.006257524  0.006257524  0.006257524  0.006257524 
#>          V33          V36          V40          V49          V50          V57 
#>  0.006257524  0.006257524  0.006257524  0.006257524  0.006257524  0.006257524 
#>          V58           V1          V16          V18           V2          V25 
#>  0.006257524  0.000000000  0.000000000  0.000000000  0.000000000  0.000000000 
#>          V27          V28          V35          V37          V41           V5 
#>  0.000000000  0.000000000  0.000000000  0.000000000  0.000000000  0.000000000 
#>          V51          V54          V55          V60           V7          V47 
#>  0.000000000  0.000000000  0.000000000  0.000000000  0.000000000 -0.008272225 

# Make predictions for the test rows
predictions = learner$predict(task, row_ids = ids$test)

# Score the predictions
predictions$score()
#> classif.ce 
#>  0.1884058

Dictionary

Meta Information

Parameters

Custom mlr3 parameters

References

See also

Author

Super classes

Methods

Public methods

Method new()

Usage

Method importance()

Usage

Returns

Method selected_features()

Usage

Returns

Method oob_error()

Usage

Returns

Method clone()

Usage

Arguments

Examples

Method `new()`

Method `importance()`

Method `selected_features()`

Method `oob_error()`

Method `clone()`