Random Forest Competing Risks Learner
Source: R/learner_randomForestSRC_cmprsk_rfsrc.R
Random survival forests for competing risks.
Calls randomForestSRC::rfsrc() from package randomForestSRC.
Meta Information
Task type: “cmprsk”
Predict Types: “cif”
Feature Types: “logical”, “integer”, “numeric”, “factor”
Required Packages: mlr3, mlr3proba, mlr3extralearners, randomForestSRC
Parameters
Id | Type | Default | Levels | Range
ntree | integer | 500 | - | \([1, \infty)\)
mtry | integer | - | - | \([1, \infty)\)
mtry.ratio | numeric | - | - | \([0, 1]\)
nodesize | integer | 15 | - | \([1, \infty)\)
nodedepth | integer | - | - | \([1, \infty)\)
splitrule | character | logrankCR | logrankCR, logrank | -
nsplit | integer | 10 | - | \([0, \infty)\)
importance | character | FALSE | FALSE, TRUE, none, anti, permute, random | -
block.size | integer | 10 | - | \([1, \infty)\)
bootstrap | character | by.root | by.root, by.node, none, by.user | -
samptype | character | swor | swor, swr | -
samp | untyped | - | - | -
membership | logical | FALSE | TRUE, FALSE | -
sampsize | untyped | - | - | -
sampsize.ratio | numeric | - | - | \([0, 1]\)
na.action | character | na.omit | na.omit, na.impute | -
nimpute | integer | 1 | - | \([1, \infty)\)
ntime | integer | 150 | - | \([0, \infty)\)
cause | untyped | - | - | -
proximity | character | FALSE | FALSE, TRUE, inbag, oob, all | -
distance | character | FALSE | FALSE, TRUE, inbag, oob, all | -
forest.wt | character | FALSE | FALSE, TRUE, inbag, oob, all | -
xvar.wt | untyped | - | - | -
split.wt | untyped | - | - | -
forest | logical | TRUE | TRUE, FALSE | -
var.used | character | FALSE | FALSE, all.trees | -
split.depth | character | FALSE | FALSE, all.trees, by.tree | -
seed | integer | - | - | \((-\infty, -1]\)
do.trace | logical | FALSE | TRUE, FALSE | -
statistics | logical | FALSE | TRUE, FALSE | -
get.tree | untyped | - | - | -
outcome | character | train | train, test | -
ptn.count | integer | 0 | - | \([0, \infty)\)
cores | integer | 1 | - | \([1, \infty)\)
save.memory | logical | FALSE | TRUE, FALSE | -
perf.type | character | - | none | -
case.depth | logical | FALSE | TRUE, FALSE | -
marginal.xvar | untyped | NULL | - | -
Initial parameter values
ntime: Number of time points to coerce the observed event times for use in the estimated cumulative incidence functions during prediction. We changed the default value of 150 to 0, meaning we now use all the unique event times from the train set across all competing causes.
Custom mlr3 parameters
mtry: This hyperparameter can alternatively be set via the added hyperparameter mtry.ratio as mtry = max(ceiling(mtry.ratio * n_features), 1). Note that mtry and mtry.ratio are mutually exclusive.
sampsize: This hyperparameter can alternatively be set via the added hyperparameter sampsize.ratio as sampsize = max(ceiling(sampsize.ratio * n_obs), 1). Note that sampsize and sampsize.ratio are mutually exclusive.
cores: This value is set as the option rf.cores during training and is set to 1 by default (see the sketch after this list).
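A minimal sketch of these custom parameters; the values 0.3, 0.6 and 2 are illustrative choices, not defaults:
library(mlr3)
library(mlr3proba)
library(mlr3extralearners)
learner = lrn("cmprsk.rfsrc",
  mtry.ratio     = 0.3, # resolved to mtry = max(ceiling(0.3 * n_features), 1)
  sampsize.ratio = 0.6, # resolved to sampsize = max(ceiling(0.6 * n_obs), 1)
  cores          = 2    # passed to randomForestSRC as option rf.cores during training
)
# Setting both mtry and mtry.ratio (or both sampsize and sampsize.ratio) is an error,
# since each pair is mutually exclusive.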
References
Ishwaran H, Gerds TA, Kogalur UB, Moore RD, Gange SJ, Lau BM (2014). “Random survival forests for competing risks.” Biostatistics, 15(4), 757–773. doi:10.1093/biostatistics/kxu010.
See also
as.data.table(mlr_learners) for a table of available Learners in the running session (depending on the loaded packages).
Chapter in the mlr3book: https://mlr3book.mlr-org.com/basics.html#learners
mlr3learners for a selection of recommended learners.
mlr3cluster for unsupervised clustering learners.
mlr3pipelines to combine learners with pre- and postprocessing steps.
mlr3tuning for tuning of hyperparameters, mlr3tuningspaces for established default tuning spaces.
Super classes
mlr3::Learner
-> mlr3proba::LearnerCompRisks
-> LearnerCompRisksRandomForestSRC
Methods
Method importance()
The importance scores are extracted from the model slot importance and are cause-specific.
Returns
Named numeric().
Method selected_features()
Selected features are extracted from the model slot var.used.
Note: Due to a known issue in randomForestSRC, enabling var.used = "all.trees" causes prediction to fail. Therefore, this setting should be used exclusively for feature selection purposes and not when prediction is required.
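A minimal sketch of this workflow, training a separate forest solely for feature selection (the task and tree count are illustrative):
library(mlr3)
library(mlr3proba)
library(mlr3extralearners)
task = tsk("pbc")
# Dedicated learner for feature selection only; do not use it for prediction.
fs_learner = lrn("cmprsk.rfsrc", var.used = "all.trees", ntree = 100)
fs_learner$train(task)
fs_learner$selected_features() # features used for splitting across the forest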
Method oob_error()
Extracts the out-of-bag (OOB) cumulative incidence function (CIF) error
from the model's err.rate
slot.
If cause = "mean"
(default), the function returns a weighted average
of the cause-specific OOB errors, where the weights correspond to the
observed proportion of events for each cause in the training data.
Arguments
cause
Integer (event type) or "mean" (default). Use a specific event type to retrieve its OOB error, or "mean" to compute the weighted average across causes.
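For instance, continuing from a trained learner as in the Examples section below (the cause indices 1 and 2 assume two competing events, as in the pbc task):
learner$oob_error()          # weighted mean across causes (cause = "mean", the default)
learner$oob_error(cause = 1) # OOB CIF error for event type 1
learner$oob_error(cause = 2) # OOB CIF error for event type 2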
Examples
# Define the Learner
learner = lrn("cmprsk.rfsrc", importance = "TRUE")
print(learner)
#>
#> ── <LearnerCompRisksRandomForestSRC> (cmprsk.rfsrc): Competing Risk Survival For
#> • Model: -
#> • Parameters: importance=TRUE, ntime=0, cores=1
#> • Packages: mlr3, mlr3proba, mlr3extralearners, and randomForestSRC
#> • Predict Types: [cif]
#> • Feature Types: logical, integer, numeric, and factor
#> • Encapsulation: none (fallback: -)
#> • Properties: importance, missings, oob_error, selected_features, and weights
#> • Other settings: use_weights = 'use'
# Define a Task
task = tsk("pbc")
# Stratification based on event
task$set_col_roles(cols = "status", add_to = "stratum")
# Create train and test set
ids = partition(task)
# Train the learner on the training ids
learner$train(task, row_ids = ids$train)
print(learner$model)
#> Sample size: 184
#> Number of events: 12, 74
#> Number of trees: 500
#> Forest terminal node size: 15
#> Average no. of terminal nodes: 8.648
#> No. of variables tried at each split: 5
#> Total no. of variables: 17
#> Resampling used to grow trees: swor
#> Resample size used to grow trees: 116
#> Analysis: RSF
#> Family: surv-CR
#> Splitting rule: logrankCR *random*
#> Number of random split points: 10
#> (OOB) Requested performance error: 0.16840459, 0.1471629
#>
print(learner$importance(cause = 1)) # VIMP for cause = 1
#> bili edema protime copper age
#> 0.4460523444 0.0751811141 0.0700544612 0.0495429781 0.0425033563
#> sex ascites albumin hepato stage
#> 0.0398590525 0.0392858497 0.0240643199 0.0183598461 0.0081833155
#> chol ast platelet alk.phos trt
#> 0.0073745503 0.0047940418 0.0029641559 0.0017100448 -0.0002916088
#> spiders trig
#> -0.0007002908 -0.0034118768
print(learner$importance(cause = 2)) # VIMP for cause = 2
#> bili edema age ascites protime
#> 0.2533068989 0.1536836539 0.0627128381 0.0622469828 0.0543399949
#> albumin copper trig alk.phos platelet
#> 0.0483440053 0.0468109751 0.0158447604 0.0104082300 0.0095549953
#> chol ast spiders stage sex
#> 0.0073672755 0.0044953987 0.0038724819 0.0032225486 0.0023355943
#> hepato trt
#> 0.0006450588 -0.0001067855
print(learner$oob_error()) # weighted-mean across causes
#> [1] 0.1501269
# Make predictions for the test rows
predictions = learner$predict(task, row_ids = ids$test)
# Score the predictions
predictions$score()
#> cmprsk.auc
#> 0.888109