Generalized Linear Regression

Dictionary

This Learner can be instantiated via lrn():

lrn("regr.glm")

Meta Information

Task type: “regr”
Predict Types: “response”, “se”
Feature Types: “logical”, “integer”, “numeric”, “character”, “factor”, “ordered”
Required Packages: mlr3, mlr3extralearners, stats

Parameters

Id	Type	Default	Levels	Range
singular.ok	logical	TRUE	TRUE, FALSE	-
x	logical	FALSE	TRUE, FALSE	-
y	logical	TRUE	TRUE, FALSE	-
model	logical	TRUE	TRUE, FALSE	-
etastart	untyped	-		-
mustart	untyped	-		-
start	untyped	NULL		-
family	character	gaussian	gaussian, poisson, quasipoisson, Gamma, inverse.gaussian	-
na.action	character	-	na.omit, na.pass, na.fail, na.exclude	-
link	character	-	logit, probit, cauchit, cloglog, identity, log, sqrt, 1/mu^2, inverse	-
epsilon	numeric	1e-08		$(-\infty, \infty)$
maxit	numeric	25		$(-\infty, \infty)$
trace	logical	FALSE	TRUE, FALSE	-
dispersion	untyped	NULL		-
type	character	link	response, link, terms	-
use_pred_offset	logical	TRUE	TRUE, FALSE	-
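
As a rough illustration of how several of these parameters relate to base R, the sketch below builds a family object from `family` and `link` strings and passes `epsilon`, `maxit`, and `trace` through `stats::glm.control()`. It calls `stats::glm()` directly on simulated data. The variable names (`family_name`, `link_name`, `d`) are made up for this example, and the learner's internal handling may differ.

```r
# Assumption: family and link arrive as plain strings, as in the table above.
family_name <- "Gamma"
link_name <- "log"

# Look up the family constructor in 'stats' and call it with the chosen link.
fam <- get(family_name, mode = "function", envir = asNamespace("stats"))(link = link_name)

# epsilon, maxit and trace are the convergence controls of the IWLS fit.
ctrl <- stats::glm.control(epsilon = 1e-8, maxit = 25, trace = FALSE)

# Simulated positive response with mean exp(1 + x) (illustrative data only).
set.seed(42)
d <- data.frame(x = runif(100))
d$y <- rgamma(100, shape = 2, rate = 2 / exp(1 + d$x))

fit <- stats::glm(y ~ x, family = fam, data = d, control = ctrl)

fit$family$family
fit$family$link
```

With the learner itself, the same choices would presumably be set as hyperparameters, for example lrn("regr.glm", family = "Gamma", link = "log").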

Initial parameter values

type
- Actual default: "link"
- Adjusted default: "response"
- Reason for change: Response scale more natural for predictions.
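
To show what the two scales mean, here is a minimal sketch using `stats::glm()` and `predict()` directly on simulated Poisson data (the data and the names `d`, `eta`, `mu` are illustrative assumptions). With a log link, predictions of type "link" are on the linear predictor scale, and type "response" applies the inverse link to them.

```r
# Simulated count data (illustrative only).
set.seed(1)
d <- data.frame(x = rnorm(50))
d$y <- rpois(50, lambda = exp(0.5 + 0.4 * d$x))

fit <- stats::glm(y ~ x, family = poisson(link = "log"), data = d)

new_rows <- d[1:3, , drop = FALSE]

# Linear predictor scale (the default of predict.glm).
eta <- predict(fit, newdata = new_rows, type = "link")

# Scale of the response variable (the learner's adjusted default).
mu <- predict(fit, newdata = new_rows, type = "response")

# For a log link, the response is the exponential of the linear predictor.
all.equal(exp(eta), mu)
```

For the default gaussian family with identity link the two scales coincide, so the adjusted default only matters for non-identity links.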

Offset

If a Task has a column with the role offset, it is used automatically during training. The offset is passed through the formula interface for compatibility with stats::glm(): it is added to the model formula as offset(<column_name>), and the column is included in the training data. During prediction, the offset column of the test set is used by default (use_pred_offset = TRUE). If use_pred_offset is set to FALSE, a zero offset is applied instead, which disables the offset adjustment during prediction.
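
The effect of the two prediction modes can be sketched with `stats::glm()` alone. The example below is an assumption-laden illustration, not the learner's actual code: it fits a Poisson model with a log-exposure offset in the formula, then predicts once with the offset column from the new data and once with that column set to zero. All data and names (`d`, `log_exposure`, `p_with`, `p_zero`) are hypothetical.

```r
# Simulated counts whose mean is proportional to an exposure (illustrative only).
set.seed(1)
n <- 200
d <- data.frame(x = rnorm(n), exposure = runif(n, 1, 10))
d$log_exposure <- log(d$exposure)
d$y <- rpois(n, lambda = d$exposure * exp(0.3 + 0.5 * d$x))

# The offset enters through the formula, as described above.
fit <- stats::glm(y ~ x + offset(log_exposure),
                  family = poisson(link = "log"), data = d)

new_rows <- d[1:5, ]

# Analogue of use_pred_offset = TRUE: the offset column of the new data is used.
p_with <- predict(fit, newdata = new_rows, type = "response")

# Analogue of use_pred_offset = FALSE: a zero offset is applied.
new_zero <- new_rows
new_zero$log_exposure <- 0
p_zero <- predict(fit, newdata = new_zero, type = "response")

# On the response scale the two predictions differ by the factor exp(offset),
# which here is the exposure itself.
all.equal(unname(p_with / p_zero), new_rows$exposure)
```

In mlr3, the offset column would be marked on the Task via its column roles rather than written into a formula by hand.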

References

Hosmer Jr, D W, Lemeshow, S, Sturdivant, R X (2013). Applied Logistic Regression, volume 398. John Wiley & Sons.

Author

salauer

Super classes

mlr3::Learner -> mlr3::LearnerRegr -> LearnerRegrGlm

Methods

Method `new()`

Creates a new instance of this R6 class.

Usage

LearnerRegrGlm$new()

Method `clone()`

The objects of this class are cloneable with this method.

Usage

LearnerRegrGlm$clone(deep = FALSE)

Arguments

deep: Whether to make a deep clone.

Examples

# Define the Learner
learner = mlr3::lrn("regr.glm")
print(learner)
#> 
#> ── <LearnerRegrGlm> (regr.glm): Generalized Linear Regression ──────────────────
#> • Model: -
#> • Parameters: family=gaussian, type=response, use_pred_offset=TRUE
#> • Packages: mlr3, mlr3extralearners, and stats
#> • Predict Types: [response] and se
#> • Feature Types: logical, integer, numeric, character, factor, and ordered
#> • Encapsulation: none (fallback: -)
#> • Properties: offset and weights
#> • Other settings: use_weights = 'use'

# Define a Task
task = mlr3::tsk("mtcars")

# Create train and test set
ids = mlr3::partition(task)

# Train the learner on the training ids
learner$train(task, row_ids = ids$train)

print(learner$model)
#> 
#> Call:  stats::glm(formula = form, family = structure(list(family = "gaussian", 
#>     link = "identity", linkfun = function (mu) 
#>     mu, linkinv = function (eta) 
#>     eta, variance = function (mu) 
#>     rep.int(1, length(mu)), dev.resids = function (y, mu, wt) 
#>     wt * ((y - mu)^2), aic = function (y, n, mu, wt, dev) 
#>     {
#>         nobs <- length(y)
#>         nobs * (log(dev/nobs * 2 * pi) + 1) + 2 - sum(log(wt))
#>     }, mu.eta = function (eta) 
#>     rep.int(1, length(eta)), initialize = expression({
#>         n <- rep.int(1, nobs)
#>         if (is.null(etastart) && is.null(start) && is.null(mustart) && 
#>             ((family$link == "inverse" && any(y == 0)) || (family$link == 
#>                 "log" && any(y <= 0)))) 
#>             stop("cannot find valid starting values: please specify some")
#>         mustart <- y
#>     }), validmu = function (mu) 
#>     TRUE, valideta = function (eta) 
#>     TRUE, dispersion = NA_real_), class = "family"), data = data)
#> 
#> Coefficients:
#> (Intercept)           am         carb          cyl         disp         drat  
#>    28.50545      0.77315     -0.41440     -0.85387      0.01260      0.59946  
#>        gear           hp         qsec           vs           wt  
#>     0.38423     -0.02627      0.58929     -1.86478     -4.58478  
#> 
#> Degrees of Freedom: 20 Total (i.e. Null);  10 Residual
#> Null Deviance:	    923.1 
#> Residual Deviance: 90.17 	AIC: 114.2


# Make predictions for the test rows
predictions = learner$predict(task, row_ids = ids$test)

# Score the predictions
predictions$score()
#> regr.mse 
#> 9.752143
