Generalized Linear Regression
mlr_learners_regr.glm.Rd
Generalized linear model.
Calls stats::glm()
from base package 'stats'.
For logistic regression please use mlr_learners_classif.log_reg
.
Meta Information
Task type: “regr”
Predict Types: “response”, “se”
Feature Types: “logical”, “integer”, “numeric”, “character”, “factor”, “ordered”
Required Packages: mlr3, mlr3extralearners, 'stats'
Parameters
Id | Type | Default | Levels | Range |
singular.ok | logical | TRUE | TRUE, FALSE | - |
x | logical | FALSE | TRUE, FALSE | - |
y | logical | TRUE | TRUE, FALSE | - |
model | logical | TRUE | TRUE, FALSE | - |
etastart | untyped | - | - | |
mustart | untyped | - | - | |
start | untyped | NULL | - | |
offset | untyped | - | - | |
family | character | gaussian | gaussian, poisson, quasipoisson, Gamma, inverse.gaussian | - |
na.action | character | - | na.omit, na.pass, na.fail, na.exclude | - |
link | character | - | logit, probit, cauchit, cloglog, identity, log, sqrt, 1/mu^2, inverse | - |
epsilon | numeric | 1e-08 | \((-\infty, \infty)\) | |
maxit | numeric | 25 | \((-\infty, \infty)\) | |
trace | logical | FALSE | TRUE, FALSE | - |
dispersion | untyped | NULL | - | |
type | character | link | response, link, terms | - |
Initial parameter values
type
Actual default: "link"
Adjusted default: "response"
Reason for change: Response scale more natural for predictions.
References
Hosmer Jr, W D, Lemeshow, Stanley, Sturdivant, X R (2013). Applied logistic regression, volume 398. John Wiley & Sons.
See also
as.data.table(mlr_learners)
for a table of available Learners in the running session (depending on the loaded packages).Chapter in the mlr3book: https://mlr3book.mlr-org.com/basics.html#learners
mlr3learners for a selection of recommended learners.
mlr3cluster for unsupervised clustering learners.
mlr3pipelines to combine learners with pre- and postprocessing steps.
mlr3tuning for tuning of hyperparameters, mlr3tuningspaces for established default tuning spaces.
Super classes
mlr3::Learner
-> mlr3::LearnerRegr
-> LearnerRegrGlm
Examples
# Define the Learner
learner = mlr3::lrn("regr.glm")
print(learner)
#> <LearnerRegrGlm:regr.glm>: Generalized Linear Regression
#> * Model: -
#> * Parameters: family=gaussian, type=response
#> * Packages: mlr3, mlr3extralearners, stats
#> * Predict Types: [response], se
#> * Feature Types: logical, integer, numeric, character, factor, ordered
#> * Properties: weights
# Define a Task
task = mlr3::tsk("mtcars")
# Create train and test set
ids = mlr3::partition(task)
# Train the learner on the training ids
learner$train(task, row_ids = ids$train)
print(learner$model)
#>
#> Call: stats::glm(formula = formula, family = structure(list(family = "gaussian",
#> link = "identity", linkfun = function (mu)
#> mu, linkinv = function (eta)
#> eta, variance = function (mu)
#> rep.int(1, length(mu)), dev.resids = function (y, mu, wt)
#> wt * ((y - mu)^2), aic = function (y, n, mu, wt, dev)
#> {
#> nobs <- length(y)
#> nobs * (log(dev/nobs * 2 * pi) + 1) + 2 - sum(log(wt))
#> }, mu.eta = function (eta)
#> rep.int(1, length(eta)), initialize = expression({
#> n <- rep.int(1, nobs)
#> if (is.null(etastart) && is.null(start) && is.null(mustart) &&
#> ((family$link == "inverse" && any(y == 0)) || (family$link ==
#> "log" && any(y <= 0))))
#> stop("cannot find valid starting values: please specify some")
#> mustart <- y
#> }), validmu = function (mu)
#> TRUE, valideta = function (eta)
#> TRUE, dispersion = NA_real_), class = "family"), data = data)
#>
#> Coefficients:
#> (Intercept) am carb cyl disp drat
#> -5.645801 5.123981 -0.675242 -0.532099 0.005588 2.143339
#> gear hp qsec vs wt
#> -0.441599 0.001914 1.603206 -1.380888 -2.045974
#>
#> Degrees of Freedom: 20 Total (i.e. Null); 10 Residual
#> Null Deviance: 780
#> Residual Deviance: 86.23 AIC: 113.3
# Make predictions for the test rows
predictions = learner$predict(task, row_ids = ids$test)
# Score the predictions
predictions$score()
#> regr.mse
#> 10.59727