Learners

To keep the dependencies on other packages reasonable, the base package mlr3 only ships with with regression and classification trees from the rpart package and some learners for debugging. A subjective selection of implementations for essential ML algorithms can be found in mlr3learners package. Survival learners are provided by mlr3proba, cluster learners via mlr3cluster. Additional learners, including some learners which are not yet to be considered stable or which are not available on CRAN, are connected via the mlr3extralearners package. For neural networks, see the mlr3torch extension.

Example Usage

Fit a classification tree on the Wisconsin Breast Cancer Data Set and predict on left-out observations.

library("mlr3verse")
Registered S3 methods overwritten by 'mlr3viz':
  method                    from     
  autoplot.LearnerSurvCoxPH mlr3proba
  plot.LearnerSurvCoxPH     mlr3proba
# retrieve the task
task = tsk("breast_cancer")

# split into two partitions
split = partition(task)

# retrieve a learner
learner = lrn("classif.rpart", keep_model = TRUE, predict_type = "prob")

# fit decision tree
learner$train(task, split$train)

# access learned model
learner$model
n= 458 

node), split, n, loss, yval, (yprob)
      * denotes terminal node

1) root 458 158 benign (0.34497817 0.65502183)  
  2) cell_size=4,5,6,7,8,9,10 146   8 malignant (0.94520548 0.05479452) *
  3) cell_size=1,2,3 312  20 benign (0.06410256 0.93589744)  
    6) bare_nuclei=8,9,10 15   2 malignant (0.86666667 0.13333333) *
    7) bare_nuclei=1,2,3,4,5,6,7 297   7 benign (0.02356902 0.97643098) *
# predict on data frame with new data
predictions = learner$predict_newdata(task$data(split$test))

# predict on subset of the task
predictions = learner$predict(task, split$test)

# inspect predictions
predictions

── <PredictionClassif> for 225 observations: ───────────────────────────────────
 row_ids     truth  response prob.malignant prob.benign
       5    benign    benign     0.02356902  0.97643098
       6 malignant malignant     0.94520548  0.05479452
       8    benign    benign     0.02356902  0.97643098
     ---       ---       ---            ---         ---
     680    benign    benign     0.02356902  0.97643098
     682 malignant malignant     0.94520548  0.05479452
     683 malignant malignant     0.94520548  0.05479452
predictions$score(msr("classif.auc"))
classif.auc 
  0.9209105 
autoplot(predictions, type = "roc")
Warning in ggplot2::fortify(object, raw_curves = raw_curves, reduce_points = reduce_points): Arguments in `...` must be used.
✖ Problematic argument:
• raw_curves = raw_curves
ℹ Did you misspell an argument name?