Learners

To keep the dependencies on other packages reasonable, the base package mlr3 only ships with with regression and classification trees from the rpart package and some learners for debugging. A subjective selection of implementations for essential ML algorithms can be found in mlr3learners package. Survival learners are provided by mlr3proba, cluster learners via mlr3cluster. Additional learners, including some learners which are not yet to be considered stable or which are not available on CRAN, are connected via the mlr3extralearners package. For neural networks, see the mlr3torch extension.

Example Usage

Fit a classification tree on the Wisconsin Breast Cancer Data Set and predict on left-out observations.

library("mlr3verse")

# retrieve the task
task = tsk("breast_cancer")

# split into two partitions
split = partition(task)

# retrieve a learner
learner = lrn("classif.rpart", keep_model = TRUE, predict_type = "prob")

# fit decision tree
learner$train(task, split$train)

# access learned model
learner$model
n= 458 

node), split, n, loss, yval, (yprob)
      * denotes terminal node

 1) root 458 167 benign (0.36462882 0.63537118)  
   2) cell_size=3,4,5,6,7,8,9,10 182  23 malignant (0.87362637 0.12637363)  
     4) cell_shape=3,4,5,6,7,8,9,10 168  12 malignant (0.92857143 0.07142857)  
       8) marg_adhesion=2,3,4,5,6,7,8,9,10 148   5 malignant (0.96621622 0.03378378) *
       9) marg_adhesion=1 20   7 malignant (0.65000000 0.35000000)  
        18) cl_thickness=8,9,10 12   1 malignant (0.91666667 0.08333333) *
        19) cl_thickness=1,2,3,4,5,6,7 8   2 benign (0.25000000 0.75000000) *
     5) cell_shape=1,2 14   3 benign (0.21428571 0.78571429) *
   3) cell_size=1,2 276   8 benign (0.02898551 0.97101449)  
     6) normal_nucleoli=3,4,5,6,7,8,9,10 8   3 malignant (0.62500000 0.37500000) *
     7) normal_nucleoli=1,2 268   3 benign (0.01119403 0.98880597) *
# predict on data frame with new data
predictions = learner$predict_newdata(task$data(split$test))

# predict on subset of the task
predictions = learner$predict(task, split$test)

# inspect predictions
predictions
<PredictionClassif> for 225 observations:
 row_ids     truth  response prob.malignant prob.benign
       1    benign    benign     0.01119403  0.98880597
       6 malignant malignant     0.96621622  0.03378378
       7    benign    benign     0.01119403  0.98880597
     ---       ---       ---            ---         ---
     673    benign    benign     0.01119403  0.98880597
     675    benign    benign     0.01119403  0.98880597
     679    benign    benign     0.01119403  0.98880597
predictions$score(msr("classif.auc"))
classif.auc 
  0.9606481 
autoplot(predictions, type = "roc")