The mlr3 ecosystem has a new package mlr3inferr that allows to construct confidence intervals for the generalization error.
Resamplings
Resamplings split the observations multiple times into two sets: training and test. The former is used to fit the model, the latter is used to evaluate the predictions. The Resampling objects provide an abstraction for this procedure while respecting stratification as well as grouping/blocking if this is required by the Task.
If only a single split is required (i.e., a holdout split), the partition() function provides a single split into training and test set.