doubleml / doubleml-for-r

DoubleML - Double Machine Learning in R

Home Page: https://docs.doubleml.org

License: Other

Language: R (100.00%)

Topics: machine-learning, r, mlr3, causal-inference, statistics, econometrics, data-science, double-machine-learning

doubleml-for-r's Introduction

DoubleML - Double Machine Learning in R


The R package DoubleML provides an implementation of the double / debiased machine learning framework of Chernozhukov et al. (2018). It is built on top of mlr3 and the mlr3 ecosystem (Lang et al., 2019).

Note that the R package was developed together with a Python twin based on scikit-learn. The Python package is also available on GitHub and PyPI.

Documentation and maintenance

Documentation of functions in R: https://docs.doubleml.org/r/stable/reference/index.html

User guide: https://docs.doubleml.org

DoubleML is currently maintained by @PhilippBach and @SvenKlaassen.

Main Features

Double / debiased machine learning framework of Chernozhukov et al. (2018) for

  • Partially linear regression models (PLR)
  • Partially linear IV regression models (PLIV)
  • Interactive regression models (IRM)
  • Interactive IV regression models (IIVM)

The object-oriented implementation of DoubleML, which is based on the R6 package for R, is very flexible. The model classes DoubleMLPLR, DoubleMLPLIV, DoubleMLIRM and DoubleMLIIVM implement the estimation of the nuisance functions via machine learning methods and the computation of the Neyman orthogonal score function. All other functionality is implemented in the abstract base class DoubleML, in particular the functionality to estimate double machine learning models and to perform statistical inference via the methods fit, bootstrap, confint, p_adjust and tune (a minimal usage sketch is given after the lists below). This object-oriented implementation allows a high degree of flexibility in the model specification in terms of …

  • … the machine learning methods for estimation of the nuisance functions,
  • … the resampling schemes,
  • … the double machine learning algorithm,
  • … the Neyman orthogonal score functions.

It can further be readily extended with regard to

  • … new model classes that come with Neyman orthogonal score functions being linear in the target parameter,
  • … alternative score functions via callables,
  • … alternative resampling schemes.
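
A minimal usage sketch (assuming the simulated data generator and the ranger learner from the mlr3 ecosystem):

library(DoubleML)
library(mlr3)
library(mlr3learners)

set.seed(1111)
# simulated data of Chernozhukov et al. (2018) with true effect alpha = 0.5
dml_data = make_plr_CCDDHNR2018(alpha = 0.5, n_obs = 500)
ml_g = lrn("regr.ranger", num.trees = 100)
ml_m = lrn("regr.ranger", num.trees = 100)
dml_plr = DoubleMLPLR$new(dml_data, ml_g = ml_g, ml_m = ml_m, n_folds = 5)
dml_plr$fit()                  # cross-fitted estimation of the causal parameter
dml_plr$summary()              # coefficient, standard error, t- and p-value
dml_plr$bootstrap()            # multiplier bootstrap
dml_plr$confint(joint = TRUE)  # (joint) confidence interval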

OOP structure of the DoubleML package (figure)

Installation

Install the latest release from CRAN:

install.packages("DoubleML")

Install the development version from GitHub:

remotes::install_github("DoubleML/doubleml-for-r")

DoubleML requires

  • R (>= 3.5.0)
  • R6 (>= 2.4.1)
  • data.table (>= 1.12.8)
  • stats
  • checkmate
  • mlr3 (>= 0.5.0)
  • mlr3tuning (>= 0.3.0)
  • mlr3learners (>= 0.3.0)
  • mvtnorm
  • utils
  • clusterGeneration
  • readstata13

Contributing

DoubleML is a community effort. Everyone is welcome to contribute. To get started for your first contribution we recommend reading our contributing guidelines and our code of conduct.

Citation

If you use the DoubleML package, a citation is highly appreciated:

Bach, P., Chernozhukov, V., Kurz, M. S., Spindler, M. and Klaassen, S. (2024), DoubleML - An Object-Oriented Implementation of Double Machine Learning in R, Journal of Statistical Software, 108(3): 1-56, doi:10.18637/jss.v108.i03.

Bibtex-entry:

@article{DoubleML2024,
      title={{DoubleML} -- {A}n Object-Oriented Implementation of Double Machine Learning in {R}},
      author={P. Bach and V. Chernozhukov and M. S. Kurz and M. Spindler and S. Klaassen},
      journal={Journal of Statistical Software},
      year={2024},
      volume={108},
      number={3},
      pages={1--56},
      doi={10.18637/jss.v108.i03},
      note={arXiv:\href{https://arxiv.org/abs/2103.09603}{2103.09603} [stat.ML]}
}

Acknowledgements

Funding by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) is acknowledged – Project Number 431701914.

References

  • Bach, P., Chernozhukov, V., Kurz, M. S., Spindler, M. and Klaassen, S. (2024), DoubleML - An Object-Oriented Implementation of Double Machine Learning in R, Journal of Statistical Software, 108(3): 1-56, doi:10.18637/jss.v108.i03, arXiv:2103.09603.

  • Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., Newey, W. and Robins, J. (2018), Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal, 21: C1-C68, https://doi.org/10.1111/ectj.12097.

  • Lang, M., Binder, M., Richter, J., Schratz, P., Pfisterer, F., Coors, S., Au, Q., Casalicchio, G., Kotthoff, L., Bischl, B. (2019), mlr3: A modern object-oriented machine learning framework in R. Journal of Open Source Software, 4(44): 1903, https://doi.org/10.21105/joss.01903.

doubleml-for-r's People

Contributors: maltekurz, michaelquinn32, mllg, philippbach


doubleml-for-r's Issues

Patrick 0.1.0 will have backwards incompatible changes

Hi Double ML team!

Thanks for using patrick for creating parameterized tests.

I am going to start the process of releasing a backwards incompatible change in the package.

  • In the past, the undocumented test_name parameter could be used in test case data frames and as an argument for naming tests.
  • I am moving this to a documented argument in with_parameters_test_that(). The argument is also being renamed to .test_name in order to distinguish it from test cases passed by a user.

In version 0.1.0, patrick will throw a warning about this change and rename the input as appropriate. In a future version, this warning will be dropped. Addressing it requires changing your use of test_name to .test_name.

Apologies for any inconvenience that this causes. Please let me know how else I can help.

Best wishes,
Michael

[Bug]: Tuning fails with a non-meaningful error message when `tune_settings$measure` is a proper subset of the nuisance parts

Describe the bug

The tune method allows the user to specify a nuisance-specific measure. If the provided list contains a name that is not a nuisance part, a meaningful error message is produced, i.e., tune_settings[['measure']] = list(ml_m = "regr.mae", ml_wrong_name = "regr.rmse") results in something like:

Error in private$assert_tune_settings(tune_settings) : 
  Invalid name of measure ml_m, ml_r 
 measure must be a named list with elements named ml_g, ml_m 

However, if the list of measures is a proper subset of the nuisance parts (e.g., tune_settings[['measure']] = list(ml_m = "regr.mae")), it fails with a non-meaningful error message:

Error in default_measures(task_type)[[1L]] : subscript out of bounds 

Minimum reproducible code snippet

library(DoubleML)
library(mlr3)
library(mlr3learners)
library(data.table)
set.seed(2)
ml_g = lrn("regr.ranger", num.trees = 10, max.depth = 2)
ml_m = ml_g$clone()
obj_dml_data = make_plr_CCDDHNR2018(alpha = 0.5)
dml_plr_obj = DoubleMLPLR$new(obj_dml_data, ml_g, ml_m)
par_grids = list("ml_g" = paradox::ParamSet$new(list(
    paradox::ParamInt$new("num.trees", lower = 5, upper = 6, default = 5))),
    "ml_m" = paradox::ParamSet$new(list(
    paradox::ParamInt$new("num.trees", lower = 5, upper = 6, default = 5))))
default_tune_settings = list(
    n_folds_tune = 5,
    rsmp_tune = mlr3::rsmp("cv", folds = 5),
    measure = NULL,
    terminator = mlr3tuning::trm("evals", n_evals = 20),
    algorithm = mlr3tuning::tnr("grid_search"),
    resolution = 5)

tune_settings = default_tune_settings
tune_settings[['measure']] = list(ml_m = "regr.mae")

dml_plr_obj$tune(param_set=par_grids, tune_settings = tune_settings)

Expected Result

I would expect the ML methods to be tuned successfully: for every nuisance part where a measure was actively set, I expect it to be used, and for every other nuisance part I would expect a fallback to the default measure. I expect this behavior because otherwise it wouldn't make sense to check for a subset here

doubleml-for-r/R/double_ml.R

Lines 1316 to 1323 in acb9d46

if (!test_names(names(tune_settings$measure),
  subset.of = valid_learner)) {
  stop(paste(
    "Invalid name of measure", paste0(names(tune_settings$measure),
      collapse = ", "),
    "\n measure must be a named list with elements named",
    paste0(valid_learner, collapse = ", ")))
}

Alternative expected behavior: as an alternative, we could enforce that either a measure is set for every nuisance part or for none (resulting in default measures being used for every nuisance part). If we go for this alternative, we should check for exactly matching list keys instead of a subset, which would then produce a meaningful error message. However, I prefer the solution described above, where we fall back to the default measure for every nuisance part where no measure was actively set; implementing this selective fallback would be easy, as the sketch below suggests.
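
A hypothetical sketch of the selective fallback (the helper and argument names are assumptions, not the package's actual internals):

# keep user-specified measures and fill the remaining nuisance parts with
# mlr3's default measure for the respective task type
fill_default_measures = function(measure, valid_learner, task_type) {
  if (is.null(measure)) measure = list()
  for (nuisance in valid_learner) {
    if (is.null(measure[[nuisance]])) {
      measure[[nuisance]] = mlr3::default_measures(task_type[[nuisance]])[[1L]]
    }
  }
  return(measure)
}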

Actual Result

Error in default_measures(task_type)[[1L]] : subscript out of bounds 

Versions

> sessionInfo()
R version 4.0.4 (2021-02-15)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Ubuntu 21.10

> packageVersion('DoubleML')
[1] ‘0.4.1’
> packageVersion('mlr3')
[1] ‘0.11.0.9000’

Reduce unit test times

Some of our unit tests simply take too long: with ON_CRAN='false' the suite takes around 30 minutes on GitHub Actions. We should find a better parametrization while keeping a similar level of coverage.

Make DoubleML available for R (≥ 4.0.2)

Describe the feature you want to propose or implement

I cannot install DoubleML in the configurations below.

SessionInfo (Microsoft R Open 4.0.2)

> sessionInfo()
R version 4.0.2 (2020-06-22)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Red Hat Enterprise Linux 8.3 (Ootpa)

Matrix products: default
BLAS:   /opt/microsoft/ropen/4.0.2/lib64/R/lib/libRblas.so
LAPACK: /opt/microsoft/ropen/4.0.2/lib64/R/lib/libRlapack.so

locale:
 [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C               LC_TIME=en_US.UTF-8       
 [4] LC_COLLATE=en_US.UTF-8     LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8   
 [7] LC_PAPER=en_US.UTF-8       LC_NAME=C                  LC_ADDRESS=C              
[10] LC_TELEPHONE=C             LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
[1] RevoUtils_11.0.2     RevoUtilsMath_11.0.0

loaded via a namespace (and not attached):
[1] compiler_4.0.2 parallel_4.0.2 tools_4.0.2

Commands Ran and outputs

> install.packages("DoubleML")
Installing package into ‘<XXX>/app/R40_Library’
(as ‘lib’ is unspecified)
Warning in install.packages :
  package ‘DoubleML’ is not available (for R version 4.0.2)

Propose a possible solution or implementation

No response

Did you consider alternatives to the proposed solution. If yes, please describe

No response

Comments, context or references

No response

Inconsistent initialization of `task_type` between PLIV and other models (PLR, IRM, IIVM)

PLR, IRM and IIVM

For PLR, IRM and IIVM, we first initialize the private property task_type to NULL for all necessary nuisance parts; during the call to assert_learner it is then filled with content. See

private$task_type = list(
  "ml_g" = NULL,
  "ml_m" = NULL)
ml_g = private$assert_learner(ml_g, "ml_g", Regr = TRUE, Classif = FALSE)
ml_m = private$assert_learner(ml_m, "ml_m", Regr = TRUE, Classif = TRUE)

private$task_type = list(
  "ml_g" = NULL,
  "ml_m" = NULL)
ml_g = private$assert_learner(ml_g, "ml_g", Regr = TRUE, Classif = TRUE)
ml_m = private$assert_learner(ml_m, "ml_m", Regr = FALSE, Classif = TRUE)

private$task_type = list(
  "ml_g" = NULL,
  "ml_m" = NULL,
  "ml_r" = NULL)
ml_g = private$assert_learner(ml_g, "ml_g", Regr = TRUE, Classif = TRUE)
ml_m = private$assert_learner(ml_m, "ml_m", Regr = FALSE, Classif = TRUE)
ml_r = private$assert_learner(ml_r, "ml_r", Regr = FALSE, Classif = TRUE)

PLIV

For PLIV this initialization does not happen, but it still seems to work as expected; see

ml_g = private$assert_learner(ml_g, "ml_g", Regr = TRUE, Classif = FALSE)
ml_m = private$assert_learner(ml_m, "ml_m", Regr = TRUE, Classif = FALSE)
ml_r = private$assert_learner(ml_r, "ml_r", Regr = TRUE, Classif = FALSE)

Possible solution

In the base class DoubleML, the private property task_type is initialized to an empty list, which in my view suffices:

task_type = list(),

It is then filled with meaningful content when assert_learner is called for the learners assigned to the different nuisance parts. Therefore, I guess we could simplify by removing the additional nuisance-part-specific initialization to NULL done for PLR, IRM and IIVM.

Miscellaneous

I would furthermore suggest adding some sort of assertion to the helper function dml_cv_predict: basically, I wouldn't accept anything other than "regr" or "classif". The code will fail anyway for any other choice, like NULL, because then the variable resp_name would never be assigned; see the snippet below and the assertion sketch after it:

doubleml-for-r/R/helper.R

Lines 162 to 166 in acb9d46

if (task_type == "regr") {
  resp_name = "response"
} else if (task_type == "classif") {
  resp_name = "prob.1"
}
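
A sketch of the suggested assertion (exact placement at the top of dml_cv_predict() is an assumption), using checkmate, which is already a dependency:

# fail early with a clear message for anything other than "regr" / "classif"
checkmate::assert_choice(task_type, choices = c("regr", "classif"))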

Logarithmic spacing in grid search during tuning

When tuning the LASSO, I didn't find a way to specify the grid with logarithmic spacing, even though that seems natural to me; the default is equal spacing. (A possible workaround is sketched after the snippet.)

library(DoubleML)
library(mlr3)
library(paradox)
library(mlr3tuning)
library(dplyr)  # for %>% and arrange() used at the end of the snippet

# set logger to omit messages during tuning and fitting
lgr::get_logger("mlr3")$set_threshold("warn")
lgr::get_logger("bbotk")$set_threshold("warn")

set.seed(3141)
n_obs = 500
n_vars = 100
theta = rep(3, 3)
# generate matrix-like objects and use the corresponding wrapper
X = matrix(stats::rnorm(n_obs * n_vars), nrow = n_obs, ncol = n_vars)
y = X[, 1:3, drop = FALSE] %*% theta  + stats::rnorm(n_obs)
df = data.frame(y, X)

doubleml_data = double_ml_data_from_data_frame(df,
                                               y_col = "y",
                                               d_cols = c("X1"),
                                               x_cols = c("X2","X3"))

set.seed(1234)
ml_g = lrn("regr.glmnet")
ml_m = lrn("regr.glmnet")
doubleml_plr = DoubleMLPLR$new(doubleml_data, ml_g, ml_m)

par_grids = list(
  "ml_g" = ParamSet$new(list(
    ParamDbl$new("lambda", lower = 0.0001, upper = 10))),  # I WANT LOGARITHMIC SPACING HERE, eg. 1e-5, 1e-4, 1e-3, etc
  "ml_m" =  ParamSet$new(list(
    ParamDbl$new("lambda", lower = 0.05, upper = 0.1))))

tune_settings = list(terminator = trm("evals", n_evals = 100),
                     algorithm = tnr("grid_search", resolution = 11),
                     rsmp_tune = rsmp("cv", folds = 5),
                     measure = list("ml_g" = msr("regr.mse"),
                                    "ml_m" = msr("regr.mse")))

doubleml_plr$tune(param_set = par_grids, tune_settings = tune_settings)

doubleml_plr$tuning_res

# BUT THE SPACING ON THE GRID IS LINEAR
doubleml_plr$tuning_res$X1$ml_g[[1]]$tuning_result[[1]]$tuning_archive %>% arrange(lambda)
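
A possible workaround (not from the original post): paradox parameter sets support a trafo, so one can tune lambda on an equally spaced grid in log(lambda) and exponentiate before the value reaches the learner. A sketch for ml_g:

ps_log = ParamSet$new(list(
  ParamDbl$new("lambda", lower = log(1e-5), upper = log(10))))
ps_log$trafo = function(x, param_set) {
  x$lambda = exp(x$lambda)  # grid search is linear in log(lambda)
  return(x)
}
par_grids[["ml_g"]] = ps_log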


Support for Categorical D in PLIV

I'm having some trouble with PLIV in R. It doesn't appear to support binary treatments, as it doesn't let you use a classifier for ml_r. Am I doing something wrong here? Thanks

Missing exception handling for infinite / missing predictions

There is no exception handling in place in case a learner produces infinite or missing predictions. Basically, the estimates silently become NAs without a warning or exception.

See for example:

library(DoubleML)

g = function(x) {
  res = sin(x)^2
  return(res)
}

m = function(x, nu = 0, gamma = 1) {
  xx = sinh(gamma) / (cosh(gamma) - cos(x - nu))
  res = 0.5 / pi * xx
  return(res)
}

dgp1_irmiv = function(theta, N, k) {
  
  b = 1 / (1:k)
  sigma = clusterGeneration::genPositiveDefMat(k, "unifcorrmat")$Sigma
  
  X = mvtnorm::rmvnorm(N, sigma = sigma)
  G = g(as.vector(X %*% b))
  M = m(as.vector(X %*% b))
  
  pr_z = 1 / (1 + exp(-(1) * X[, 1] * b[5] + X[, 2] * b[2] + rnorm(N)))
  z = rbinom(N, 1, pr_z)
  
  U = rnorm(N)
  pr = 1 / (1 + exp(-(1) * (0.5 * z + X[, 1] * (-0.5) + X[, 2] * 0.25 - 0.5 * U + rnorm(N))))
  d = rbinom(N, 1, pr)
  err = rnorm(N)
  
  y = theta * d + G + 4 * U + err
  
  data = data.frame(y, d, z, X)
  
  return(data)
}

set.seed(1282)
df = dgp1_irmiv(0.5, 1000, 20)
Xnames = names(df)[names(df) %in% c("y", "d", "z") == FALSE]
dml_data = double_ml_data_from_data_frame(df,
                                          y_col = "y",
                                          d_cols = "d", x_cols = Xnames, z_col = "z")

ml_g = mlr3::lrn("regr.rpart", cp = 0.01, minsplit = 20)
ml_m = mlr3::lrn("classif.rpart", cp = 0.01, minsplit = 20)
ml_r = mlr3::lrn("classif.rpart", cp = 0.01, minsplit = 20)

set.seed(3141)
double_mliivm_obj = DoubleMLIIVM$new(
  data = dml_data,
  n_folds = 5,
  ml_g = ml_g,
  ml_m = ml_m,
  ml_r = ml_r,
  dml_procedure = "dml2",
  trimming_threshold = 0,
  score = "LATE")
double_mliivm_obj$fit()
print(double_mliivm_obj$coef)
print(double_mliivm_obj$se)

It gets even more confusing if one then calls the method bootstrap(), which results in the exception

double_mliivm_obj$bootstrap()
Error in double_mliivm_obj$bootstrap(): Apply fit() before bootstrap().

which obviously does not point to the root cause; the advice to apply fit() will obviously not fix the issue either.

I propose to implement a check for finite predictions similar to the check in the Python package: https://github.com/DoubleML/doubleml-for-py/blob/b3cbdb572fce435c18ec67ca323645900fc901b5/doubleml/_utils.py#L204-L208

Score function with weighting

The PDF documentation (link) suggests that a user can specify the "score" parameter (at least for the PLM estimator) in the call to DoubleMLPLR$new(), "for example, to adjust the DML estimators in terms of a re-weighting".

This is exactly my situation. How can I pass the weights?

The template for the score function in the example doesn't have a weight parameter. I want to do something like the following; the only change I made to the code is adding the Ws (a closure-based approach is sketched after the snippet):

# Here:
# y: dependent variable
# d: treatment variable
# g_hat: predicted values from regression of Y on X's
# m_hat: predicted values from regression of D on X's
# smpls: sample split under consideration, can be ignored in this example

score_manual = function(y, d, g_hat, m_hat, smpls) {
  resid_y = y - g_hat
  resid_d = d - m_hat
  psi_a = -1 * resid_d * resid_d * W   # HERE
  psi_b = resid_d * resid_y * W      # and HERE
  psis = list(psi_a = psi_a, psi_b = psi_b)
  return(psis)
}
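
One possible approach (an assumption on my side, not a documented API): since the score callable's signature is fixed, capture the weight vector via a closure:

make_weighted_score = function(W) {
  # W is captured from the enclosing environment; length must equal nrow(data)
  function(y, d, g_hat, m_hat, smpls) {
    resid_y = y - g_hat
    resid_d = d - m_hat
    psi_a = -1 * resid_d * resid_d * W
    psi_b = resid_d * resid_y * W
    return(list(psi_a = psi_a, psi_b = psi_b))
  }
}
# dml_plr = DoubleMLPLR$new(data, ml_g, ml_m, score = make_weighted_score(W))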

R CMD check Note about doi

Found the following URLs which should use \doi (with the DOI name only):
  File ‘fetch_401k.Rd’:
    https://doi.org/10.1111/ectj.12097
  File ‘fetch_bonus.Rd’:
    https://doi.org/10.1111/ectj.12097
  File ‘make_iivm_data.Rd’:
    http://dx.doi.org/10.2139/ssrn.3619201
  File ‘make_plr_CCDDHNR2018.Rd’:
    https://doi.org/10.1111/ectj.12097

Failing unit test on CRAN solaris

See https://www.r-project.org/nosvn/R.check/r-patched-solaris-x86/DoubleML-00check.html

checking tests ... [139s/138s] ERROR
  Running ‘testthat_regression_tests.R’ [138s/138s]
Running the tests in ‘tests/testthat_regression_tests.R’ failed.
Complete output:
  >
  > library("testthat")
  > library("patrick")
  > library("DoubleML")
  >
  > testthat::test_check("DoubleML")
  ── ERROR (test-double_ml_iivm.R:33:3): Unit tests for IIVM: cv_glmnet_dml2_LATE_
  Error: 'NA' indices are not (yet?) supported for sparse Matrices
  Backtrace:
       █
    1. ├─rlang::eval_tidy(code, args)
    2. └─DoubleML:::dml_irmiv(...) test-double_ml_iivm.R:33:2
    3. └─mlr3::resample(task_p, ml_p, resampling_p, store_models = TRUE) helper-11-dml_irmiv.R:91:2
    4. └─future.apply::future_lapply(...)
    5. └─future.apply:::future_xapply(...)
    6. ├─future::value(fs)
    7. └─future:::value.list(fs)
    8. ├─future::resolve(...)
    9. └─future:::resolve.list(...)
   10. └─future:::signalConditionsASAP(obj, resignal = FALSE, pos = ii)
   11. └─future:::signalConditions(...)
  
  ── ERROR (test-double_ml_irm.R:33:3): Unit tests for IRM: cv_glmnet_dml2_ATE_1_0
  Error: missing value where TRUE/FALSE needed
  Backtrace:
       █
    1. ├─rlang::eval_tidy(code, args)
    2. └─DoubleML:::dml_irm(...) test-double_ml_irm.R:33:2
    3. └─mlr3::resample(task_m, ml_m, resampling_m, store_models = TRUE) helper-10-dml_irm.R:69:2
    4. └─future.apply::future_lapply(...)
    5. └─future.apply:::future_xapply(...)
    6. ├─future::value(fs)
    7. └─future:::value.list(fs)
    8. ├─future::resolve(...)
    9. └─future:::resolve.list(...)
   10. └─future:::signalConditionsASAP(obj, resignal = FALSE, pos = ii)
   11. └─future:::signalConditions(...)

Calculating RMSE using D and Y nuisance model residuals

Does the DoubleML package have an option to output the residuals of the nuisance models, for example when computing the RMSE for predicting D and Y, in order to compare different methods for estimating them? Maybe there is an existing code example somewhere that I couldn't find.

Thank you
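
One possible approach (a sketch, not an official feature): fit with store_predictions = TRUE and compute the residuals from the exported nuisance predictions, e.g. for a PLR model (object names assumed):

dml_plr$fit(store_predictions = TRUE)
g_hat = dml_plr$predictions$ml_g[, 1, 1]  # E[Y|X] predictions, first repetition/treatment
m_hat = dml_plr$predictions$ml_m[, 1, 1]  # E[D|X] predictions
rmse_y = sqrt(mean((dml_data$data$y - g_hat)^2))
rmse_d = sqrt(mean((dml_data$data$d - m_hat)^2))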

Support for ensembles of multiple learners for ml_g and ml_m

Thanks for developing this great package.

I was wondering whether you support estimating E[Y|X] or E[D|X] with super learners, i.e., using multiple learners to cross-fit E[Y|X] and E[D|X] as below, where the weight of each learner is estimated based on its cross-fitting performance. I was also wondering how the double ML framework could work together with the SuperLearner package. (A possible route is sketched below.)

learner = lrns(c("regr.glm","regr.gam","regr.bart"), k=2)
ml_g = learner$clone()

Many thanks!!!
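
One possible route (a sketch under the assumption that mlr3pipelines is acceptable as an alternative to the SuperLearner package): build a stacking pipeline and wrap it into a GraphLearner, which can then be passed as ml_g / ml_m:

library(mlr3)
library(mlr3learners)
library(mlr3pipelines)

# base learners are cross-validated and combined by a linear super learner
base_learners = lrns(c("regr.glmnet", "regr.ranger"))
stack = ppl("stacking", base_learners = base_learners,
            super_learner = lrn("regr.lm"), folds = 3)
ml_g = GraphLearner$new(stack)
ml_m = ml_g$clone(deep = TRUE)
# dml_plr = DoubleMLPLR$new(dml_data, ml_g = ml_g, ml_m = ml_m)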

[Bug]: Tuning with default `tune_settings` fails

Describe the bug

Tuning with the default tune_settings fails due to a typo in

terminator = mlr3tunin::trm("evals", n_evals = 20),

Minimum reproducible code snippet

library(DoubleML)
library(mlr3)
library(mlr3learners)
library(data.table)
set.seed(2)
ml_g = lrn("regr.ranger", num.trees = 10, max.depth = 2)
ml_m = ml_g$clone()
obj_dml_data = make_plr_CCDDHNR2018(alpha = 0.5)
dml_plr_obj = DoubleMLPLR$new(obj_dml_data, ml_g, ml_m)
par_grids = list("ml_g" = paradox::ParamSet$new(list(
    paradox::ParamInt$new("num.trees", lower = 1, upper = 10, default = 5))),
    "ml_m" = paradox::ParamSet$new(list(
    paradox::ParamInt$new("num.trees", lower = 1, upper = 10, default = 5))))

dml_plr_obj$tune(param_set=par_grids)

Expected Result

No exception

Actual Result

Exception

 Error in loadNamespace(name) : there is no package called ‘mlr3tunin’ 

Versions

> sessionInfo()
R version 4.0.4 (2021-02-15)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Ubuntu 21.10

> packageVersion('DoubleML')
[1] ‘0.4.1’
> packageVersion('mlr3')
[1] ‘0.11.0.9000’

Minor inconsistency between user guide notation and the code?

I have a question about a potential inconsistency between the notation in the user guide and the code. If it is not an inconsistency, then it must reflect my own misunderstanding of the notation in the user guide (if so, my apologies in advance).

Looking at the documentation for estimating the variance of the estimator, I would describe the expression as J_{0}^{-2} multiplied by the mean of \psi^2, where the latter term is represented by the double sum over folds and observations within each fold. The N^{-1} here serves to compute the mean over this double sum.

However, in the code here, the quantity above is premultiplied by an additional N^{-1} term.

I suspect the code is correct, so this seems more like an issue with the notation in the documentation. I looked at Theorem 3.2 in the published paper but had trouble identifying where the extra N^{-1} term comes from.

Is this a notation problem or am I missing something?

Thanks,
Brett

different learners for different treatments in Simultaneous Inference

Hi,
I have an idea for extending the package's simultaneous inference functionality.

When the natures of the treatments differ (continuous vs. binary), it is not possible to run, for example, the function DoubleMLPLR, because there is only one choice of the argument ml_m to estimate the related nuisance function. To elaborate, consider two treatments d1 and d2 which are continuous and binary, respectively.
To estimate the nuisance function for causal inference on d1, we must apply a machine learning method for the Gaussian family, while for causal inference on d2 we must apply a machine learning method for logistic regression. Thus, users must define a continuous version of d2 or convert d1 to a binary treatment in order to have treatments of the same nature.

However, in some cases the program automatically detects the nature of the treatments (for example the regr.gbm learner from the package gbm).

If the argument ml_m could be a list of the same length as d_cols, we could run DoubleMLPLR in situations with treatments of different natures.

Thanks for your hot pkg!

Dimensions of properties like the estimated coefficients in docu (R)

This issue might be relevant for R as well:

DoubleML/doubleml-for-py#85

We should ...

  • ... briefly explain the dimensions of all_coef (etc.) in the documentation
  • ... add column and row names to the dimensions of the object

In R, we should think about the (currently private) get__...-methods in this context, for example:

get__psi_a = function() self$psi_a[, private$i_rep, private$i_treat],

Method `set_ml_nuisance_params` overwrites hyperparameters from initialization

Setting hyperparameters via the method set_ml_nuisance_params results in a call to

lrn$param_set$values = params

which, according to the mlr3 docs (I also tested it), results in replacing all hyperparameters by their defaults except the ones in the list params ("Note that this operation replaces all previously set hyperparameter values.").

Is that the intended behavior? I would favor the following, which replaces only the explicitly mentioned hyperparameters:

lrn$param_set$values = mlr3misc::insert_named(lrn$param_set$values, params)

Bug in the aggregation of standard errors from repeated cross-fitting

I think there is a bug in the aggregation of standard errors from repeated cross-fitting.

Description

Implementation

Unit Tests

  • We don't seem to have unit tests being sensitive for the bug fix in the aggregation formula. In my ongoing major update of the unit test framework I will add this extension.
  • In our R vs. Python package tests we so far didn't have a test case with repeated cross-fitting, and therefore the difference between the implementations didn't become visible. I added such a test case in DoubleML/doubleml-py-vs-r#4. As the tests are now sensitive to the aggregation formula, they also fail in that PR, which will be resolved once the R package gets its bug fix.

Documentation

[Unit Test Extension]: Implement "default setting unit tests"

In the python package DoubleML, we do have unit tests for model defaults, see https://github.com/DoubleML/doubleml-for-py/blob/master/doubleml/tests/test_doubleml_model_defaults.py. The intention behind such "default setting unit tests" is twofold:

  1. It should assert that the defaults are valid / meaningful, i.e., the code runs through successfully with default values for the input parameters.
  2. The unit tests serve as a reminder to update the documentation of defaults in case a default value is being changed.

Such "default setting unit tests" could be done for the initialization of the model classes as well as for the most important methods.

Note: Such "default setting unit tests" would have been sensitive for bugs like #155 & #156

Rename of column "row_id" -> "row_ids"

The next mlr3 version will include a refactoring which breaks your package.
The column "row_id" of as.data.table.Prediction() will be renamed to "row_ids" (cf. mlr-org/mlr3#547).
It would be great if you could update your package accordingly and implement a workaround along the lines of the following to ease the transition:

tab = as.data.table(prediction)
data.table::setnames(tab, old = "row_id", new = "row_ids", skip_absent = TRUE) # rename col for mlr3 <= 0.10.0

Thanks and let us know if you are missing some getters or converters.

Messages in DoubleML

Handle messages in DoubleML during instantiation, fitting, tuning etc. of models

Categorical D

Hi,
thanks for developing this!
this might be a silly question, but would it be possible for D to be categorical?
Best,
Hans

mlr3tuning API change

We will upload a new version of mlr3tuning to CRAN.
This line will no longer work.

tuning_archive = tuning_instance$archive$data()

You have to use tuning_instance$archive$data, since the data table is now accessible via a public field instead of a method.
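
In short:

tuning_archive = tuning_instance$archive$data()  # old API: method call
tuning_archive = tuning_instance$archive$data    # new API: public field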

Misleading entries in evaluated score functions & predictions in case of estimation without cross-fitting (`apply_cross_fitting = FALSE`)

Description

When a DoubleML model is estimated with apply_cross_fitting = FALSE and n_folds = 2, there are misleading entries in the evaluated score functions as well as in the exported predictions. For all indices in the test set the entries are correct and are used for estimating the causal parameter(s), etc. However, for all indices which are not part of the test set, the predictions are filled with zeros. These zero predictions are then also used when evaluating the score functions. The resulting entries in psi, psi_a and psi_b are never used, but in my view they are still misleading. I would propose to fill the predictions and the evaluated score function values with NA instead of zeros and non-meaningful values, respectively (a sketch follows the example below).

Example

> ml_g = lrn("regr.ranger", num.trees = 10, max.depth = 2)
> ml_m = ml_g$clone()
> obj_dml_data = make_plr_CCDDHNR2018(alpha = 0.5)
> dml_plr_obj = DoubleMLPLR$new(obj_dml_data, ml_g, ml_m,
+                               n_folds=2, apply_cross_fitting = FALSE)
> dml_plr_obj$fit(store_predictions = TRUE)
> dml_plr_obj$predictions$ml_g[1:10,,]
 [1]  0.0000000  0.5718869  0.7672342  0.6698870  0.0000000  1.5471172  1.1006015  0.0000000  0.0000000
[10] -0.2258972
> dml_plr_obj$psi[1:10]
 [1] -0.5875342  0.8229460 -0.3105735  0.6203550  0.2614734  0.7999844  1.1656477  0.3464782 -0.6397427
[10]  0.7832788
> obj_dml_data$data$y[1:10]*obj_dml_data$data$d[1:10] == dml_plr_obj$psi_b[1:10]
 [1]  TRUE FALSE FALSE FALSE  TRUE FALSE FALSE  TRUE  TRUE FALSE
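
A hypothetical sketch of the proposed fix (container and variable names assumed): initialize the export containers with NA instead of zero so that unused entries are visibly missing:

preds = rep(NA_real_, n_obs)           # instead of a zero-initialized vector
preds[test_ids] = prediction$response  # only test-set entries get filled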

[Bug]: the results of the Lasso learner differ from the other learners

Describe the bug

Hi, the DML package is really useful for me and I am using it to conduct my master thesis. I have tried LightGBM/RF/XGBoost/Lasso as learners. The results of LightGBM/RF/XGBoost are similar, but the results of Lasso are rather different. Below is a part of the results. Can you help me with this issue?

Minimum reproducible code snippet

LassoFormula = xnames[1]

for (name in xnames[-1]) {
  LassoFormula = paste0(LassoFormula, '+', name)
}
LassoFormula = paste0('~(', LassoFormula, ")^2")

LassoFormula = formula(LassoFormula)  # create the formula
# features_flex = data.frame(model.matrix(LassoFormula, dataS))  # second-order terms
model_data = data.table("y" = dataS[, ynames],
                        "d" = dataS[, "IndShareSuccessful"],
                        features_flex)

################################ Lasso

DMLLasso = function(yname) {
  set.seed(123)
  lasso = lrn("regr.cv_glmnet", nfolds = 5, s = "lambda.min")        # g model
  lasso_class = lrn("classif.cv_glmnet", nfolds = 5, s = "lambda.min")  # m model

  data_dml_flex = DoubleMLData$new(model_data,
                                   y_col = paste0('y.', yname),
                                   d_cols = 'd.IndShareSuccessful')
  dml_plr_lasso = DoubleMLPLR$new(data_dml_flex,
                                  ml_g = lasso,
                                  ml_m = lasso_class,
                                  n_folds = 3)
  dml_plr_lasso$fit()
  dml_plr_lasso$summary()
}

Expected Result

I think the results of different learners should be similar.

Actual Result

indicator    Lasso     lightGBM      Xgboost       RF
a           -0.105    -13.424***    -13.410***    -13.025***
b            0.001      0.265***      0.275***      0.259***
c            0.003      0.186***      0.187***      0.185***
d           -0.017     20.600***     21.417***     20.165***
e            1.701     13.227***     13.282***     12.853***
f           16.672     10.549*       15.637**       8.339

Versions

sessionInfo()
R version 4.1.2 (2021-11-01)
Platform: x86_64-apple-darwin17.0 (64-bit)
Running under: macOS Big Sur 11.5.2

Matrix products: default
LAPACK: /Library/Frameworks/R.framework/Versions/4.1/Resources/lib/libRlapack.dylib

locale:
[1] zh_CN.UTF-8/zh_CN.UTF-8/zh_CN.UTF-8/C/zh_CN.UTF-8/zh_CN.UTF-8

attached base packages:
[1] grid stats graphics grDevices utils datasets methods base

other attached packages:
[1] xtable_1.8-4 mlr3tuning_0.9.0 paradox_0.7.1
[4] mlr3learners.lightgbm_0.0.10.9001 xgboost_1.5.0.2 WRS2_1.1-3
[7] tmcn_0.2-13 forcats_0.5.1 stringr_1.4.0
[10] purrr_0.3.4 readr_2.0.1 tidyr_1.1.3
[13] tibble_3.1.3 tidyverse_1.3.1 stargazer_5.2.2
[16] sm_2.2-5.7 scales_1.1.1 readxl_1.3.1
[19] randomForest_4.6-14 ranger_0.13.1 qdapRegex_0.7.2
[22] plotrix_3.8-2 plotly_4.10.0 ggplot2_3.3.5
[25] plm_2.4-3 np_0.60-11 nnet_7.3-16
[28] nlme_3.1-153 MultiGHQuad_1.2.0 mvtnorm_1.1-3
[31] mlr3_0.13.1 MatchIt_4.3.2 lubridate_1.7.10
[34] lm.beta_1.5-1 lfe_2.8-7.1 lawstat_3.4
[37] knitr_1.33 kSamples_1.2-9 SuppDists_1.1-9.7
[40] kknn_1.3.1 gridExtra_2.3 grf_2.0.2
[43] gmm_1.6-6 glmnet_4.1-3 ggridges_0.5.3
[46] frequentdirections_0.1.0 fixest_0.10.1 expm_0.999-6
[49] Matrix_1.3-4 DoubleML_0.4.1 dgof_1.2
[52] data.table_1.14.2 contextual_0.9.8.4 coda_0.19-4
[55] BTYDplus_1.2.0 BTYD_2.4.3 dplyr_1.0.7
[58] optimx_2021-6.12 hypergeo_1.2-13 broom_0.7.11
[61] bit64_4.0.5 bit_4.0.4 beepr_1.3
[64] AER_1.2-9 survival_3.2-13 sandwich_3.0-1
[67] lmtest_0.9-38 zoo_1.8-9 car_3.0-12
[70] carData_3.0-5 devtools_2.4.3 usethis_2.1.3

loaded via a namespace (and not attached):
[1] SparseM_1.81 ModelMetrics_1.2.2.2 R.methodsS3_1.8.1 maxLik_1.5-2
[5] clusterGeneration_1.3.7 R.utils_2.11.0 rpart_4.1-15 doParallel_1.0.16
[9] generics_0.1.0 callr_3.7.0 future_1.23.0 tzdb_0.1.2
[13] xml2_1.3.2 assertthat_0.2.1 gower_0.2.2 xfun_0.24
[17] hms_1.1.0 fansi_0.5.0 dbplyr_2.1.1 igraph_1.2.6
[21] DBI_1.1.1 htmlwidgets_1.5.3 reshape_0.8.8 stats4_4.1.2
[25] ellipsis_0.3.2 backports_1.2.1 vctrs_0.3.8 remotes_2.4.1
[29] quantreg_5.86 abind_1.4-5 caret_6.0-78 cachem_1.0.5
[33] withr_2.4.2 itertools_0.1-3 mlr3learners_0.5.1 vroom_1.5.4
[37] bdsmatrix_1.3-4 checkmate_2.0.0 prettyunits_1.1.1 cluster_2.1.2
[41] lazyeval_0.2.2 crayon_1.4.1 elliptic_1.4-0 recipes_0.1.17
[45] pkgconfig_2.0.3 pkgload_1.2.3 rlang_0.4.11 globals_0.14.0
[49] lifecycle_1.0.0 MatrixModels_0.5-0 palmerpenguins_0.1.0 modelr_0.1.8
[53] Kendall_2.2 cellranger_1.1.0 rprojroot_2.0.2 matrixStats_0.61.0
[57] mc2d_0.1-21 boot_1.3-28 reprex_2.0.1 base64enc_0.1-3
[61] processx_3.5.2 png_0.1-7 viridisLite_0.4.0 rjson_0.2.21
[65] R.oo_1.24.0 shape_1.4.6 parallelly_1.28.1 jpeg_0.1-9
[69] memoise_2.0.1 magrittr_2.0.1 plyr_1.8.6 audio_0.1-10
[73] compiler_4.1.2 miscTools_0.6-26 RColorBrewer_1.1-2 cli_3.1.0
[77] listenv_0.8.0 ps_1.6.0 htmlTable_2.3.0 Formula_1.2-4
[81] MASS_7.3-54 tidyselect_1.1.1 stringi_1.7.3 latticeExtra_0.6-29
[85] tools_4.1.2 mlr3misc_0.10.0 future.apply_1.8.1 parallel_4.1.2
[89] rstudioapi_0.13 uuid_0.1-4 foreign_0.8-81 foreach_1.5.1
[93] cubature_2.0.4.2 prodlim_2019.11.13 digest_0.6.27 lava_1.6.10
[97] quadprog_1.5-8 Rcpp_1.0.7 R.devices_2.17.0 httr_1.4.2
[101] contfrac_1.1-12 Rdpack_2.1.3 colorspace_2.0-2 rvest_1.0.1
[105] fs_1.5.0 readstata13_0.10.0 splines_4.1.2 lgr_0.4.3
[109] bbotk_0.4.0 conquer_1.2.1 sessioninfo_1.2.1 dreamerr_1.2.3
[113] jsonlite_1.7.2 timeDate_3043.102 testthat_3.1.0 ipred_0.9-12
[117] R6_2.5.1 Hmisc_4.6-0 pillar_1.6.2 htmltools_0.5.1.1
[121] glue_1.4.2 fastmap_1.1.0 deSolve_1.30 class_7.3-19
[125] codetools_0.2-18 pkgbuild_1.2.0 utf8_1.2.2 lattice_0.20-45
[129] numDeriv_2016.8-1.1 curl_4.3.2 desc_1.4.0 munsell_0.5.0
[133] iterators_1.0.13 haven_2.4.3 reshape2_1.4.4 gtable_0.3.0
[137] rbibutils_2.2.7

packageVersion('DoubleML')
[1] ‘0.4.1’
packageVersion('mlr3')
[1] ‘0.13.1’

Pass score function for IRM

Follow-up to #124 (comment), but for the IRM model.

I want to modify the score function for IRM to allow weights. In the example provided in the manual, I only see how to pass g_hat and m_hat for the PLM; however, IRM requires passing g0_hat and g1_hat. How do I do that? (A hedged sketch follows the snippet below.)

The PLM score function from the manual:

# Here:
# y: dependent variable
# d: treatment variable
# g_hat: predicted values from regression of Y on X's
# m_hat: predicted values from regression of D on X's
# smpls: sample split under consideration, can be ignored in this example

score_manual = function(y, d, g_hat, m_hat, smpls) {
  resid_y = y - g_hat
  resid_d = d - m_hat
  psi_a = -1 * resid_d * resid_d 
  psi_b = resid_d * resid_y 
  psis = list(psi_a = psi_a, psi_b = psi_b)
  return(psis)
}
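
A hedged sketch (the argument names g0_hat, g1_hat and m_hat are assumptions based on the question; please check the reference manual): the ATE-type IRM score with weights, with W again captured via a closure:

make_weighted_irm_score = function(W) {
  function(y, d, g0_hat, g1_hat, m_hat, smpls) {
    u0 = y - g0_hat
    u1 = y - g1_hat
    # weighted doubly robust ATE score
    psi_b = (g1_hat - g0_hat + d * u1 / m_hat - (1 - d) * u0 / (1 - m_hat)) * W
    psi_a = rep(-1.0, length(y)) * W
    return(list(psi_a = psi_a, psi_b = psi_b))
  }
}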

[Bug]: Unable to perform ensemble learners

Describe the bug

Hello, I am very fascinated by this great algorithm for causal machine learning analysis.
But when I tried to test ensemble learners in R, I got the error below, indicating that the learners for ml_g and ml_m must be of class 'LearnerRegr'.

I first got this error when trying it on the interactive IV model.
I also tried the exact same code described on the user guide website; the link to the page is posted below.
https://docs.doubleml.org/stable/examples/R_double_ml_pipeline.html?highlight=ensemble

I tried every solution I could think of, but was unable to get past the error message below.
Please help.

Thanks in advance!

Error in private$assert_learner(ml_g, "ml_g", Regr = TRUE, Classif = FALSE) :
Invalid learner provided for ml_g: must be of class 'LearnerRegr'

Minimum reproducible code snippet

# Initiate a new DoubleML object and estimate with the graph learner

set.seed(123)
obj_dml_plr_sim_pipe_ensemble = DoubleMLPLR$new(dml_data_sim, ml_g = ensemble_pipe_regr, ml_m = ensemble_pipe_regr)
Error in private$assert_learner(ml_g, "ml_g", Regr = TRUE, Classif = FALSE) :
Invalid learner provided for ml_g: must be of class 'LearnerRegr'
obj_dml_plr_sim_pipe_ensemble$fit()
Error: object 'obj_dml_plr_sim_pipe_ensemble' not found

Expected Result

Results of the Double ML with ensemble learner

Actual Result

Error in private$assert_learner(ml_g, "ml_g", Regr = TRUE, Classif = FALSE) :
Invalid learner provided for ml_g: must be of class 'LearnerRegr'

Versions

sessionInfo()
R version 4.1.2 (2021-11-01)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running under: Windows 10 x64 (build 22000)

Matrix products: default

locale:
[1] LC_COLLATE=Korean_Korea.949 LC_CTYPE=Korean_Korea.949 LC_MONETARY=Korean_Korea.949 LC_NUMERIC=C LC_TIME=Korean_Korea.949

attached base packages:
[1] splines stats graphics grDevices utils datasets methods base

other attached packages:
[1] mlr3pipelines_0.4.0 data.table_1.14.2 mlr3learners_0.5.2 mlr3_0.13.3 DoubleML_0.4.1 sandwich_3.0-1 lmtest_0.9-39 zoo_1.8-9
[9] MASS_7.3-54 glmnet_4.1-3 Matrix_1.3-4 rpart_4.1-15 fastDummies_1.6.3 np_0.60-11 causalweight_1.0.2 ranger_0.13.1
[17] openxlsx_4.2.5 ivreg_0.6-1 forcats_0.5.1 stringr_1.4.0 dplyr_1.0.7 purrr_0.3.4 readr_2.1.1 tidyr_1.1.4
[25] tibble_3.1.6 ggplot2_3.3.5 tidyverse_1.3.1

loaded via a namespace (and not attached):
[1] paradox_0.8.0 cubature_2.0.4.4 colorspace_2.0-2 ellipsis_0.3.2 class_7.3-19 rprojroot_2.0.2 fs_1.5.2
[8] rstudioapi_0.13 proxy_0.4-26 listenv_0.8.0 remotes_2.4.2 MatrixModels_0.5-0 mlr3tuning_0.13.0 fansi_0.5.0
[15] mvtnorm_1.1-3 lubridate_1.8.0 xml2_1.3.3 codetools_0.2-18 knitr_1.37 pkgload_1.2.4 Formula_1.2-4
[22] jsonlite_1.7.2 broom_0.7.11 dbplyr_2.1.1 hdm_0.3.1 compiler_4.1.2 httr_1.4.2 backports_1.4.1
[29] assertthat_0.2.1 fastmap_1.1.0 cli_3.1.0 prettyunits_1.1.1 quantreg_5.88 htmltools_0.5.2 tools_4.1.2
[36] igraph_1.2.11 gtable_0.3.0 glue_1.6.0 clusterGeneration_1.3.7 Rcpp_1.0.7 carData_3.0-5 SuperLearner_2.0-28
[43] cellranger_1.1.0 vctrs_0.3.8 iterators_1.0.14 xfun_0.29 ps_1.6.0 globals_0.14.0 testthat_3.1.1
[50] rvest_1.0.2 lifecycle_1.0.1 future_1.24.0 scales_1.1.1 lgr_0.4.3 hms_1.1.1 parallel_4.1.2
[57] SparseM_1.81 readstata13_0.10.0 curl_4.3.2 yaml_2.2.1 gam_1.20.1 stringi_1.7.6 desc_1.4.0
[64] foreach_1.5.2 e1071_1.7-9 checkmate_2.0.0 palmerpenguins_0.1.0 pkgbuild_1.3.1 boot_1.3-28 zip_2.2.0
[71] shape_1.4.6 rlang_0.4.12 pkgconfig_2.0.3 evaluate_0.14 lattice_0.20-45 processx_3.5.2 tidyselect_1.1.1
[78] parallelly_1.31.0 magrittr_2.0.1 R6_2.5.1 generics_0.1.1 nnls_1.4 DBI_1.1.2 pillar_1.6.4
[85] haven_2.4.3 withr_2.4.3 survival_3.2-13 abind_1.4-5 future.apply_1.8.1 modelr_0.1.8 crayon_1.4.2
[92] car_3.0-12 xgboost_1.5.2.1 uuid_1.0-3 utf8_1.2.2 tzdb_0.2.0 rmarkdown_2.11 grid_4.1.2
[99] readxl_1.3.1 callr_3.7.0 mlr3misc_0.10.0 bbotk_0.5.1 reprex_2.0.1 digest_0.6.29 LARF_1.4
[106] munsell_0.5.0 quadprog_1.5-8

packageVersion('DoubleML')
[1] ‘0.4.1’
packageVersion('mlr3')
[1] ‘0.13.3’

[Bug]: Tuning with default `tune_settings` fails

Describe the bug

Tuning with the default tune_settings fails (even after fixing #155). If tune_settings$measure is set to NULL, then according to the docs (https://docs.doubleml.org/r/stable/reference/DoubleML.html#method-tune) default measures should be used.

Minimum reproducible code snippet

library(DoubleML)
library(mlr3)
library(mlr3learners)
library(data.table)
set.seed(2)
ml_g = lrn("regr.ranger", num.trees = 10, max.depth = 2)
ml_m = ml_g$clone()
obj_dml_data = make_plr_CCDDHNR2018(alpha = 0.5)
dml_plr_obj = DoubleMLPLR$new(obj_dml_data, ml_g, ml_m)
par_grids = list("ml_g" = paradox::ParamSet$new(list(
    paradox::ParamInt$new("num.trees", lower = 5, upper = 6, default = 5))),
    "ml_m" = paradox::ParamSet$new(list(
    paradox::ParamInt$new("num.trees", lower = 5, upper = 6, default = 5))))
tune_settings = list(
    n_folds_tune = 5,
    rsmp_tune = mlr3::rsmp("cv", folds = 5),
    measure = NULL,
    terminator = mlr3tuning::trm("evals", n_evals = 20),
    algorithm = mlr3tuning::tnr("grid_search"),
    resolution = 5)

dml_plr_obj$tune(param_set=par_grids, tune_settings = tune_settings)

Expected Result

No exception

Actual Result

Exception

Error in private$assert_tune_settings(tune_settings) : 
  Assertion on 'tune_settings$measure' failed: Must be of type 'list', not 'NULL'. 

Versions

> sessionInfo()
R version 4.0.4 (2021-02-15)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Ubuntu 21.10

> packageVersion('DoubleML')
[1] ‘0.4.1’
> packageVersion('mlr3')
[1] ‘0.11.0.9000’

Meaningful error message if sample splitting was not yet set

Calling

dml_plr_obj = DoubleMLPLR$new(make_plr_CCDDHNR2018(),
                              lrn("regr.ranger"), lrn("regr.ranger"),
                              draw_sample_splitting = FALSE)
dml_plr_obj$fit()

produces the error message

 Error in .__ResamplingCustom__instantiate(self = self, private = private,  : 
  Assertion on 'train_sets' failed: Must be of type 'list', not 'NULL'. 

Something more meaningful along the lines of https://github.com/DoubleML/doubleml-for-py/blob/a574e0afcab0e7cce475925f1344399e75dd4a11/doubleml/double_ml.py#L238-L239 would be preferable. (A sketch of such a guard follows.)
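
A sketch of such a guard at the top of fit() (the private field name is an assumption):

if (is.null(private$smpls)) {
  stop(paste("Sample splitting not specified. Either draw samples via",
    "draw_sample_splitting() or set them via set_sample_splitting()."))
}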

Use active bindings in the R6 OOP implementation

For public fields, R6 active bindings (https://r6.r-lib.org/articles/Introduction.html#active-bindings) are pretty similar to properties (with getters and setters) in Python. I am considering using them in our implementation as well. Currently we have a lot of public fields, many of which actually shouldn't be settable. Basically, you can pretty easily screw things up by setting some of the properties to an invalid value after initialization, say

> dml_plr = DoubleMLPLR$new(dml_data, ml_g, ml_m)
> dml_plr$dml_procedure = 'an_invalid_algo_name'
> dml_plr$fit()
Controls variables do not include other treatment variables
Set treatment variable d to d1.
Error in self$all_coef[private$i_treat, private$i_rep] = value : 
  number of items to replace is not a multiple of replacement length

This does not result in a meaningful error message. In Python we already rely heavily on properties with (or without) setters, so we can use this as a basis to move towards active bindings. (A minimal sketch follows.)
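
A minimal sketch of a read-only active binding in R6 (an illustration, not the package's actual code):

Example = R6::R6Class("Example",
  private = list(.dml_procedure = "dml2"),
  active = list(
    dml_procedure = function(value) {
      if (missing(value)) {
        private$.dml_procedure  # getter
      } else {
        stop("can't set field dml_procedure")  # read-only: no setter
      }
    }
  )
)

e = Example$new()
e$dml_procedure          # "dml2"
# e$dml_procedure = "x"  # errors instead of silently corrupting state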
