This repository contains R, Stata, and Python packages, all called causaldata, which contain data sets that can be used to implement the code examples in causal inference textbooks.
At the moment, the packages include data sets from The Effect by Nick Huntington-Klein and Causal Inference: The Mixtape by Scott Cunningham. The judge_fe data set from The Mixtape is too large to include, and so is omitted.
All data sets come with documentation in the form of variable labels, although the exact format of the labels varies from language to language.
The R package can be installed with:
# If necessary: install.packages('remotes')
remotes::install_github('NickCH-K/causaldata/R/')
The Stata package can be installed with:
net install causaldata, from("https://raw.githubusercontent.com/NickCH-K/causaldata/master/Stata/")
To install the Python package, use the green Code button on this page to download this repository, unzip it, change the directory to the causaldata/Python folder, and install with:
python setup.py install
Or, if you're working in an IPython-based environment like Spyder, you might use:
runfile('the/full/path/to/causaldata/Python/setup.py', wdir='your/working/directory', args='install')
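After installing, one quick way to confirm that Python can find the package is to look it up without importing it. The snippet below is a minimal sketch that uses only the standard library. The helper name is_installed is made up for this example and is not part of causaldata.

```python
import importlib.util


def is_installed(package_name):
    """Return True if Python can locate a top-level package with this name.

    Hypothetical helper for illustration only; it is not part of causaldata.
    It only checks that the package can be found, not that it imports cleanly.
    """
    return importlib.util.find_spec(package_name) is not None


if __name__ == "__main__":
    if is_installed("causaldata"):
        print("causaldata was found in this Python environment.")
    else:
        print("causaldata was not found; try the install step again.")
```

If the check reports that the package was not found, the install may have gone into a different Python environment than the one running the check.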