We are building a Russian Drama Corpus with files encoded in TEI-P5. Our corpus comprises 75 plays so far, stemming from ilibrary, Wikisource and РВБ, converted into TEI and corrected by us. There will be more.
If you just want to download the corpus in its current state, do this:
svn export https://github.com/dracor-org/rusdracor/trunk/tei
RusDraCor was first presented on June 29, 2017, at the Corpora 2017 conference in St. Petersburg (our slides here) and on July 11, 2017, at the "Digitizing the stage" conference in Oxford. The social network data we extracted so far can be explored with our Shinyapp.