This repository holds all code to transform the PURE (a Dataset of Public Requirements Documents) dataset into plain text files, including the source files and the output files.
The purpose of this repository is to allow for the PURE dataset to be used in a more accessible, usable way for applying natural language processing techniques to the domain of requirements engineering.
The raw data is sourced from the PURE dataset page on Zenodo, a platform for open research.