A gold standard dataset for Pashto. The source data come from three selected books, published in 1986, 2002, and 2006 respectively, vary in fonts, printing, and digitization quality.
Details on the dataset and its sources are described in this article. https://doi.org/10.6017/ital.v40i1.12553