We compile the largest-known list of potentially blocked websites from various sources government orders, court orders, user reports, RTI by CIS, list of potentially blocked websites by OONI and miscellaneous.
Structure for repository is as follows:
- dataset/
- potentially_blocked_urls.txt all the urls curated from the various sources
- potentially_blocked_websites.txt all the unique hostnames curated from the various sources
Sources:
-
Court Orders
-
Govt Orders
-
Misc