mmisiewicz / archivespark Goto Github PK
View Code? Open in Web Editor NEWThis project forked from helgeho/archivespark
An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
License: MIT License