lfarrell / cinch2 Goto Github PK
View Code? Open in Web Editor NEWThis project forked from slnc-dimp/cinch2
A project to add site crawling, file normalization, natural language processing and increased scalability to the current Cinch project. Cinch is a project to develop a bulk download service to a central repository that will maintain original file timestamps, virus check, extract file level metadata, create file checksums and periodically validate checksums for continued file integrity. Users merely need to upload a list of URLs to download and when the process completes they can download the requested files and file metadata to their local environment. Funding for the first version of CINCH: Capture, Ingest, & Checksum tool was made possible through an IMLS Sparks! Ignition grant.
License: The Unlicense