banarasi04 / hdfschecksumforlocalfile Goto Github PK
View Code? Open in Web Editor NEWThis project forked from srch07/hdfschecksumforlocalfile
This program / jar creates checksum, with same algorithm that hadoop uses to create on hdfs files. So integrity of file can be verified on local and hadoop system. Can also, be used to check if file exist based on checksum, before uploading and cluttering hdfs with duplicate files.