MINERVA-Dataset-Generation
Validating Dataset Generated Using NOMOS in Fossology
We use Nomos to label the licenses. Each file is labeled either with the license_headers entry whose regex it matches, or with one of the other labels such as Unclassified_licenses, No_License_found, Public-domain, Restricted, etc. This is a baseline validation for the text files generated using both algorithms. The terminal command to run it is:
sudo nomos -J -d <folder_with_files>
To validate the files using multiple cores (3 cores in this example):
sudo nomos -J -d <folder_with_files> -n 3
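If the validation needs to be scripted, the two commands above could be wrapped in a small Python helper. This is only a minimal sketch, not part of the repository: the function names `build_nomos_command` and `run_nomos` and the folder name `generated_files` are hypothetical, and the only Nomos flags used are the ones shown above (`-J`, `-d`, `-n`). The output is returned as raw text because the exact structure of the Nomos output is not described here.

```python
import subprocess
from typing import List, Optional


def build_nomos_command(folder: str, cores: Optional[int] = None) -> List[str]:
    """Build the nomos argument list (hypothetical helper).

    Uses only the flags from the commands above: -J, -d and, optionally, -n.
    """
    cmd = ["sudo", "nomos", "-J", "-d", folder]
    if cores is not None:
        if cores < 1:
            raise ValueError("cores must be a positive integer")
        # Append the core count, mirroring `-n 3` in the multi-core command.
        cmd += ["-n", str(cores)]
    return cmd


def run_nomos(folder: str, cores: Optional[int] = None) -> str:
    """Run nomos on `folder` and return its raw standard output as text.

    Assumes nomos is installed and that sudo can run it without prompting.
    """
    result = subprocess.run(
        build_nomos_command(folder, cores),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit status
    )
    return result.stdout


if __name__ == "__main__":
    # Only print the command here; call run_nomos(...) to actually execute it.
    print(" ".join(build_nomos_command("generated_files", cores=3)))
```

Keeping the command construction separate from the execution makes it possible to check the arguments without having Nomos installed.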