.NET library for crawling a filesystem and ingesting its contents into Elasticsearch. Includes an Events API for watching file changes in cloud file storage applications (e.g. Nasuni).
We pull the office, project (name and number), and category fields from files using three settings fields:
OfficeSiteExtractRegex
ProjectExtractRegex
PipeCategorizationRegex
If any of them is missing, we get a fatal error. I think we should allow these to be left unset (and then either omit the field or fall back to a default value), and also allow the user to set a static value for project/office, possibly under a separate settings key.
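To make the proposal concrete, here is a rough sketch of what the settings could look like. The three regex keys are the existing ones. `StaticOffice` and `StaticProject` are hypothetical key names, the pattern and its named group are made-up placeholders, and where these keys sit in the settings file is also an assumption.

```jsonc
{
  // Existing key. The pattern below is only an illustrative placeholder.
  "OfficeSiteExtractRegex": "^/data/(?<office>[^/]+)/",

  // Proposed behavior: ProjectExtractRegex and PipeCategorizationRegex are
  // left out here, and that would no longer be a fatal error.

  // Hypothetical keys for static values, used when no regex is set
  // (or when the regex does not match).
  "StaticOffice": null,
  "StaticProject": "12345 Example Project"
}
```

Whether a static value should override a regex match or only act as a fallback is an open question.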
The README mentions appsettings.example.json and jobsettings.example.json, but neither file is in the repo. They need to be added, with comments, so possibly as .jsonc files.
If an Elastic cluster is not using Kerberos, users can't make use of NetworkCredentials, so it would be good to let them set up a crawl using Basic auth or ApiKey auth, potentially configured in appsettings.json.
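One possible shape for this in appsettings.json is sketched below. The `ElasticAuth` section and every key inside it are hypothetical; nothing like it exists in the library today.

```jsonc
{
  // Hypothetical section for choosing how the crawler authenticates.
  "ElasticAuth": {
    // One of "Kerberos" (current behavior), "Basic", or "ApiKey".
    "Mode": "ApiKey",

    // Read only when Mode is "Basic".
    "Username": "",
    "Password": "",

    // Read only when Mode is "ApiKey".
    "ApiKeyId": "",
    "ApiKey": ""
  }
}
```

Since these values are secrets, it may be better to let them come from environment variables or another secret store rather than a committed file.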