be666 / neocrawler Goto Github PK
View Code? Open in Web Editor NEWThis project forked from ahkimkoo/neocrawler
Nodejs Crawler, including schedule, spider, web ui config, proxy modules. using nodejs, redis/ssdb, hbase, phantomjs. css selector extraction rules and regex extraction rules supported.
License: BSD 3-Clause "New" or "Revised" License