๐ค English | ๐ ็ฎไฝไธญๆ
ScrapydWeb: A full-featured web UI for Scrapyd cluster management, with Scrapy log analysis & visualization supported.
๐ How to efficiently manage your distributed web scraping projects
View contents
-
๐ Scrapyd Cluster Management
- ๐ฏ All Scrapyd JSON API Supported
- โ๏ธ Group, filter and select any number of nodes
- ๐ฑ๏ธ Execute command on multinodes with just a few clicks
-
๐ Scrapy Log Analysis
- ๐ข Stats collection
- ๐ Progress visualization
- ๐ Logs categorization
-
๐ Enhancements
- ๐๏ธ HTML caching for the Log and Stats page
- ๐ Basic auth for web UI
- ๐ฆ Auto eggify your projects
- ๐ง Email notice
- ๐ฑ Mobile UI
View contents
โ Make sure that ๐ Scrapyd has been installed and started on all of your hosts.
- Use pip:
pip install scrapydweb
- Use git:
git clone https://github.com/my8100/scrapydweb.git
cd scrapydweb
python setup.py install
- Start ScrapydWeb via the
scrapydweb
command. (a config file would be generated for customizing settings on the first startup.) - Visit http://127.0.0.1:5000 (It's recommended to use Google Chrome for a better experience.)
The latest version of Google Chrome, Firefox, and Safari.
View contents
$ git clone https://github.com/my8100/scrapydweb.git
$ cd scrapydweb
# To create isolated Python environments
$ pip install virtualenv
$ virtualenv venv/scrapydweb
# Or specify your Python interpreter: $ virtualenv -p /usr/local/bin/python3.7 venv/scrapydweb
$ source venv/scrapydweb/bin/activate
# Install dependent libraries
(scrapydweb) $ python setup.py install
(scrapydweb) $ pip install pytest
(scrapydweb) $ pip install coverage
# Make sure Scrapyd has been installed and started, then update the custom_settings item in tests/conftest.py
(scrapydweb) $ vi tests/conftest.py
(scrapydweb) $ curl http://127.0.0.1:6800
(scrapydweb) $ coverage run --source=scrapydweb -m pytest tests/test_a_factory.py -s -vv
(scrapydweb) $ coverage run --source=scrapydweb -m pytest tests -s -vv
(scrapydweb) $ coverage report
# To create an HTML report, check out htmlcov/index.html
(scrapydweb) $ coverage html
View contents
- Front End
- Back End
Detailed changes for each release are documented in the ๐ HISTORY.md.
my8100 |
---|
Kaisla |
---|
This project is licensed under the GNU General Public License v3.0 - see the ๐ LICENSE file for details.