Data analysis of Lagou
-
crawl job data from Lagou, and get the latest info of jobs
-
data analysis and visualize
-
crawl job details info and generate word cloud as Job Impression
- Python Version >= 3.4
- Third Party Library:
pip3 install requests
pip3 install beautifulsoup4
pip3 install jieba
pip3 install openpyxl
-
clone this project from github
-
change the file path in source code
-
run lagou_spider.py to get job data and output them with a Excel file
-
run hot_words.py to cut sentences, and return TOP30 hot words ----V1.3 updated
For more information, please visit my answer at Zhihu.
In addition, there is an another repository which may help you!