githubw2015 / crawler-boss
Scrape Boss Zhipin (Boss直聘) job listings with Go. IP proxies, browser simulation, fast and efficient.
My settings are as follows:
const (
seleniumPath = `/Users/dapeng/Downloads/others/selenium-server-4.0.0-beta-4.jar`
chromeDriverPath = `/Applications/Google\ Chrome.app/Contents/chromedriver`
port = 4444
)
It fails with this error:
panic: server did not respond on port 4444
goroutine 203 [running]:
main.main.func1(0x14de540, 0xc000250fa0, 0x145b4b5, 0x4, 0x60571c4, 0x145d4fe, 0x6)
/Users/dapeng/Documents/code/go/src/proj/crawler/main.go:89 +0x10db
created by main.main
/Users/dapeng/Documents/code/go/src/proj/crawler/main.go:87 +0x22c
exit status 2
Then I set port to 8080:
const (
seleniumPath = `/Users/dapeng/Downloads/others/selenium-server-4.0.0-beta-4.jar`
chromeDriverPath = `/Applications/Google\ Chrome.app/Contents/chromedriver`
port = 8080
)
It then reports this error:
panic: unknown error - 33: Unable to create new service: ChromeDriverService
Build info: version: '3.141.59', revision: 'e82be7d358', time: '2018-11-14T08:25:53'
System info: host: 'B-K1H5JHD2-2305', ip: 'fe80:0:0:0:1c8b:e051:22cd:551a%en0', os.name: 'Mac OS X', os.arch: 'x86_64', os.version: '11.4', java.version: '16.0.1'
Driver info: driver.version: unknown
goroutine 36 [running]:
main.main.func1(0x0, 0x0, 0x145cda2, 0x6, 0x60571c4, 0x145d4fe, 0x6)
/Users/dapeng/Documents/code/go/src/proj/crawler/main.go:123 +0x10bf
created by main.main
/Users/dapeng/Documents/code/go/src/proj/crawler/main.go:87 +0x22c
exit status 2
Running the jar directly with java:
➜ ~ java -jar Downloads/others/selenium-server-4.0.0-beta-4.jar hub
18:56:59.414 INFO [LoggingOptions.configureLogEncoding] - Using the system default encoding
18:56:59.420 INFO [OpenTelemetryTracer.createTracer] - Using OpenTelemetry for tracing
18:56:59.553 INFO [BoundZmqEventBus.<init>] - XPUB binding to [binding to tcp://*:4442, advertising as tcp://[fe80:0:0:0:1c8b:e051:22cd:551a%en0]:4442], XSUB binding to [binding to tcp://*:4443, advertising as tcp://[fe80:0:0:0:1c8b:e051:22cd:551a%en0]:4443]
18:56:59.621 INFO [UnboundZmqEventBus.<init>] - Connecting to tcp://[fe80:0:0:0:1c8b:e051:22cd:551a%en0]:4442 and tcp://[fe80:0:0:0:1c8b:e051:22cd:551a%en0]:4443
18:56:59.650 INFO [UnboundZmqEventBus.<init>] - Sockets created
18:57:00.655 INFO [UnboundZmqEventBus.<init>] - Event bus ready
18:57:01.353 INFO [Hub.execute] - Started Selenium Hub 4.0.0-beta-4 (revision 29f46d02dd): http://192.168.1.126:4444
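The log shows the hub listening on port 4444, but a listening hub is not necessarily ready to create sessions. A minimal sketch that asks the server for its status, assuming it exposes the W3C WebDriver status endpoint at `/status` with a `value` object containing `ready` and `message`; `parseStatus` and the `status` type are hypothetical names, and the URL is the one printed in the log:

```go
package main

import (
	"encoding/json"
	"fmt"
	"io"
	"net/http"
	"time"
)

// status mirrors the "value" object of a WebDriver status response.
type status struct {
	Ready   bool   `json:"ready"`
	Message string `json:"message"`
}

// parseStatus (hypothetical helper) decodes the body of a GET /status
// response. Fields other than "value" are ignored.
func parseStatus(body []byte) (status, error) {
	var resp struct {
		Value status `json:"value"`
	}
	err := json.Unmarshal(body, &resp)
	return resp.Value, err
}

func main() {
	// Address taken from the hub log line; adjust for your machine.
	const hubURL = "http://192.168.1.126:4444"

	client := &http.Client{Timeout: 3 * time.Second}
	res, err := client.Get(hubURL + "/status")
	if err != nil {
		fmt.Println("status request failed:", err)
		return
	}
	defer res.Body.Close()

	body, err := io.ReadAll(res.Body)
	if err != nil {
		fmt.Println("reading response failed:", err)
		return
	}
	st, err := parseStatus(body)
	if err != nil {
		fmt.Println("unexpected response:", err)
		return
	}
	fmt.Printf("ready=%v message=%q\n", st.Ready, st.Message)
}
```

If `ready` comes back false, one thing to check is whether any node has registered: the `hub` role used in the command above only routes sessions, whereas the `standalone` role also runs them.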
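What should I do? (PS: running chromedriver this way also requires installing Java... it feels much harder to use than the Python equivalent 😂) Could the cause be the version of the jar, Java, or Chrome?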