fanhuaandluomu / pkulaw_spider
Crawls the PKULaw (北大法宝) case site http://www.pkulaw.cn/Case/
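Hello, I'm a beginner. Is there a way to return the total number of cases by year and city? For example: the total number of intellectual property cases in Beijing in 2000 is 1000. I urgently need the number of intellectual property cases for every prefecture-level city nationwide for 2000-2009, and I'm not sure whether this is possible. Thank you!
What should I do when the txt file is written in this kind of encoding?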
b'41\xe3\x80\x81\xe6\xb1\x9f\xe8\x8b\x8f\xe7\x9c\x81\xe6\x89\xac\xe5\xb7\x9e\xe5\xb8\x82\xe4\xb8\xad\xe7\xba\xa7\xe4\xba\xba\xe6\xb0\x91\xe6\xb3\x95\xe9\x99\xa2\xe5\x8f\x91\xe5\xb8\x838\xe8\xb5\xb7\xe6\x9c\x8d\xe5\x8a\xa1\xe4\xbf\x9d\xe9\x9a\x9c\xe4\xb8\xad\xe5\xb0\x8f\xe5\xbe\xae\xe4\xbc\x81\xe4\xb8\x9a\xe5\x81\xa5\xe5\xba\xb7\xe5\x8f\x91\xe5\xb1\x95\xe5\x85\xb8\xe5\x9e\x8b\xe6\xa1\x88\xe4\xbe\x8b\xe4\xb9\x8b\xe4\xb8\x83\xef\xbc\x9a\xe6\x89\xac\xe5\xb7\x9e\xe5\xb8\x82\xe5\xbc\x80\xe5\x8f\x91\xe5\x8c\xba\xe6\x9f\x90\xe7\xbb\x8f\xe8\x90\xa5\xe9\x83\xa8\xe4\xb8\x8e\xe6\x9f\x90\xe4\xbf\x9d\xe9\x99\xa9\xe8\x82\xa1\xe4\xbb\xbd\xe6\x9c\x89\xe9\x99\x90\xe5\x85\xac\xe5\x8f\xb8\xe6\x89\xac\xe5\xb7\x9e\xe4\xb8\xad\xe5\xbf\x83\xe6\x94\xaf\xe5\x85\xac\xe5\x8f\xb8\xe6\xb6\x89\xe7\x96\xab\xe6\x83\x85\xe4\xbf\x9d\xe9\x99\xa9\xe7\x90\x86\xe8\xb5\x94\xe6\xa1\x88\xe2\x80\x94\xe2\x80\x94\xe5\xa6\xa5\xe5\x96\x84\xe5\xae\xa1\xe7\x90\x86\xe6\xb6\x89\xe7\x96\xab\xe6\x83\x85\xe4\xbf\x9d\xe9\x99\xa9\xe5\x90\x88\xe5\x90\x8c\xef\xbc\x8c\xe6\x8a\xa4\xe8\x88\xaa\xe4\xb8\xad\xe5\xb0\x8f\xe5\xbe\xae\xe4\xbc\x81\xe4\xb8\x9a\xe5\xa4\x8d\xe5\xb7\xa5\xe5\xa4\x8d\xe4\xba\xa7' 95b2ca8d4055fce16601f6fb461bb3ae0cd8f2c8caf71853bdfb
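The text above looks like the `repr` of a Python `bytes` object holding UTF-8 data, which usually happens when the bytes are converted with `str(...)` and written out without being decoded first. A minimal sketch of two possible fixes follows, assuming the crawler runs on Python 3 and the data is UTF-8. The helper names `decode_title` and `repair_repr` are hypothetical and are not part of pkulaw_spider.

```python
import ast


def decode_title(raw: bytes) -> str:
    """Decode UTF-8 bytes into text before writing them to a file."""
    return raw.decode("utf-8")


def repair_repr(literal: str) -> str:
    """Recover text from a bytes literal that was already saved as "b'...'"."""
    value = ast.literal_eval(literal)  # parses the literal back into bytes
    if not isinstance(value, bytes):
        raise ValueError("expected a bytes literal")
    return value.decode("utf-8")


if __name__ == "__main__":
    # First few bytes of the title shown in the report above.
    sample = b"41\xe3\x80\x81\xe6\xb1\x9f\xe8\x8b\x8f\xe7\x9c\x81"
    print(decode_title(sample))
    print(repair_repr(repr(sample)))
```

When writing the decoded text, opening the output file with `open(path, "w", encoding="utf-8")` avoids depending on the system's default encoding. If a saved line has extra text after the closing quote, only the `b'...'` part should be passed to `repair_repr`.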
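I logged in from my university's IP, but I could only download two txt files; from the third one onward the response becomes forbidden. Could the author please look into this?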
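The report does not establish why the site starts answering with forbidden, so the following is only a sketch under the assumption that the block is rate-based: it waits progressively longer after each HTTP 403 before retrying. The function `fetch_with_backoff`, its parameters, and the `fetch` callable (which returns a status code and body) are hypothetical and are not taken from pkulaw_spider.

```python
import time
from typing import Callable, Tuple


def fetch_with_backoff(
    fetch: Callable[[str], Tuple[int, str]],
    url: str,
    max_attempts: int = 4,
    base_delay: float = 2.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call fetch(url); after each HTTP 403, wait base_delay * 2**attempt and retry."""
    for attempt in range(max_attempts):
        status, body = fetch(url)
        if status == 200:
            return body
        if status != 403:
            raise RuntimeError(f"unexpected status {status} for {url}")
        if attempt < max_attempts - 1:
            # Exponential backoff: 2 s, 4 s, 8 s with the default base_delay.
            sleep(base_delay * 2 ** attempt)
    raise RuntimeError(f"still forbidden after {max_attempts} attempts: {url}")
```

If the limit is tied to the account or subscription rather than to request rate, waiting will not help, and the download quota would have to be checked with the site itself.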
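Could you provide contact information?
It runs without errors and I used the same conditions as in the image, but it returns 0 results.
Dear developer, hello!
First of all, thank you for developing this program. I have never learned Python, so I don't know whether the problem is in how I am running it.
I am using Ubuntu 14.04, and the command-line log is as follows: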
benjamin@benjaminpc:/media/benjamin/disk/pkulaw_spider$ python crawl_v3.py
input date info (eg:2017_01_01-2017_09_01):
2017_01_01-2017_09_01
start_date: 2017.01.01
end_date: 2017.09.01
input classcode1(an you):(eg:007)
007
input classcode3(fa yuan):(eg:01)
01
input keyword:pageNum: 1
1
347
page:0 has 347 different case.
2017_01_01-2017_09_01+007+01+_log/0.txt-1 load success..
2017_01_01-2017_09_01+007+01+_log/0.txt-2 load success..
2017_01_01-2017_09_01+007+01+_log/0.txt-3 load success..
....... (rest omitted)
Looking forward to your reply~
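For readers unfamiliar with the prompts in the log above: the date input `2017_01_01-2017_09_01` is echoed back as `start_date: 2017.01.01` and `end_date: 2017.09.01`. The sketch below shows one way such a conversion could be written. It is an illustration only; `parse_date_range` is a hypothetical name, and how `crawl_v3.py` actually parses its input is not shown in this thread.

```python
from datetime import datetime
from typing import Tuple


def parse_date_range(text: str) -> Tuple[str, str]:
    """Turn 'YYYY_MM_DD-YYYY_MM_DD' into ('YYYY.MM.DD', 'YYYY.MM.DD')."""
    start_raw, sep, end_raw = text.strip().partition("-")
    if not sep:
        raise ValueError("expected two dates separated by '-'")
    # strptime also rejects impossible dates such as 2017_02_30.
    start = datetime.strptime(start_raw, "%Y_%m_%d")
    end = datetime.strptime(end_raw, "%Y_%m_%d")
    if start > end:
        raise ValueError("start date is after end date")
    return start.strftime("%Y.%m.%d"), end.strftime("%Y.%m.%d")


if __name__ == "__main__":
    start_date, end_date = parse_date_range("2017_01_01-2017_09_01")
    print("start_date:", start_date)
    print("end_date:", end_date)
```

Validating the range up front makes a malformed date fail immediately with a clear message, before any requests are sent.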