Giter Site home page Giter Site logo

maicius / qqzonemood Goto Github PK

View Code? Open in Web Editor NEW
521.0 20.0 130.0 58.65 MB

QQZone mood spider and analysis. QQ空间多线程爬虫和数据挖掘。提供线上服务,扫码登陆即可自动爬取和分析数据,还有网易云年度报告风格的数据展示;使用docker-compose打包程序,方便部署;额外提供QQ空间抽奖小程序。

Home Page: http://app.xiaomaidong.com

License: MIT License

Python 79.09% HTML 11.19% CSS 0.49% JavaScript 9.14% Dockerfile 0.09%
qzone qqzone qq spider mood thread flask docker slides slider

qqzonemood's Issues

登录

登录时经常出现qq号和识别码不匹配和qq号和cookie不匹配的问题,可我输入的时候是没有错的

[安全问题]代码内可能包含敏感信息

貌似你QQzone里面的代码中包含了你的QQ和密码耶,这些敏感信息,你确定不删?我记得b站当年就是github里泄露了些信息,导致内部API手册等外泄了耶

(泄露位置InterestingCrawler/QQZone/QQZoneShuoshuo.py)

redis无法连接:Failed to connect redis: Error -2 connecting to redis:6379. Name or service not known.

docker版本我可以正常运行
然后我按照readme
修改userinfo.json.example为文件userinfo.json,并填好QQ号、QQ密码、保存数据用的文件名前缀;
随后我打开了redis,然后
python3 src/web/server.py

但是:
`36d3b270ee04:python -u /opt/project/src/web/server.py
['/opt/project/src/web', '/opt/project', '/opt/.pycharm_helpers/pycharm_display', '/usr/local/lib/python36.zip', '/usr/local/lib/python3.6', '/usr/local/lib/python3.6/lib-dynload', '/usr/local/lib/python3.6/site-packages', '/opt/.pycharm_helpers/pycharm_matplotlib_backend', '/opt/project', '/opt/project', '/opt/project/src', '/opt/project']
Traceback (most recent call last):
Failed to connect redis: Error -2 connecting to redis:6379. Name or service not known.
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 439, in connect
sock = self._connect()
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 494, in _connect
raise err
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 482, in _connect
sock.connect(socket_address)
ConnectionRefusedError: [Errno 111] Connection refused

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "/usr/local/lib/python3.6/site-packages/redis/client.py", line 572, in execute_command
connection.send_command(*args)
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 563, in send_command
self.send_packed_command(self.pack_command(*args))
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 538, in send_packed_command
self.connect()
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 442, in connect
raise ConnectionError(self._error_message(e))
redis.exceptions.ConnectionError: Error 111 connecting to 127.0.0.1:6379. Connection refused.

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 439, in connect
sock = self._connect()
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 494, in _connect
raise err
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 482, in _connect
sock.connect(socket_address)
ConnectionRefusedError: [Errno 111] Connection refused

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "/opt/project/src/web/web_util/web_util.py", line 59, in judge_pool
conn.set('redis_success', 1)
File "/usr/local/lib/python3.6/site-packages/redis/client.py", line 1072, in set
return self.execute_command('SET', *pieces)
File "/usr/local/lib/python3.6/site-packages/redis/client.py", line 578, in execute_command
connection.send_command(*args)
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 563, in send_command
self.send_packed_command(self.pack_command(*args))
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 538, in send_packed_command
self.connect()
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 442, in connect
raise ConnectionError(self._error_message(e))
redis.exceptions.ConnectionError: Error 111 connecting to 127.0.0.1:6379. Connection refused.

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 439, in connect
sock = self._connect()
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 464, in _connect
socket.SOCK_STREAM):
File "/usr/local/lib/python3.6/socket.py", line 745, in getaddrinfo
for res in _socket.getaddrinfo(host, port, family, type, proto, flags):
socket.gaierror: [Errno -2] Name or service not known

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "/usr/local/lib/python3.6/site-packages/redis/client.py", line 572, in execute_command
connection.send_command(*args)
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 563, in send_command
self.send_packed_command(self.pack_command(*args))
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 538, in send_packed_command
self.connect()
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 442, in connect
raise ConnectionError(self._error_message(e))
redis.exceptions.ConnectionError: Error -2 connecting to redis:6379. Name or service not known.

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 439, in connect
sock = self._connect()
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 464, in _connect
socket.SOCK_STREAM):
File "/usr/local/lib/python3.6/socket.py", line 745, in getaddrinfo
for res in _socket.getaddrinfo(host, port, family, type, proto, flags):
socket.gaierror: [Errno -2] Name or service not known

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "/opt/project/src/web/server.py", line 17, in
host = judge_pool()
File "/opt/project/src/web/web_util/web_util.py", line 69, in judge_pool
raise e
File "/opt/project/src/web/web_util/web_util.py", line 65, in judge_pool
conn.set('redis_success', 2)
File "/usr/local/lib/python3.6/site-packages/redis/client.py", line 1072, in set
return self.execute_command('SET', *pieces)
File "/usr/local/lib/python3.6/site-packages/redis/client.py", line 578, in execute_command
connection.send_command(*args)
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 563, in send_command
self.send_packed_command(self.pack_command(*args))
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 538, in send_packed_command
self.connect()
File "/usr/local/lib/python3.6/site-packages/redis/connection.py", line 442, in connect
raise ConnectionError(self._error_message(e))
redis.exceptions.ConnectionError: Error -2 connecting to redis:6379. Name or service not known.

Process finished with exit code 1
`

看样子是没有成功连接到redis,我对redis没有什么了解,我找不到我哪个地方出了问题
我使用了
redis-server
启动了redis
并且可以成功redis-cliping 127.0.0.1:6379

请问我哪一步出了问题,谢谢!

本地部署问题

服务器控制台报错,好像爬取数据失败了,显示“服务器比较忙,请稍后再试”

数据分析

数据分析过程第一次挺快的,为啥之后就特别慢?特别长时间也没出来

json以及csv文件

新手一枚,只是希望爬一点数据。为什么代码不会自动创建maicius_friend_detail_list.csv文件,我在文件夹中创建了一个后,运行又出现File "pandas_libs\parsers.pyx", line 545, in pandas._libs.parsers.TextReader.cinit
pandas.errors.EmptyDataError: No columns to parse from file。

如何爬取好友空间

从知乎上看过来的,感谢大神分享,想问一下如果要爬某个好友 A 的空间(有 QQ 号和访问权限),需要修改 src/spider/main.py 中的哪些参数呢?

安装imread出现错误

环境
win10
Python3.7
pycharm

安装imread出现问题

imread/lib/_jpeg.cpp(10): fatal error C1083: Cannot open include file: 'jpeglib.h': No such file or directory
error: command 'C:\Program Files (x86)\Microsoft Visual Studio 14.0\VC\BIN\x86_amd64\cl.exe' failed with exit status 2

Looking in indexes: https://pypi.tuna.tsinghua.edu.cn/simple/
Collecting imread
Using cached https://pypi.tuna.tsinghua.edu.cn/packages/4e/7e/52603151a6746a229f80b3d22a08e468dce9bcf723728a3b99d050372e86/imread-0.7.1.tar.gz
Requirement already satisfied: numpy in d:\program files\python37\lib\site-packages (from imread) (1.17.2)
Installing collected packages: imread
Running setup.py install for imread: started
Running setup.py install for imread: finished with status 'error'
Complete output from command "D:\Program Files\Python37\python.exe" -u -c "import setuptools, tokenize;file='C:\Users\Administrator\AppData\Local\Temp\pycharm-packaging\imread\setup.py';f=getattr(tokenize, 'open', open)(file);code=f.read().replace('\r\n', '\n');f.close();exec(compile(code, file, 'exec'))" install --record C:\Users\Administrator\AppData\Local\Temp\pip-record-jldkfk24\install-record.txt --single-version-externally-managed --compile:
'pkg-config' 不是内部或外部命令,也不是可运行的程序
或批处理文件。
running install
running build
running build_py
creating build
creating build\lib.win-amd64-3.7
creating build\lib.win-amd64-3.7\imread
copying imread\ijrois.py -> build\lib.win-amd64-3.7\imread
copying imread\imread.py -> build\lib.win-amd64-3.7\imread
copying imread\imread_version.py -> build\lib.win-amd64-3.7\imread
copying imread\special.py -> build\lib.win-amd64-3.7\imread
copying imread_init_.py -> build\lib.win-amd64-3.7\imread
creating build\lib.win-amd64-3.7\imread\tests
copying imread\tests\test_bmp.py -> build\lib.win-amd64-3.7\imread\tests
copying imread\tests\test_error.py -> build\lib.win-amd64-3.7\imread\tests
copying imread\tests\test_formats.py -> build\lib.win-amd64-3.7\imread\tests
copying imread\tests\test_imread.py -> build\lib.win-amd64-3.7\imread\tests
copying imread\tests\test_imread_from_blob.py -> build\lib.win-amd64-3.7\imread\tests
copying imread\tests\test_imsave.py -> build\lib.win-amd64-3.7\imread\tests
copying imread\tests\test_jpeg.py -> build\lib.win-amd64-3.7\imread\tests
copying imread\tests\test_png.py -> build\lib.win-amd64-3.7\imread\tests
copying imread\tests\test_rois.py -> build\lib.win-amd64-3.7\imread\tests
copying imread\tests\test_tiff.py -> build\lib.win-amd64-3.7\imread\tests
copying imread\tests\test_webp.py -> build\lib.win-amd64-3.7\imread\tests
copying imread\tests\test_xcf.py -> build\lib.win-amd64-3.7\imread\tests
copying imread\tests_init_.py -> build\lib.win-amd64-3.7\imread\tests
creating build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\0186-0099.roi -> build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\1_webp_a.webp -> build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\arange512_16bit.png -> build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\bit1.png -> build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\diag.xcf -> build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\error.jpg -> build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\error.png -> build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\error.tif -> build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\error.unknown -> build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\error.webp -> build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\ghsystem_flame.jpg -> build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\good -> build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\good.png -> build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\mono.tif -> build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\palette_zero.png -> build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\py-installer-indexed.bmp -> build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\rgba.png -> build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\rois.zip -> build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\stack.tiff -> build\lib.win-amd64-3.7\imread\tests\data
copying imread\tests\data\star1.bmp -> build\lib.win-amd64-3.7\imread\tests\data
warning: build_py: byte-compiling is disabled, skipping.

running build_ext
building 'imread._imread' extension
creating build\temp.win-amd64-3.7
creating build\temp.win-amd64-3.7\Release
creating build\temp.win-amd64-3.7\Release\imread
creating build\temp.win-amd64-3.7\Release\imread\lib
C:\Program Files (x86)\Microsoft Visual Studio 14.0\VC\BIN\x86_amd64\cl.exe /c /nologo /Ox /W3 /GL /DNDEBUG /MD -DNPY_NO_DEPRECATED_API=NPY_1_7_API_VERSION -DPY_ARRAY_UNIQUE_SYMBOL=MahotasImread_PyArray_API_Symbol -DIMREAD_EXCLUDE_WEBP=1 "-ID:\Program Files\Python37\include" "-ID:\Program Files\Python37\include" "-ID:\Program Files\Python37\lib\site-packages\numpy\core\include" "-IC:\Program Files (x86)\Microsoft Visual Studio 14.0\VC\INCLUDE" "-IC:\Program Files (x86)\Microsoft Visual Studio 14.0\VC\ATLMFC\INCLUDE" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\ucrt" "-IC:\Program Files (x86)\Windows Kits\NETFXSDK\4.6.1\include\um" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\shared" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\um" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\winrt" /EHsc /Tpimread/_imread.cpp /Fobuild\temp.win-amd64-3.7\Release\imread/_imread.obj
_imread.cpp
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib/base.h(95): warning C4800: 'int': forcing value to bool 'true' or 'false' (performance warning)
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib/base.h(212): warning C4800: 'int': forcing value to bool 'true' or 'false' (performance warning)
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib/base.h(221): warning C4146: unary minus operator applied to unsigned type, result still unsigned
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib/base.h(221): warning C4267: 'argument': conversion from 'size_t' to 'int', possible loss of data
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib/file.h(20): warning C4996: 'close': The POSIX name for this item is deprecated. Instead, use the ISO C and C++ conformant name: _close. See online help for details.
C:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\ucrt\corecrt_io.h(457): note: see declaration of 'close'
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib/file.h(23): warning C4267: 'argument': conversion from 'size_t' to 'unsigned int', possible loss of data
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib/file.h(23): warning C4996: 'read': The POSIX name for this item is deprecated. Instead, use the ISO C and C++ conformant name: _read. See online help for details.
C:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\ucrt\corecrt_io.h(521): note: see declaration of 'read'
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib/file.h(26): warning C4267: 'argument': conversion from 'size_t' to 'long', possible loss of data
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib/file.h(26): warning C4996: 'lseek': The POSIX name for this item is deprecated. Instead, use the ISO C and C++ conformant name: _lseek. See online help for details.
C:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\ucrt\corecrt_io.h(501): note: see declaration of 'lseek'
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib/file.h(27): warning C4996: 'lseek': The POSIX name for this item is deprecated. Instead, use the ISO C and C++ conformant name: _lseek. See online help for details.
C:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\ucrt\corecrt_io.h(501): note: see declaration of 'lseek'
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib/file.h(28): warning C4996: 'lseek': The POSIX name for this item is deprecated. Instead, use the ISO C and C++ conformant name: _lseek. See online help for details.
C:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\ucrt\corecrt_io.h(501): note: see declaration of 'lseek'
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib/file.h(32): warning C4267: 'argument': conversion from 'size_t' to 'unsigned int', possible loss of data
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib/file.h(32): warning C4996: 'write': The POSIX name for this item is deprecated. Instead, use the ISO C and C++ conformant name: _write. See online help for details.
C:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\ucrt\corecrt_io.h(556): note: see declaration of 'write'
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib/numpy.h(75): warning C4244: 'return': conversion from 'npy_intp' to 'int', possible loss of data
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib/numpy.h(114): warning C4244: 'argument': conversion from 'npy_intp' to 'int', possible loss of data
imread/_imread.cpp(100): warning C4267: 'argument': conversion from 'size_t' to 'const int', possible loss of data
imread/_imread.cpp(104): warning C4996: 'open': The POSIX name for this item is deprecated. Instead, use the ISO C and C++ conformant name: _open. See online help for details.
C:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\ucrt\corecrt_io.h(514): note: see declaration of 'open'
imread/_imread.cpp(134): warning C4800: 'int': forcing value to bool 'true' or 'false' (performance warning)
imread/_imread.cpp(163): warning C4800: 'ImageFormat *': forcing value to bool 'true' or 'false' (performance warning)
imread/_imread.cpp(297): warning C4244: 'initializing': conversion from 'Py_ssize_t' to 'unsigned int', possible loss of data
imread/_imread.cpp(297): warning C4244: 'initializing': conversion from 'Py_ssize_t' to 'const unsigned int', possible loss of data
imread/_imread.cpp(286): warning C4996: 'open': The POSIX name for this item is deprecated. Instead, use the ISO C and C++ conformant name: _open. See online help for details.
C:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\ucrt\corecrt_io.h(514): note: see declaration of 'open'
C:\Program Files (x86)\Microsoft Visual Studio 14.0\VC\BIN\x86_amd64\cl.exe /c /nologo /Ox /W3 /GL /DNDEBUG /MD -DNPY_NO_DEPRECATED_API=NPY_1_7_API_VERSION -DPY_ARRAY_UNIQUE_SYMBOL=MahotasImread_PyArray_API_Symbol -DIMREAD_EXCLUDE_WEBP=1 "-ID:\Program Files\Python37\include" "-ID:\Program Files\Python37\include" "-ID:\Program Files\Python37\lib\site-packages\numpy\core\include" "-IC:\Program Files (x86)\Microsoft Visual Studio 14.0\VC\INCLUDE" "-IC:\Program Files (x86)\Microsoft Visual Studio 14.0\VC\ATLMFC\INCLUDE" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\ucrt" "-IC:\Program Files (x86)\Windows Kits\NETFXSDK\4.6.1\include\um" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\shared" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\um" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\winrt" /EHsc /Tpimread/lib/formats.cpp /Fobuild\temp.win-amd64-3.7\Release\imread/lib/formats.obj
formats.cpp
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib\base.h(95): warning C4800: 'int': forcing value to bool 'true' or 'false' (performance warning)
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib\base.h(212): warning C4800: 'int': forcing value to bool 'true' or 'false' (performance warning)
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib\base.h(221): warning C4146: unary minus operator applied to unsigned type, result still unsigned
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib\base.h(221): warning C4267: 'argument': conversion from 'size_t' to 'int', possible loss of data
C:\Program Files (x86)\Microsoft Visual Studio 14.0\VC\BIN\x86_amd64\cl.exe /c /nologo /Ox /W3 /GL /DNDEBUG /MD -DNPY_NO_DEPRECATED_API=NPY_1_7_API_VERSION -DPY_ARRAY_UNIQUE_SYMBOL=MahotasImread_PyArray_API_Symbol -DIMREAD_EXCLUDE_WEBP=1 "-ID:\Program Files\Python37\include" "-ID:\Program Files\Python37\include" "-ID:\Program Files\Python37\lib\site-packages\numpy\core\include" "-IC:\Program Files (x86)\Microsoft Visual Studio 14.0\VC\INCLUDE" "-IC:\Program Files (x86)\Microsoft Visual Studio 14.0\VC\ATLMFC\INCLUDE" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\ucrt" "-IC:\Program Files (x86)\Windows Kits\NETFXSDK\4.6.1\include\um" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\shared" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\um" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\winrt" /EHsc /Tpimread/lib/numpy.cpp /Fobuild\temp.win-amd64-3.7\Release\imread/lib/numpy.obj
numpy.cpp
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib\base.h(95): warning C4800: 'int': forcing value to bool 'true' or 'false' (performance warning)
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib\base.h(212): warning C4800: 'int': forcing value to bool 'true' or 'false' (performance warning)
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib\base.h(221): warning C4146: unary minus operator applied to unsigned type, result still unsigned
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib\base.h(221): warning C4267: 'argument': conversion from 'size_t' to 'int', possible loss of data
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib\numpy.h(75): warning C4244: 'return': conversion from 'npy_intp' to 'int', possible loss of data
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib\numpy.h(114): warning C4244: 'argument': conversion from 'npy_intp' to 'int', possible loss of data
imread/lib/numpy.cpp(8): warning C4244: 'initializing': conversion from 'npy_intp' to 'int', possible loss of data
imread/lib/numpy.cpp(8): warning C4244: 'initializing': conversion from 'npy_intp' to 'const int', possible loss of data
imread/lib/numpy.cpp(9): warning C4244: 'initializing': conversion from 'npy_intp' to 'int', possible loss of data
imread/lib/numpy.cpp(9): warning C4244: 'initializing': conversion from 'npy_intp' to 'const int', possible loss of data
imread/lib/numpy.cpp(14): warning C4800: 'int': forcing value to bool 'true' or 'false' (performance warning)
imread/lib/numpy.cpp(17): warning C4800: 'int': forcing value to bool 'true' or 'false' (performance warning)
C:\Program Files (x86)\Microsoft Visual Studio 14.0\VC\BIN\x86_amd64\cl.exe /c /nologo /Ox /W3 /GL /DNDEBUG /MD -DNPY_NO_DEPRECATED_API=NPY_1_7_API_VERSION -DPY_ARRAY_UNIQUE_SYMBOL=MahotasImread_PyArray_API_Symbol -DIMREAD_EXCLUDE_WEBP=1 "-ID:\Program Files\Python37\include" "-ID:\Program Files\Python37\include" "-ID:\Program Files\Python37\lib\site-packages\numpy\core\include" "-IC:\Program Files (x86)\Microsoft Visual Studio 14.0\VC\INCLUDE" "-IC:\Program Files (x86)\Microsoft Visual Studio 14.0\VC\ATLMFC\INCLUDE" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\ucrt" "-IC:\Program Files (x86)\Windows Kits\NETFXSDK\4.6.1\include\um" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\shared" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\um" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\winrt" /EHsc /Tpimread/lib/_bmp.cpp /Fobuild\temp.win-amd64-3.7\Release\imread/lib/_bmp.obj
_bmp.cpp
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib\base.h(95): warning C4800: 'int': forcing value to bool 'true' or 'false' (performance warning)
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib\base.h(212): warning C4800: 'int': forcing value to bool 'true' or 'false' (performance warning)
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib\base.h(221): warning C4146: unary minus operator applied to unsigned type, result still unsigned
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib\base.h(221): warning C4267: 'argument': conversion from 'size_t' to 'int', possible loss of data
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib\tools.h(23): warning C4267: 'initializing': conversion from 'size_t' to 'int', possible loss of data
C:\Program Files (x86)\Microsoft Visual Studio 14.0\VC\BIN\x86_amd64\cl.exe /c /nologo /Ox /W3 /GL /DNDEBUG /MD -DNPY_NO_DEPRECATED_API=NPY_1_7_API_VERSION -DPY_ARRAY_UNIQUE_SYMBOL=MahotasImread_PyArray_API_Symbol -DIMREAD_EXCLUDE_WEBP=1 "-ID:\Program Files\Python37\include" "-ID:\Program Files\Python37\include" "-ID:\Program Files\Python37\lib\site-packages\numpy\core\include" "-IC:\Program Files (x86)\Microsoft Visual Studio 14.0\VC\INCLUDE" "-IC:\Program Files (x86)\Microsoft Visual Studio 14.0\VC\ATLMFC\INCLUDE" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\ucrt" "-IC:\Program Files (x86)\Windows Kits\NETFXSDK\4.6.1\include\um" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\shared" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\um" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.10240.0\winrt" /EHsc /Tpimread/lib/_jpeg.cpp /Fobuild\temp.win-amd64-3.7\Release\imread/lib/_jpeg.obj
_jpeg.cpp
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib\base.h(95): warning C4800: 'int': forcing value to bool 'true' or 'false' (performance warning)
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib\base.h(212): warning C4800: 'int': forcing value to bool 'true' or 'false' (performance warning)
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib\base.h(221): warning C4146: unary minus operator applied to unsigned type, result still unsigned
c:\users\administrator\appdata\local\temp\pycharm-packaging\imread\imread\lib\base.h(221): warning C4267: 'argument': conversion from 'size_t' to 'int', possible loss of data
imread/lib/_jpeg.cpp(10): fatal error C1083: Cannot open include file: 'jpeglib.h': No such file or directory
error: command 'C:\\Program Files (x86)\\Microsoft Visual Studio 14.0\\VC\\BIN\\x86_amd64\\cl.exe' failed with exit status 2

----------------------------------------

Command ""D:\Program Files\Python37\python.exe" -u -c "import setuptools, tokenize;file='C:\Users\Administrator\AppData\Local\Temp\pycharm-packaging\imread\setup.py';f=getattr(tokenize, 'open', open)(file);code=f.read().replace('\r\n', '\n');f.close();exec(compile(code, file, 'exec'))" install --record C:\Users\Administrator\AppData\Local\Temp\pip-record-jldkfk24\install-record.txt --single-version-externally-managed --compile" failed with error code 1 in C:\Users\Administrator\AppData\Local\Temp\pycharm-packaging\imread
You are using pip version 19.0.3, however version 19.2.3 is available.
You should consider upgrading via the 'python -m pip install --upgrade pip' command.

多线程会遗漏数据

使用默认的线程数(10)跑程序的时候,某些线程抓取失败,导致爬取的数据不全(QQ 空间说说),请问楼主这个怎么解决呢

使用python版本爬取他人空间意外退出

success to download qr code
2020-04-28 17:39:56.562 Python[1922:60833] WARNING: NSWindow drag regions should only be invalidated on the Main Thread! This will throw an exception in the future. Called from (
0 AppKit 0x00007fff30227289 -[NSWindow(NSWindow_Theme) _postWindowNeedsToResetDragMarginsUnlessPostingDisabled] + 371
1 AppKit 0x00007fff3020ecb2 -[NSWindow _initContent:styleMask:backing:defer:contentView:] + 1416
2 AppKit 0x00007fff3020e723 -[NSWindow initWithContentRect:styleMask:backing:defer:] + 42
3 _macosx.cpython-38-darwin.so 0x000000011833b7c0 -[Window initWithContentRect:styleMask:backing:defer:withManager:] + 80
4 _macosx.cpython-38-darwin.so 0x000000011833eccf FigureManager_init + 319
5 Python 0x000000010d3a9dfc wrap_init + 12
6 Python 0x000000010d35b599 wrapperdescr_call + 121
7 Python 0x000000010d352785 _PyObject_MakeTpCall + 373
8 Python 0x000000010d420ed5 call_function + 533
9 Python 0x000000010d41dc8d _PyEval_EvalFrameDefault + 25677
10 Python 0x000000010d3530d0 function_code_fastcall + 128
11 Python 0x000000010d3525d7 _PyObject_FastCallDict + 247
12 Python 0x000000010d353bdf _PyObject_Call_Prepend + 143
13 Python 0x000000010d3a9d41 slot_tp_init + 145
14 Python 0x000000010d3a5219 type_call + 297
15 Python 0x000000010d352785 _PyObject_MakeTpCall + 373
16 Python 0x000000010d420ed5 call_function + 533
17 Python 0x000000010d41dc8d _PyEval_EvalFrameDefault + 25677
18 Python 0x000000010d3530d0 function_code_fastcall + 128
19 Python 0x000000010d35579a method_vectorcall + 170
20 Python 0x000000010d420e7c call_function + 444
21 Python 0x000000010d41dc8d _PyEval_EvalFrameDefault + 25677
22 Python 0x000000010d421cc4 _PyEval_EvalCodeWithName + 2804
23 Python 0x000000010d35326e _PyFunction_Vectorcall + 270
24 Python 0x000000010d35587c method_vectorcall + 396
25 Python 0x000000010d352aad PyVectorcall_Call + 109
26 Python 0x000000010d41e198 _PyEval_EvalFrameDefault + 26968
27 Python 0x000000010d421cc4 _PyEval_EvalCodeWithName + 2804
28 Python 0x000000010d35326e _PyFunction_Vectorcall + 270
29 Python 0x000000010d420e7c call_function + 444
30 Python 0x000000010d41dd1a _PyEval_EvalFrameDefault + 25818
31 Python 0x000000010d3530d0 function_code_fastcall + 128
32 Python 0x000000010d420e7c call_function + 444
33 Python 0x000000010d41dd1a _PyEval_EvalFrameDefault + 25818
34 Python 0x000000010d421cc4 _PyEval_EvalCodeWithName + 2804
35 Python 0x000000010d35326e _PyFunction_Vectorcall + 270
36 Python 0x000000010d420e7c call_function + 444
37 Python 0x000000010d41dd1a _PyEval_EvalFrameDefault + 25818
38 Python 0x000000010d421cc4 _PyEval_EvalCodeWithName + 2804
39 Python 0x000000010d35326e _PyFunction_Vectorcall + 270
40 Python 0x000000010d420e7c call_function + 444
41 Python 0x000000010d41dc8d _PyEval_EvalFrameDefault + 25677
42 Python 0x000000010d3530d0 function_code_fastcall + 128
43 Python 0x000000010d35587c method_vectorcall + 396
44 Python 0x000000010d352aad PyVectorcall_Call + 109
45 Python 0x000000010d41e198 _PyEval_EvalFrameDefault + 26968
46 Python 0x000000010d3530d0 function_code_fastcall + 128
47 Python 0x000000010d420e7c call_function + 444
48 Python 0x000000010d41dc69 _PyEval_EvalFrameDefault + 25641
49 Python 0x000000010d3530d0 function_code_fastcall + 128
50 Python 0x000000010d420e7c call_function + 444
51 Python 0x000000010d41dc69 _PyEval_EvalFrameDefault + 25641
52 Python 0x000000010d3530d0 function_code_fastcall + 128
53 Python 0x000000010d3557db method_vectorcall + 235
54 Python 0x000000010d352aad PyVectorcall_Call + 109
55 Python 0x000000010d4c07da t_bootstrap + 74
56 Python 0x000000010d473b19 pythread_wrapper + 25
57 libsystem_pthread.dylib 0x00007fff6d0c8109 _pthread_start + 148
58 libsystem_pthread.dylib 0x00007fff6d0c3b8b thread_start + 15
)
2020-04-28 17:39:56.718 Python[1922:60833] WARNING: nextEventMatchingMask should only be called from the Main Thread! This will throw an exception in the future.
2020-04-28 17:39:56.718 Python[1922:60833] *** Assertion failure in +[NSUndoManager _endTopLevelGroupings], /AppleInternal/BuildRoot/Library/Caches/com.apple.xbs/Sources/Foundation/Foundation-1675.129/Foundation/Misc.subproj/NSUndoManager.m:363
2020-04-28 17:39:56.720 Python[1922:60833] *** Assertion failure in +[NSUndoManager _endTopLevelGroupings], /AppleInternal/BuildRoot/Library/Caches/com.apple.xbs/Sources/Foundation/Foundation-1675.129/Foundation/Misc.subproj/NSUndoManager.m:363
2020-04-28 17:39:56.723 Python[1922:60833] *** Terminating app due to uncaught exception 'NSInternalInconsistencyException', reason: '+[NSUndoManager(NSInternal) _endTopLevelGroupings] is only safe to invoke on the main thread.'
*** First throw call stack:
(
0 CoreFoundation 0x00007fff32ff7d07 __exceptionPreprocess + 250
1 libobjc.A.dylib 0x00007fff6bd1c5bf objc_exception_throw + 48
2 CoreFoundation 0x00007fff33020ec8 +[NSException raise:format:arguments:] + 88
3 Foundation 0x00007fff3570bf3d -[NSAssertionHandler handleFailureInMethod:object:file:lineNumber:description:] + 191
4 Foundation 0x00007fff356486ce +[NSUndoManager(NSPrivate) _endTopLevelGroupings] + 440
5 AppKit 0x00007fff301e92bc -[NSApplication run] + 864
6 _macosx.cpython-38-darwin.so 0x0000000118340132 show + 210
7 Python 0x000000010d38f2b6 cfunction_vectorcall_NOARGS + 214
8 Python 0x000000010d420e7c call_function + 444
9 Python 0x000000010d41dc8d _PyEval_EvalFrameDefault + 25677
10 Python 0x000000010d3530d0 function_code_fastcall + 128
11 Python 0x000000010d420e7c call_function + 444
12 Python 0x000000010d41dc8d _PyEval_EvalFrameDefault + 25677
13 Python 0x000000010d421cc4 _PyEval_EvalCodeWithName + 2804
14 Python 0x000000010d35326e _PyFunction_Vectorcall + 270
15 Python 0x000000010d352aad PyVectorcall_Call + 109
16 Python 0x000000010d41e198 _PyEval_EvalFrameDefault + 26968
17 Python 0x000000010d421cc4 _PyEval_EvalCodeWithName + 2804
18 Python 0x000000010d35326e _PyFunction_Vectorcall + 270
19 Python 0x000000010d3557db method_vectorcall + 235
20 Python 0x000000010d352aad PyVectorcall_Call + 109
21 Python 0x000000010d41e198 _PyEval_EvalFrameDefault + 26968
22 Python 0x000000010d421cc4 _PyEval_EvalCodeWithName + 2804
23 Python 0x000000010d35326e _PyFunction_Vectorcall + 270
24 Python 0x000000010d420e7c call_function + 444
25 Python 0x000000010d41dc8d _PyEval_EvalFrameDefault + 25677
26 Python 0x000000010d3530d0 function_code_fastcall + 128
27 Python 0x000000010d35587c method_vectorcall + 396
28 Python 0x000000010d352aad PyVectorcall_Call + 109
29 Python 0x000000010d41e198 _PyEval_EvalFrameDefault + 26968
30 Python 0x000000010d3530d0 function_code_fastcall + 128
31 Python 0x000000010d420e7c call_function + 444
32 Python 0x000000010d41dc69 _PyEval_EvalFrameDefault + 25641
33 Python 0x000000010d3530d0 function_code_fastcall + 128
34 Python 0x000000010d420e7c call_function + 444
35 Python 0x000000010d41dc69 _PyEval_EvalFrameDefault + 25641
36 Python 0x000000010d3530d0 function_code_fastcall + 128
37 Python 0x000000010d3557db method_vectorcall + 235
38 Python 0x000000010d352aad PyVectorcall_Call + 109
39 Python 0x000000010d4c07da t_bootstrap + 74
40 Python 0x000000010d473b19 pythread_wrapper + 25
41 libsystem_pthread.dylib 0x00007fff6d0c8109 _pthread_start + 148
42 libsystem_pthread.dylib 0x00007fff6d0c3b8b thread_start + 15
)
libc++abi.dylib: terminating with uncaught exception of type NSException
Abort trap: 6
运行main.py ,生成二维码后意外退出,macOS 15.4 ,python 3.8.2环境
本人小白一位🤦‍♂️

扫码登陆后提示由于网络等原因登陆失败,请重新扫描二维码或稍后再试

因为没用过docker,摸索了挺久才成功跑起来,不知道是不是操作有问题,扫码登陆的时候提示
由于网络等原因登陆失败,请重新扫描二维码或稍后再试
而手机端是提示登陆成功的
试了很多次,还是这样


我想问题应该是出在这里,现在正在寻找修改的办法
File "F:\桌面\QQZoneMood\src\spider\QQZoneSpider.py", line 911, in get_qzone_token
qzonetoken = re.findall(re.compile("g_qzonetoken = (function(){ try{return "(.*)?""), content)[0]
IndexError: list index out of range

把这一行上的content print出来以后用在线的正则表达式测试 去匹配发现匹配不到东西
怀疑是这个页面和这个项目的之前的页面相比已经发生了改变
但是本人学识水平有限,暂时无法找出解决方法

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.