From the log, do you know which pmid was running when the error happened?
from pubmed-batch-download.
Below is the log output:
Trying to fetch pmid 26633170
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 26633170 succeeded
Trying to fetch pmid 23682673
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 23682673 succeeded
Trying to fetch pmid 25040501
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 25040501 succeeded
Trying to fetch pmid 24628937
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 24628937 succeeded
Trying to fetch pmid 27174497
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 27174497 succeeded
Trying to fetch pmid 27547345
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 27547345 succeeded
Trying to fetch pmid 22610656
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 22610656 succeeded
Trying to fetch pmid 23858657
** Reprint 23858657 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 24998529
** Reprint 24998529 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 27859194
** Reprint 27859194 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 26991916
** Reprint 26991916 cannot be fetched as pubmed does not have a link to its pdf.
Here is the PMID that was running when the error happened:
Trying to fetch pmid 26742956
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in <module>
if type(e)==requests.ConnectionError and '104' in e[0][1][0]:
TypeError: argument of type 'int' is not iterable
The error is handled fine for me, but I added better handling just in case. Can you give it a try? Also, what versions of Python and requests are you running?
Side note: I could fetch 24998529 and 26991916, so I am not sure why yours is giving you that message. If you are familiar enough with Python to add a print statement or two in specific places, let me know and we can debug that on your end.
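For readers following along: the traceback above comes from indexing directly into the exception (`e[0][1][0]`), which assumes one particular nesting of its arguments. Below is a minimal sketch of a more defensive check, assuming the intent is to detect a "connection reset" (errno 104 on Linux). The helper names `iter_leaf_values` and `looks_like_connection_reset` are hypothetical and are not taken from fetch_pdfs.py; this is not the fix that was committed to the project.

```python
# Hypothetical helpers, not part of fetch_pdfs.py.
# Assumption: '104' in the original check refers to errno 104 (ECONNRESET on Linux).
ECONNRESET_LINUX = 104


def iter_leaf_values(value):
    """Yield every non-container value nested inside an exception's args."""
    if isinstance(value, BaseException):
        for item in value.args:
            yield from iter_leaf_values(item)
    elif isinstance(value, (tuple, list)):
        for item in value:
            yield from iter_leaf_values(item)
    else:
        yield value


def looks_like_connection_reset(exc):
    """Return True if errno 104 appears anywhere in the exception's args.

    Walking the args avoids the TypeError and IndexError that direct
    indexing raises when the nesting is different from what was expected.
    """
    for leaf in iter_leaf_values(exc):
        if isinstance(leaf, int) and leaf == ECONNRESET_LINUX:
            return True
        if isinstance(leaf, str) and "Errno 104" in leaf:
            return True
    return False
```

Because the function never indexes into the exception, an exception with no arguments, an integer argument, or a deeper nesting simply yields `False` or `True` instead of crashing the loop over PMIDs.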
If the new code doesn't work, I just added a .yml file that can be used with Anaconda to create an environment with the correct versions and packages needed to run the program.
Hi Bill,
I tried the new code and the .yml file with Anaconda (inside a Docker environment), but both error types persist. For instance, the PDFs for PMIDs 23682673, 22610656, 24628937, and 25040501 came out damaged. Please see the output below.
(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Output directory of agora_vai did not exist. Created the directory.
Trying to fetch pmid 25211280
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 25211280 succeeded
Trying to fetch pmid 26633170
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 26633170 succeeded
Trying to fetch pmid 23682673
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 23682673 succeeded
Trying to fetch pmid 25040501
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 25040501 succeeded
Trying to fetch pmid 24628937
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 24628937 succeeded
Trying to fetch pmid 27174497
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 27174497 succeeded
Trying to fetch pmid 27859194
** fetching of reprint 27859194 failed from error ('Connection aborted.', BadStatusLine("''",))
Trying to fetch pmid 22610656
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 22610656 succeeded
Trying to fetch pmid 23858657
** Reprint 23858657 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 27547345
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 27547345 succeeded
Trying to fetch pmid 24998529
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching reprint using the 'science_direct' finder...
** fetching of reprint 24998529 succeeded
Trying to fetch pmid 26482654
** fetching of reprint 26482654 failed from error ('Connection aborted.', BadStatusLine("''",))
Trying to fetch pmid 26991916
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 26991916 succeeded
Trying to fetch pmid 26742956
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in <module>
if '104' in e[0][1][0]:
IndexError: tuple index out of range
(base) root@8914b8bb01b3:/data#
I just tried to download the pdfs that came damaged, but they came fine for me: are you logged into a system that has access to the journals? I am thinking that trying to download the file while not having access to the journal may be storing a non-pdf file as the output, which looks like a corrupted PDF.
Separately, I tried to add a new quick check to fix the error you are getting on the tuple index out of range. Let me know if that fixed it.
It looks like conda environments aren't platform agnostic, so not being able to install that package should be OK if you can install the others, since it's just a dependency of the others. Perhaps running conda install libgcc could do it?
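One way to test the hypothesis above (that a non-PDF response, such as a login or paywall page, is being saved with a .pdf name) is to look at the first bytes of each downloaded file, since a real PDF carries a `%PDF-` header near its start. The sketch below is an illustration only; `looks_like_pdf` and `find_suspect_downloads` are hypothetical names, not functions from the project.

```python
import os


def looks_like_pdf(path, probe_bytes=1024):
    """Return True if the '%PDF-' header marker occurs in the first bytes."""
    with open(path, "rb") as handle:
        head = handle.read(probe_bytes)
    return b"%PDF-" in head


def find_suspect_downloads(folder):
    """List the .pdf files in `folder` whose content does not look like a PDF."""
    suspects = []
    for name in sorted(os.listdir(folder)):
        if name.lower().endswith(".pdf"):
            if not looks_like_pdf(os.path.join(folder, name)):
                suspects.append(name)
    return suspects
```

Running `find_suspect_downloads("PDF")` on the output directory would list the files worth opening in a text editor to see what was actually saved.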
Hi Bill,
I changed the network and now the PDFs are coming through fine. However, after a few PDFs have downloaded, the error appears. I am using a file with the PMIDs, and when the error shows up, I remove the problematic PMID, but then the error occurs with a different PMID.
Regarding conda environment, I've tried conda install libgcc, but it could not find the library.
Below are the error messages:
(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Output directory of PDF did not exist. Created the directory.
Trying to fetch pmid 25211280
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 25211280 succeeded
Trying to fetch pmid 26633170
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 26633170 succeeded
Trying to fetch pmid 23682673
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 23682673 succeeded
Trying to fetch pmid 25040501
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 25040501 succeeded
Trying to fetch pmid 24628937
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 24628937 succeeded
Trying to fetch pmid 27174497
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 27174497 succeeded
Trying to fetch pmid 27859194
** Reprint 27859194 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 22610656
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 22610656 succeeded
Trying to fetch pmid 23858657
** Reprint 23858657 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 27547345
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 27547345 succeeded
Trying to fetch pmid 24998529
** fetching of reprint 24998529 failed from error HTTPSConnectionPool(host='linkinghub.elsevier.com', port=443): Read timed out. (read timeout=5)
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in <module>
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()
(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 25211280 succeeded
Trying to fetch pmid 26633170
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 26633170 succeeded
Trying to fetch pmid 23682673
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 23682673 succeeded
Trying to fetch pmid 25040501
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 25040501 succeeded
Trying to fetch pmid 24628937
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 24628937 succeeded
Trying to fetch pmid 27174497
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 27174497 succeeded
Trying to fetch pmid 27859194
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in <module>
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()
(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 27859194
** Reprint 27859194 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 22610656
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 22610656 succeeded
Trying to fetch pmid 23858657
** Reprint 23858657 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 27547345
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 27547345 succeeded
Trying to fetch pmid 24998529
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching reprint using the 'science_direct' finder...
** fetching of reprint 24998529 succeeded
Trying to fetch pmid 26482654
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in <module>
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()
(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 27859194
** Reprint 27859194 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 22610656
** Reprint #22610656 already downloaded and in folder; skipping.
Trying to fetch pmid 23858657
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in <module>
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()
(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 27859194
** Reprint 27859194 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 22610656
** Reprint #22610656 already downloaded and in folder; skipping.
Trying to fetch pmid 23858657
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in <module>
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()
(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 27859194
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in <module>
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()
(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 27859194
** Reprint 27859194 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 22610656
** Reprint #22610656 already downloaded and in folder; skipping.
Trying to fetch pmid 23858657
** Reprint 23858657 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 27547345
** Reprint #27547345 already downloaded and in folder; skipping.
Trying to fetch pmid 24998529
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching reprint using the 'science_direct' finder...
** fetching of reprint 24998529 succeeded
Trying to fetch pmid 26482654
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in <module>
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()
(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 27859194
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in <module>
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()
(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 22610656
** Reprint #22610656 already downloaded and in folder; skipping.
Trying to fetch pmid 23858657
** Reprint 23858657 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 27547345
** Reprint #27547345 already downloaded and in folder; skipping.
Trying to fetch pmid 26482654
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in <module>
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()
(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 22610656
** Reprint #22610656 already downloaded and in folder; skipping.
Trying to fetch pmid 23858657
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in <module>
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()
(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 22610656
** Reprint #22610656 already downloaded and in folder; skipping.
Trying to fetch pmid 23858657
** Reprint 23858657 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 27547345
** Reprint #27547345 already downloaded and in folder; skipping.
Trying to fetch pmid 26482654
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in <module>
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()
(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 22610656
** Reprint #22610656 already downloaded and in folder; skipping.
Trying to fetch pmid 23858657
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in <module>
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()
(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 22610656
** Reprint #22610656 already downloaded and in folder; skipping.
Trying to fetch pmid 27547345
** Reprint #27547345 already downloaded and in folder; skipping.
Trying to fetch pmid 26991916
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 26991916 succeeded
Trying to fetch pmid 26742956
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in <module>
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()
After removing some PMIDs, this one generated a different error and went into a loop that I had to interrupt via keyboard.
Trying to fetch pmid 26655157
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Apologies; I handled the bug fix incorrectly. I changed it and ran some test cases, so I think it should work now (though I still can't replicate your error with the int type, so I am not 100% sure).
I fixed the infinite loop, so that shouldn't be an issue any more. Finally, I wrote a new scraper for uChicagoPress, to grab that particular pdf.
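For context on the infinite loop reported earlier in the thread, the general pattern for avoiding it is to cap the number of attempts and to stop immediately on errors that a retry cannot fix, such as an empty URL. The sketch below illustrates that pattern under stated assumptions: `fetch_with_retries` and `fetch_once` are hypothetical names, the built-in `ConnectionError` and `TimeoutError` stand in for whatever transient network errors the real script sees, and this is not the change that was actually committed.

```python
# Assumption: only these (built-in) error types are worth retrying.
RETRYABLE = (ConnectionError, TimeoutError)


def fetch_with_retries(fetch_once, pmid, max_retries=3):
    """Call fetch_once(pmid) at most max_retries times.

    Returns (succeeded, attempts_made). Transient errors are retried until
    the cap is reached; any other error ends the attempts immediately, so
    the loop can never run forever on a permanent failure.
    """
    for attempt in range(1, max_retries + 1):
        try:
            fetch_once(pmid)
            return True, attempt
        except RETRYABLE:
            continue  # transient: try again while attempts remain
        except Exception:
            return False, attempt  # permanent (e.g. invalid URL): give up
    return False, max_retries
```

The cap plays the same role as the `-maxRetries 3` option used in the commands above: the worst case is a fixed number of attempts per PMID before moving on to the next one.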
Hi Bill,
Sorry for the slow response, I was not able to test the code these days. I did run a short test and it seems that everything is working fine now. Many thanks for this!
I will close this issue now.