Giter Site home page Giter Site logo

Comments (10)

billgreenwald avatar billgreenwald commented on July 20, 2024

From the log, do you know which pmid was running when the error happened?

from pubmed-batch-download.

nicolaycunha avatar nicolaycunha commented on July 20, 2024

below is the log output

*Trying to fetch pmid 26633170
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 26633170 succeeded
Trying to fetch pmid 23682673
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 23682673 succeeded
Trying to fetch pmid 25040501
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 25040501 succeeded
Trying to fetch pmid 24628937
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 24628937 succeeded
Trying to fetch pmid 27174497
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 27174497 succeeded
Trying to fetch pmid 27547345
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 27547345 succeeded
Trying to fetch pmid 22610656
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 22610656 succeeded
Trying to fetch pmid 23858657
** Reprint 23858657 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 24998529
** Reprint 24998529 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 27859194
** Reprint 27859194 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 26991916
** Reprint 26991916 cannot be fetched as pubmed does not have a link to its pdf.

Here is the PMID

Trying to fetch pmid 26742956
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in
if type(e)==requests.ConnectionError and '104' in e[0][1][0]:
TypeError: argument of type 'int' is not iterable
*

from pubmed-batch-download.

billgreenwald avatar billgreenwald commented on July 20, 2024

The error handles fine for me, but I added better handling just in case. Can you give it a try? Also, what version of python and requests are you running?

Side note: I could fetch 24998529 and 26991916...not sure why yours is giving you that message. If you are familiar with python enough to add a print statement or two in specific places, let me know and we can debug that on your end.

from pubmed-batch-download.

billgreenwald avatar billgreenwald commented on July 20, 2024

If the new code doesnt work, I just added a .yml with that can be used with anaconda to create an environment with the correct versions and packages needed to run the program.

from pubmed-batch-download.

nicolaycunha avatar nicolaycunha commented on July 20, 2024

Hi Bill,

I tried the new code and the .yml file with anaconda (but used it in a docker environment), however both error types persist, for instance the PMIDs 23682673, 22610656, 24628937, 25040501 came damaged. Please see below the output.


(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3 
Output directory of agora_vai did not exist.  Created the directory.
Trying to fetch pmid 25211280
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 25211280 succeeded
Trying to fetch pmid 26633170
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 26633170 succeeded
Trying to fetch pmid 23682673
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 23682673 succeeded
Trying to fetch pmid 25040501
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 25040501 succeeded
Trying to fetch pmid 24628937
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 24628937 succeeded
Trying to fetch pmid 27174497
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 27174497 succeeded
Trying to fetch pmid 27859194
** fetching of reprint 27859194 failed from error ('Connection aborted.', BadStatusLine("''",))
Trying to fetch pmid 22610656
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 22610656 succeeded
Trying to fetch pmid 23858657
 ** Reprint 23858657 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 27547345
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 27547345 succeeded
Trying to fetch pmid 24998529
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching reprint using the 'science_direct' finder...
** fetching of reprint 24998529 succeeded
Trying to fetch pmid 26482654
** fetching of reprint 26482654 failed from error ('Connection aborted.', BadStatusLine("''",))
Trying to fetch pmid 26991916
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 26991916 succeeded
Trying to fetch pmid 26742956
Traceback (most recent call last):
  File "fetch_pdfs.py", line 252, in <module>
    if '104' in e[0][1][0]:
IndexError: tuple index out of range
(base) root@8914b8bb01b3:/data# 



from pubmed-batch-download.

billgreenwald avatar billgreenwald commented on July 20, 2024

I just tried to download the pdfs that came damaged, but they came fine for me: are you logged into a system that has access to the journals? I am thinking that trying to download the file while not having access to the journal may be storing a non-pdf file as the output, which looks like a corrupted PDF.

Separately, I tried to add a new quick check to fix the error you are getting on the tuple index out of range. Let me know if that fixed it.

It looks like conda environments aren't platform agnostic, so not being able to install that package should be ok if you install the others, since its just a dependency needed for the others. Perhaps running conda install libgcc could do it?

from pubmed-batch-download.

nicolaycunha avatar nicolaycunha commented on July 20, 2024

Hi Bill,

I changed the network and now the pdfs are coming fine. However, after a few pdfs downloaded, the error appears. I am using a file with the PMIDs, and when error shows up, I remove the problematic PMID, but then the error occurs with a different PMID.

Regarding conda environment, I've tried conda install libgcc, but it could not find the library.

Below the error messages:

(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Output directory of PDF did not exist. Created the directory.
Trying to fetch pmid 25211280
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 25211280 succeeded
Trying to fetch pmid 26633170
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 26633170 succeeded
Trying to fetch pmid 23682673
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 23682673 succeeded
Trying to fetch pmid 25040501
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 25040501 succeeded
Trying to fetch pmid 24628937
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 24628937 succeeded
Trying to fetch pmid 27174497
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 27174497 succeeded
Trying to fetch pmid 27859194
** Reprint 27859194 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 22610656
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 22610656 succeeded
Trying to fetch pmid 23858657
** Reprint 23858657 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 27547345
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 27547345 succeeded
Trying to fetch pmid 24998529
**** fetching of reprint 24998529 failed from error HTTPSConnectionPool(host='linkinghub.elsevier.com', port=443): Read timed out. (read timeout=5)
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in
if len(e) >=3 and '104' in e[0][1][0]:

TypeError: object of type 'ConnectionError' has no len()**
(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 25211280 succeeded
Trying to fetch pmid 26633170
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 26633170 succeeded
Trying to fetch pmid 23682673
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 23682673 succeeded
Trying to fetch pmid 25040501
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 25040501 succeeded
Trying to fetch pmid 24628937
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 24628937 succeeded
Trying to fetch pmid 27174497
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 27174497 succeeded
Trying to fetch pmid 27859194
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()

(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 27859194
** Reprint 27859194 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 22610656
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 22610656 succeeded
Trying to fetch pmid 23858657
** Reprint 23858657 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 27547345
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 27547345 succeeded
Trying to fetch pmid 24998529
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching reprint using the 'science_direct' finder...
** fetching of reprint 24998529 succeeded
Trying to fetch pmid 26482654
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()

(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 27859194
** Reprint 27859194 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 22610656
** Reprint #22610656 already downloaded and in folder; skipping.
Trying to fetch pmid 23858657
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()

(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 27859194
** Reprint 27859194 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 22610656
** Reprint #22610656 already downloaded and in folder; skipping.
Trying to fetch pmid 23858657
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()

(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 27859194
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()

(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 27859194
** Reprint 27859194 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 22610656
** Reprint #22610656 already downloaded and in folder; skipping.
Trying to fetch pmid 23858657
** Reprint 23858657 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 27547345
** Reprint #27547345 already downloaded and in folder; skipping.
Trying to fetch pmid 24998529
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching reprint using the 'science_direct' finder...
** fetching of reprint 24998529 succeeded
Trying to fetch pmid 26482654
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()

(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 27859194
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()

(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 22610656
** Reprint #22610656 already downloaded and in folder; skipping.
Trying to fetch pmid 23858657
** Reprint 23858657 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 27547345
** Reprint #27547345 already downloaded and in folder; skipping.
Trying to fetch pmid 26482654
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()

(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 22610656
** Reprint #22610656 already downloaded and in folder; skipping.
Trying to fetch pmid 23858657
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()

(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 22610656
** Reprint #22610656 already downloaded and in folder; skipping.
Trying to fetch pmid 23858657
** Reprint 23858657 cannot be fetched as pubmed does not have a link to its pdf.
Trying to fetch pmid 27547345
** Reprint #27547345 already downloaded and in folder; skipping.
Trying to fetch pmid 26482654
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()

(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 22610656
** Reprint #22610656 already downloaded and in folder; skipping.
Trying to fetch pmid 23858657
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in
if len(e) >=3 and '104' in e[0][1][0]:

TypeError: object of type 'ConnectionError' has no len()
(base) root@8914b8bb01b3:/data# python fetch_pdfs.py -pmf PMID_all.txt -out PDF -maxRetries 3
Trying to fetch pmid 25211280
** Reprint #25211280 already downloaded and in folder; skipping.
Trying to fetch pmid 26633170
** Reprint #26633170 already downloaded and in folder; skipping.
Trying to fetch pmid 23682673
** Reprint #23682673 already downloaded and in folder; skipping.
Trying to fetch pmid 25040501
** Reprint #25040501 already downloaded and in folder; skipping.
Trying to fetch pmid 24628937
** Reprint #24628937 already downloaded and in folder; skipping.
Trying to fetch pmid 27174497
** Reprint #27174497 already downloaded and in folder; skipping.
Trying to fetch pmid 22610656
** Reprint #22610656 already downloaded and in folder; skipping.
Trying to fetch pmid 27547345
** Reprint #27547345 already downloaded and in folder; skipping.
Trying to fetch pmid 26991916
Trying genericCitationLabelled
** fetching reprint using the 'generic citation labelled' finder...
** fetching of reprint 26991916 succeeded
Trying to fetch pmid 26742956
Traceback (most recent call last):
File "fetch_pdfs.py", line 252, in
if len(e) >=3 and '104' in e[0][1][0]:
TypeError: object of type 'ConnectionError' has no len()

from pubmed-batch-download.

nicolaycunha avatar nicolaycunha commented on July 20, 2024

After removing some PMIDs, this one generated a different error and went into a loop that I had to interrupt via keyboard.

Trying to fetch pmid 26655157
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?
Trying genericCitationLabelled
Trying pubmed_central
Trying science_direct
** fetching of reprint 26655157 failed from error Invalid URL '': No schema supplied. Perhaps you meant http://?

from pubmed-batch-download.

billgreenwald avatar billgreenwald commented on July 20, 2024

Apologies; I handled the bug fix incorrectly. I changed it and ran some test cases, which I think should work (though i still can't replicate your error with the int type, so I am not 100% sure).

I fixed the infinite loop, so that shouldn't be an issue any more. Finally, I wrote a new scraper for uChicagoPress, to grab that particular pdf.

from pubmed-batch-download.

nicolaycunha avatar nicolaycunha commented on July 20, 2024

Hi Bill,
Sorry for the slow response, I was not able to test the code these days. I did run a short test and it seems that everything is working fine now. Many thanks for this!
I will close this issue now.

from pubmed-batch-download.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.