Comments (2)
MS review:
Test 1: Scanned 10 publications related to the topic "windpower":
- 7/10 properly downloaded
- 3 are PDF, 4 are non-PDF
- Out of the 3 not successfully downloaded, two come from IEEE LATIN AMERICA TRANSACTIONS, one from INTERNATIONAL JOURNAL OF RENEWABLE ENERGY RESEARCH.
- One document downloaded file has no file name - called pdf.
- CSV looks like the output in R
Test 2: Scanned 10 publications related to the topic "drones conservation:
- 9/10 successfully downloaded
- 1 not successfully downloaded is derived from EUROPEAN RESEARCH CONSORTIUM INFORMATICS & MATHEMATICS. This pdf not easily accessible (not on google scholar)
- All chrome documents
- CSV looks like the output in R
Test 3: Scanned 10 publications related to the topic of "Yellowstone National Park":
- 9/10 successfully downloaded
- 1 not successfully downloaded is derived from JOURNAL OF PARK AND RECREATION ADMINISTRATION.
- None of the downloaded publications are pdfs - all are chrome links, most open a blank web page.
- CSV looks like the output in R
When the input bib file is stored in the "outdir" folder, all previous input files, including the bib file used to perform the download, are overwritten by the output files. Should fix or may consider placing a check to ensure input bib files and R outputs are kept separate.
from bibscan.
If you remove line 117 crminer::crm_cache$delete_all()
the .bib file stays in the folder, however it also leaves a weird unopenable .pdf in the folder. Working on figuring this out
- If you repeat this process (with all the original files from the previous run), it crashes everything and creates 5-6 unopenable pdf files. It also deletes the .bib file.
from bibscan.
Related Issues (19)
- Doesn't filter out selected papers from Colandr
- getting a 403 with http://www.jswconline.org/
- Dependencies not loading HOT 3
- dirname error HOT 3
- Error in "select" function HOT 1
- parsing failure HOT 2
- Other package dependencies HOT 2
- Improve PDF filenames from the publisher's default HOT 2
- Further investigate the discrepancies between downloads
- issue with Dillon bib file HOT 2
- The installations are really long.
- Low Retrieval Rate HOT 2
- PLOS One article are returned as html
- modularize the article_pdf_download function
- harmonize styling
- add travis
- Add test about parameter passed and units testing
- `//` in the path of downloaded files
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from bibscan.