
caltechlibrary / pubarchiver


Package up microPublication.org and other journals for archiving into Portico and PMC

License: Other

Python 69.59% Shell 23.23% Makefile 7.18%
portico datacite datacite-metadata archiving archive journal jats jats-xml preservation pmc

pubarchiver's Issues

Add consistency checks

A simple implementation would be to periodically ask the journal API for the list of what's available and compare that to the recorded uploads, then email someone if there's a discrepancy.
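The comparison step could look roughly like the sketch below. It assumes both the journal's list and the upload record can be reduced to plain lists of DOIs. The function names, the dictionary keys, and the email addresses are all hypothetical and are not part of the existing code. The sketch only builds the message; fetching the list from the journal API and actually sending the mail are left out.

```python
from email.message import EmailMessage


def find_discrepancies(available_dois, recorded_dois):
    """Compare what the journal lists against what was recorded as uploaded."""
    available = set(available_dois)
    recorded = set(recorded_dois)
    return {
        # Listed by the journal but never recorded as uploaded.
        "not_uploaded": sorted(available - recorded),
        # Recorded as uploaded but no longer listed by the journal.
        "not_in_journal": sorted(recorded - available),
    }


def build_alert(discrepancies, sender, recipient):
    """Return an EmailMessage describing the discrepancies, or None if there are none."""
    if not any(discrepancies.values()):
        return None
    lines = []
    for kind, dois in discrepancies.items():
        for doi in dois:
            lines.append(f"{kind}: {doi}")
    msg = EmailMessage()
    msg["Subject"] = "Archive consistency check found discrepancies"
    msg["From"] = sender
    msg["To"] = recipient
    msg.set_content("\n".join(lines))
    return msg
```

A periodic job (cron, for example) could call `find_discrepancies()` with the two lists and hand any resulting message to `smtplib`.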

"No content" returns from datacite not handled gracefully

If datacite.org returns no content for a given URL, you get an exception:

microarchiver messages.py error(): No content found for https://api.datacite.org/dois/10.17912/micropub.biology.000167
Traceback (most recent call last):
  File "/home/mhucka/system/lib/python3.6/site-packages/microarchiver/__main__.py", line 186, in main
    MainBody(source, after, output_dir, do_zip, report, get_xml, preview, say).run()
  File "/home/mhucka/system/lib/python3.6/site-packages/microarchiver/__main__.py", line 283, in run
    self.write_articles(dest_dir, articles)
  File "/home/mhucka/system/lib/python3.6/site-packages/microarchiver/__main__.py", line 394, in write_articles
    xml = self._metadata_xml(article)
  File "/home/mhucka/system/lib/python3.6/site-packages/microarchiver/__main__.py", line 421, in _metadata_xml
    raise error
microarchiver.exceptions.NoContent: No content found for https://api.datacite.org/dois/10.17912/micropub.biology.000167

This should be handled more gracefully.
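One possible approach, sketched below, is to catch the exception per article, log a warning, and carry on with the remaining articles. This is only an illustration of the idea, not the actual fix. The `NoContent` class here is a local stand-in for `microarchiver.exceptions.NoContent` from the traceback, and `collect_metadata`, `fetch_xml`, and `warn` are hypothetical names.

```python
class NoContent(Exception):
    """Local stand-in for microarchiver.exceptions.NoContent (see traceback)."""


def collect_metadata(articles, fetch_xml, warn=print):
    """Fetch metadata for each article, skipping those with no content.

    `fetch_xml` is any callable that returns XML for an article or raises
    NoContent. Returns (fetched, skipped), where `fetched` is a list of
    (article, xml) pairs and `skipped` lists the articles that had no content.
    """
    fetched, skipped = [], []
    for article in articles:
        try:
            xml = fetch_xml(article)
        except NoContent as ex:
            # Report the problem and move on instead of aborting the whole run.
            warn(f"Skipping {article}: {ex}")
            skipped.append(article)
            continue
        fetched.append((article, xml))
    return fetched, skipped
```

The skipped articles could then be listed in the report so that someone can follow up on them later.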

Add alternative report formats

Request via email:

Would it be possible to have a different output or output format so that I can find information in the attachment without having to launch another app? Currently, when I click on the report, the columns are too narrow to see the specific article DOI value, so I have to open it in Sheets or scroll over to see the link. While it seems minor, I am logging these data in another sheet so I can track the article through to PubMed upload, which means I'm entering things on another sheet. The less mousing I have to do, the better.
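One option would be a plain-text report with columns sized to the longest value, so the full DOI is readable in the email body itself. The sketch below is a rough idea only. It assumes each report row can be represented as a dictionary, and the function name and the column names used in the example (`doi`, `status`) are hypothetical, not the current report's actual fields.

```python
def format_report_text(rows, columns):
    """Render report rows as a plain-text table with full-width columns."""
    # Size each column to its longest value so nothing is cut off.
    widths = {}
    for col in columns:
        lengths = [len(col)] + [len(str(row.get(col, ""))) for row in rows]
        widths[col] = max(lengths)

    def render(values):
        cells = [str(v).ljust(widths[c]) for c, v in zip(columns, values)]
        return "  ".join(cells).rstrip()

    lines = [render(columns), render(["-" * widths[c] for c in columns])]
    for row in rows:
        lines.append(render([row.get(c, "") for c in columns]))
    return "\n".join(lines)


if __name__ == "__main__":
    example = [{"doi": "10.17912/micropub.biology.000167", "status": "missing"}]
    print(format_report_text(example, ["doi", "status"]))
```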
