Giter Site home page Giter Site logo

dhimmel / pubmedpy Goto Github PK

View Code? Open in Web Editor NEW
9.0 2.0 2.0 214 KB

Utilities for interacting with NCBI EUtilities relating to PubMed

Home Page: https://pypi.org/project/pubmedpy/

License: Other

Python 100.00%
pubmed eutilities ncbi bibliometrics python

pubmedpy's Introduction

Utilities for interacting with NCBI EUtilities relating to PubMed

documentation pubmedpy on PyPI

pubmedpy is a Python 3 package for interacting with PubMed's E-Utilities API as well the PubMed Central OAI-PMH Service

Projects

This package was created while working on https://github.com/greenelab/iscb-diversity. However, it draws on code and experience from several pre-existing projects. Parts of the codebase derives from or supported the analyses for the following analyses:

  1. Analysis of ISCB honorees and keynotes reveals disparities
    Trang T. Le, Daniel S. Himmelstein, Ariel A. Hippen Anderson, Matthew R. Gazzara, Casey S. Greene
    Manubot (2020-01-30) https://greenelab.github.io/iscb-diversity-manuscript/

  2. On author versus numeric citation styles
    Daniel Himmelstein
    Satoshi Village (2018-03-12) https://blog.dhimmel.com/citation-styles/

  3. The history of publishing delays
    Daniel Himmelstein
    Satoshi Village (2016-02-10) https://blog.dhimmel.com/history-of-delays/

  4. Publication delays at PLOS and 3,475 other journals
    Daniel Himmelstein
    Satoshi Village (2015-06-29) https://blog.dhimmel.com/plos-and-publishing-delays/

pubmedpy's People

Contributors

dhimmel avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

pubmedpy's Issues

Empty affiliation strings from PMCID

I'm trying to extract affiliation information from PMCID. For example, for PMC6986235, I tried the following:

from lxml.etree import tostring

art = get_frontmatter_etree_via_api('PMC6986235')
print(tostring(art, encoding = 'unicode'))

Part of the output contains the affiliation of the corresponding author:

<aff id="A1">Georgetown University, Department of Oncology and Lombardi
Comprehensive Cancer Center, Washington, DC, 20007.</aff>

However, when I tried

extract_authors_from_article(art)

all affiliations is empty:

[{'pmcid': 'PMC6986235',
  'position': 1,
  'fore_name': 'Ziling',
  'last_name': 'Fan',
  'corresponding': 0,
  'reverse_position': 3,
  'affiliations': []},
 {'pmcid': 'PMC6986235',
  'position': 2,
  'fore_name': 'Yuan',
  'last_name': 'Zhou',
  'corresponding': 0,
  'reverse_position': 2,
  'affiliations': []},
 {'pmcid': 'PMC6986235',
  'position': 3,
  'fore_name': 'Habtom W.',
  'last_name': 'Ressom',
  'corresponding': 0,
  'reverse_position': 1,
  'affiliations': []}]

It is possible that we can't extract this information because of the way journals deposited the metadata. I just wanted to make sure that there is not a better alternative than skipping these articles entirely.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.