pmparser

pmparser provides a simple interface for extracting various elements from the publicly available PubMed/MEDLINE XML files, incorporating PubMed’s regular updates, and combining the data with the NIH Open Citation Collection.

Papers

pmparser and PMDB: resources for large-scale, open studies of the biomedical literature, Schoenbachler and Hughey, bioRxiv