10.3 years ago
bwio
I have a list of PMIDs and I want to write a simple scraper script that finds the URL of the corresponding PDF for each paper and downloads it. As far as I can see, a direct URL to the PDF file is not exposed in the PubMed API.
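For context, here is a minimal sketch of what the E-utilities `esummary` endpoint does give you for a PMID: article identifiers such as the DOI, but no PDF link. The helper names `esummary_url` and `extract_doi` are made up for illustration, and the JSON layout (`result` -> PMID -> `articleids`) is my assumption about the response shape, so check it against a real response before relying on it.

```python
from urllib.parse import urlencode

# Base URL of the NCBI E-utilities esummary endpoint.
ESUMMARY = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esummary.fcgi"


def esummary_url(pmids):
    """Build an esummary request URL for a list of PMIDs (hypothetical helper)."""
    params = {
        "db": "pubmed",
        "id": ",".join(str(p) for p in pmids),
        "retmode": "json",
    }
    return ESUMMARY + "?" + urlencode(params)


def extract_doi(summary, pmid):
    """Return the DOI for one PMID from a parsed esummary JSON dict, or None.

    Assumes the layout result -> <pmid> -> articleids -> [{idtype, value}].
    """
    record = summary.get("result", {}).get(str(pmid), {})
    for article_id in record.get("articleids", []):
        if article_id.get("idtype") == "doi":
            return article_id.get("value")
    return None
```

The parsing step is kept separate from the HTTP request so it can be tried on a saved response without hitting the NCBI servers.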
There is a service called thepaperlink that does exactly what I want, but I don't want to build my scripts around a third-party database. They claim that their service is built around eutils, but I don't see how this is possible.
Sorry, I didn't find any duplicates before posting. It looks like you can follow the DOI to the publisher's website, but you have to scrape the link to the PDF via regex/heuristics:
https://github.com/elfar/PubmedPDF/blob/master/pdfetch.rb
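A rough sketch of that kind of heuristic in Python, not a port of the Ruby script above. It assumes you have already fetched the publisher's landing page (for example by resolving `https://doi.org/<DOI>`) and have its HTML as a string. It first looks for a `citation_pdf_url` meta tag, which many publishers include for Google Scholar indexing, and otherwise falls back to the first link whose `href` looks like a PDF. The names `PdfLinkFinder` and `find_pdf_url` are made up for illustration, and many publisher sites will need their own special cases.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin


class PdfLinkFinder(HTMLParser):
    """Collect candidate PDF links from a landing page (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.meta_pdf = None    # content of <meta name="citation_pdf_url">
        self.anchor_pdfs = []   # hrefs of <a> tags that look like PDFs

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta":
            if (attrs.get("name") or "").lower() == "citation_pdf_url":
                # Keep the first such tag if there are several.
                self.meta_pdf = self.meta_pdf or attrs.get("content")
        elif tag == "a":
            href = attrs.get("href") or ""
            # Heuristic: path ends in .pdf, optionally followed by ?query or #fragment.
            if re.search(r"\.pdf($|[?#])", href, re.IGNORECASE):
                self.anchor_pdfs.append(href)


def find_pdf_url(html, base_url):
    """Return an absolute PDF URL guessed from the page HTML, or None."""
    finder = PdfLinkFinder()
    finder.feed(html)
    candidate = finder.meta_pdf
    if not candidate and finder.anchor_pdfs:
        candidate = finder.anchor_pdfs[0]
    # Resolve relative links against the landing page URL.
    return urljoin(base_url, candidate) if candidate else None
```

Usage would be something like `find_pdf_url(page_html, landing_page_url)`, where both arguments come from whatever HTTP client you use to follow the DOI redirect. Pages that build the PDF link with JavaScript or hide it behind a login will return `None` here.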