https://collections.nlm.nih.gov/

Query Result Downloaded
tuskegee 763 132
racism 540

Is it reasonable to continue here, or do we already have a complete dump somewhere?

What would be the best way of going about downloading these? Is there a machine-parseable index?

There is a query service at https://collections.nlm.nih.gov/web_service . The script by @Micr0Byte at academia-preserver/nlm_nih_downloader at main - antifascistDH/academia-preserver - Codeberg.org takes care of doing and parsing the query, but we don’t pull all the possible file formats yet. And I’m sure there is an easier way to do this, than my hacky way of grabbing the redirect url of a document.

1 Like