https://archive.cdc.gov/

Everything with “alzheimers”

Will start a run now. That’s a short one.

We also need:

  • “transgender” (230)
  • “intersex” (59)
  • “non-binary” (29)
  • “LGBT” (128)

I grabbed all the mentioned keywords.

Is this obsolete, and can it be archived? (This relates to @schoeneh's chat message from yesterday.)

Yes, I will move the thread to Already Safeguarded :white_check_mark:

Here is a mirror of the datasets from the CDC’s data catalogue: data.cdc.gov

For convenience, here is an archive; please download a copy: https://river.styx.org/safeguarding/data.cdc.gov.tar.gz

Metadata is thin, but archive.org has it, so in principle it can be reconstructed.
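
If you grab a copy, it may be worth checking it before re-hosting it. Here is a minimal sketch, assuming the tarball has already been downloaded to a local file named `data.cdc.gov.tar.gz` (the local filename and the helper names `sha256_of` and `summarize_archive` are my own, not part of the mirror). It counts the regular files and uncompressed bytes without extracting anything, and prints a SHA-256 digest that mirrors can compare with each other.

```python
import hashlib
import tarfile
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def summarize_archive(path):
    """Count regular files and total uncompressed bytes without extracting."""
    n_files = 0
    total_bytes = 0
    with tarfile.open(path, "r:gz") as tar:
        for member in tar:
            if member.isfile():
                n_files += 1
                total_bytes += member.size
    return n_files, total_bytes


if __name__ == "__main__":
    # Assumed local filename for the downloaded archive (hypothetical).
    archive = Path("data.cdc.gov.tar.gz")
    if archive.exists():
        files, size = summarize_archive(archive)
        print(f"{files} files, {size} bytes uncompressed")
        print("sha256:", sha256_of(archive))
```

No reference checksum has been published in this thread, so the digest is only useful for comparing copies between mirrors.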

This is not all of the CDC datasets, just the ones served from their generic endpoint hooked up behind the data catalogue.

Someone appears to have put a more complete data archive here: CDC datasets uploaded before January 28th, 2025 (Internet Archive)

I have it mirrored here: 20250128-cdc-datasets and am helping seed the torrent.

I just popped in to share that link. The isolated datasets are a gem.