The SGIM Dataset Compendium is a curated reference for investigators conducting research with existing datasets, with an emphasis on health services research, clinical epidemiology, and medical education. A project of the SGIM Research Committee, the compendium has been maintained and updated by volunteer dataset experts since 2008.

What's included

  • Public datasets with detailed descriptions and expert-contributed notes on strengths, limitations, and practical considerations
  • Guidance on working with secondary data
  • Links to related repositories and resources

A note on using this resource: Entries are written by investigators with direct experience using each dataset. Users are encouraged to browse across sections rather than searching for a single dataset, as related resources often offer complementary strengths.

✉️ Questions? Contact info@sgim.org

Search & Filters

Search for

Topic