COVID-19 Data Set

The Starschema COVID-19 data set collates a range of important resources for assessing the impact, severity and response to the COVID-19 pandemic. The data is stored in Snowflake for ease of access. Detailed information about what is in the data set is available on the project's github repository. The METADATA table in the Snowflake Data Exchange share contains detailed column-level information about the tables that comprise the share.

A range of data sets have been published that are useful for monitoring and understanding the spread of COVID-19. Our efforts are intended to collate, curate and unify the most valuable data sources for enterprises, individuals and public health experts to assess the situation and make data-driven decisions. This single-source easily blends with other data sources so you can analyze the movement of the SARS-CoV-2 pandemic over time, in any context.

Covid19 image 4

Currently added data sets include

NameSourceTable name
US COVID-19 testing and mortalityThe COVID Tracking ProjectCT_US_COVID_TESTS
Global data on healthcare providersOpenStreetMap, via Healthsites.ioHS_BULK_DATA
Global case countsJHU CSSEJHU_COVID_19
US healthcare capacity by state, 2018The Henry J. Kaiser Family FoundationKFF_HCP_CAPACITY
US policy actions by stateThe Henry J. Kaiser Family FoundationKFF_US_POLICY_ACTIONS
US actions to mitigate spread, by stateThe Henry J. Kaiser Family FoundationKFF_US_STATE_MITIGATIONS
ICU beds by county, USThe Henry J. Kaiser Family FoundationKFF_US_ICU_BEDS
Italy case statistics, summaryProtezione CivilePCM_DPS_COVID19
Italy case statistics, detailedProtezione CivilePCM_DPS_COVID19_DETAILS
WHO situation reportsWorld Health OrganizationWHO_SITUATION_REPORTS

The COVID-19 data set enables enterprises, individuals and public health authorities to make data-driven decisions. Using the data on local case counts, enterprises can monitor the integrity of their supply chains and anticipate disruptions. Public health authorities can track the spread of COVID-19 and use the data to support public health measures such as school closures and estimate the relative risk of incidence in their region. Presented in an analytics-ready format and diligently maintained, the data set provides a single source of truth to enable decision-making based on the most reliable and accurate data available.

Request access to the free, public Starschema COVID-19 incidence data set on the Snowflake Data Exchange here.

Special thanks to our partners Snowflake, Tableau, Mapbox, Path, and Datablick for collaborating on this project.

This website uses cookies

To provide you with the best possible experience on our website, we may use cookies, as described here. By clicking accept, closing this banner, or continuing to browse our websites, you consent to the use of such cookies.

I agree