Content
Data
Open Access Data
Open access and datasets from U.S. government agencies are available for download. To locate data:
- Start with the Repositories for Open Data and Data.gov
- Determine the agency that works with subject specific data and use the Data tab on the site or the search box on the agency main page to locate data sets. Also try a Google search for specific sets. (Search USA.gov by research topic to determine the agency that is working with specific data)
- Review the Data Catalog sites below for additional data
- See the Subject tabs on this site
- Review agency repositories for federally funded research
- See the Google Dataset Search
- *Review the Data Rescue Project site if data is not available. See Current Efforts and Resources
- See the Open Data Handbook for a definition of open data
- Data.gov
Portal for 200,000 open-access federal government data sets, some state and local. Multiple filter options, range of formats for download.
- Science.gov
Gateway to U.S. federal science. Search across 60 databases and over 2,200 scientific websites. Limit search results to the Public Access category which enables searching of peer-reviewed scholarly publications (journal articles and accepted manuscripts) resulting from federally funded scientific research. There are currently 15 federal repositories supporting public access.
- Agency websites
Identify the agency in your research area using USA.gov. Search the data tab in agency menus or use the search box on the main page for data by topic.
- Google Dataset Search
Data & Website Rescue Efforts
Website Rescue Portals
- End of Term Web Archive
Captures and saves U.S. Government websites at the end of presidential administrations. The EOT has thus far preserved websites from administration changes in 2008, 2012, 2016, and 2020.
- Wayback Machine - by the Internet Archive
Explore more than 928 billion web pages saved over time. Install the Official Wayback Machine Extension to easily save websites, view missing 404 Not Found pages, or read archived books and papers. Internet Archive.
- Archive of Data.gov
Files in this repository were collected intermittently between 2024-11-19 and 2025-02-06. Beginning on 2025-02-06, the repository will be updated daily.
- Data Rescue Project
The Data Rescue Project is a coordinated effort among a group of data organizations, including IASSIST, RDAP, and members of the Data Curation Network. Serves as a clearinghouse for data rescue-related efforts and data access points for public US governmental data that are currently at risk. See what data is currently being saved with their new Data Rescue Tracker. Follow on Data Rescue Project - Bluesky Social.
- DataLumos
DataLumos is an ICPSR archive for valuable government data resources. ICPSR has a long commitment to safekeeping and disseminating US government and other social science data. They have an older version of many major datasets, including some from the CDC. This is the main repository for Data Rescue Project's data with added data from FEMA, the Department of Education, and IMLS.
- Find Lost Data
This search tool provides links for data that have been downloaded from federal, state, and local agencies. Boston University School of Public Health.
- ICPSR
Maintains a data archive of more than 350,000 files of research in the social and behavioral sciences. It hosts 23 specialized collections of data in education, aging, criminal justice, substance abuse, terrorism, and other fields. The Inter-university Consortium for Political and Social Research (ICPSR) is an American political science and social science research consortium, based at the University of Michigan.
- IPUMS
The IPUMS Center for Data Integration, based at the University of Minnesota, provides census and survey data from around the world. Find microdata from the U.S. Decennial Census, the American Community Survey (ACS), and the Current Population Survey. As part of their standard procedures, they download and preserve original data from U.S. statistical agencies that serve as the source data for IPUMS. Since Fri., Jan. 31, 2025, several organizations (and individuals) have downloaded many other public federal datasets. There are efforts underway to catalog and make these data available.
- GovTrack
Tracks the activities of the United States Congress.
- Archive-it: Federal Depository Library Program Web Archive
The Federal Depository Library Program (FDLP) Web Archive. Harvested by Library Services and Content Management (LSCM) and U.S. Government Publishing Office (GPO). The FDLP was established by Congress in 1895 to provide free, permanent public access to federal government information. GPO administers the FDLP on behalf of the participating libraries and the American people. To provide permanent public access to federal agency web content, the FDLP harvests selected U.S. Government web sites in their entirety. Access to these sites is made available through links in the online public access catalog, the Catalog of U.S. Government Publications.
- Climate & Economic Justice Screening Tool
The tool has an interactive map and uses datasets that are indicators of burdens communities experience in eight categories: climate change, energy, health, housing, legacy pollution, transportation, water and wastewater, and workforce development.
- Dryad
An open data publishing platform and a community committed to the open availability and routine re-use of all research data. Generalist repository available to help with data publication, storage, and preservation.
- Roper Center
Has collected over 50,000 files (datasets and documentation) from 22 federal survey projects. Efforts to this point have been focused on acquiring the files and ensuring backup copies are preserved on multiple servers. Cornell University.
- Data Liberation Project
An initiative to identify, obtain, reformat, clean, document, publish, and disseminate government datasets of public interest.
- Big Local News
Gathers data, builds tools and collaborates with reporters to provide access to public records. Some projects have been archived by the Stanford Digital Repository, which ensures their long-term preservation.
- UChicago Data Mirror
This platform provides convenient access to public datasets that are frequently used in academic research and education at the University of Chicago. University of Chicago Library and Data Science Institute.
- MuckRock
Provides a repository of hundreds of thousands of pages of original government materials, information on how to file FOIA requests and tools to make the requesting process easier. Fill out their form to suggest FOIAs based on missing data sets.
- r/Data Hoarder
A reddit community that is coordinating efforts to rescue data.
- Safeguarding Research
Based in the EU, USA, and globally - this initiative has access to 1-2 PB (and more on the way) of storage & people willing to seed. Have several large-scale efforts, including a 350GB web archive of CDC, including all 30.000 files from archive.cdc.gov and much more. There is a forum you can join.
- Archive Team
An offloading point and information depot for a number of archiving projects, all related to saving websites or data that is in danger of being lost, including archiving the federal government. They maintain US Government data page.
- Data Hoarding
An index of resources and archives related to data hoarding, web archival and self hosting. It was inspired by the recent purge of online information by government agencies, corporations and others, and aims to provide easier access to tools and information.
- IMLS
Direct link to Institute of Museum and Library Services (IMLS) archived data in DataLumos.
- Digital Government Hub
A dynamic, open-source reference library for anyone using design, data, and technology to improve and enhance government service delivery.
- Webrecorder US Government Web Archive
Has archived a large number of government sites both independently and as one of the End of Term Archive partners. Webrecorder US Government Web Archive is web archiving with Browsertrix.
- Zenodo
An open dissemination research data repository for the preservation and making available of research, educational and informational content. Access to Zenodo’s content is open to all, for non-military purposes only. CERN.
Education
- Education Resources Information Center (ERIC) Archive
An internet-based digital library of education research and information sponsored by the Institute of Education Sciences (IES) of the U.S. Department of Education. ERIC provides access to bibliographic records of journal and non-journal literature from 1966 to the present. Internet Archive.
- ERICA
A rescue catalog which preserves over 500,000 publications originally hosted by the U.S. Department of Education in the ERIC research repository. ERIC was an open access database of more than 2 million documents dating back to the 1960s. It ceased operating April 23, 2025. Education Resources Information Center Archive (ERICA).
- Journals No Longer Being Indexed By ERIC
Crowdsourcing effort to figure out which journals are being removed from ERIC. From Michigan State University.
- Department of Education
Direct link to Department of Education archived data in DataLumos.
- Integrated Postsecondary Education Data System (IPEDS) Series
Complete data files from 1980 to 2023. Includes data file, STATA data file, SPSS program, SAS program, STATA program, and dictionary. The IPEDS surveys most postsecondary institutions annually, including universities and colleges, as well as institutions offering technical and vocational education beyond the high school level. Allows users to compare the characteristics of different postsecondary institutions. Older IPEDS data are also available through the central ICPSR archive. Inter-university Consortium for Political and Social Research (ICPSR).
Climate & Environment
- Climate & Economic Justice Screening Tool
The tool has an interactive map and uses datasets that are indicators of burdens communities experience in eight categories: climate change, energy, health, housing, legacy pollution, transportation, water and wastewater, and workforce development.
- EPA's EJScreen 2.3
An unofficial copy of EJScreen hosted by the Public Environmental Data Partners.
- Public Environmental Data Project
Committed to preserving and providing public access to federal environmental data. They have identified 57 high-priority databases, of which 37 have been archived thus far [February 2025].
- Dataverse
Data uploaded by the Climate Change and Health Research Coordinating Center (CAFE)
Includes CDC's Social Vulnerability Index data.
Most of what's being placed here is data focusing on health and the environment. Harvard College.
- DataRefuge DataVerse
DataRefuge is also an initiative committed to identifying, assessing, prioritizing, securing, and distributing reliable copies of federal climate and environmental data so that it remains available to researchers. Harvard College.
- EDGI
A research collaborative and network of diverse professionals promoting evidence-based policy-making and public interest science that advances the Environmental Right to Know (ERTK). EDGI's Federal Environmental Web Tracker. Environmental Data & Governance Initiative (EDGI).
- The Climate Mirror Project
Trying to mirror and safely archive U.S. Govt. websites and datasets related to climate, climate change, and global warming.
- Open Energy Data Initiative (OEDI)
A centralized repository of datasets aggregated from the U.S. Department of Energy’s Programs, Offices, and National Laboratories.
- PublicData - UC Santa Barbara Letters & Science IT
Mirrored and archived public data on locally hosted git server. Includes retrieved data sets from CDC, DoE, NIH, and NOAA.
- EPA Risk Management Program Database
A recently updated version of the EPA’s Risk Management Program, with submissions through December 2024. It includes risk management plans filed by facilities with extremely hazardous substances. Data Liberation Project (DLP).
- Healthy Regions & Policies (HeRoP) Lab - U of Illinois Urbana-Champaign
Preserved datasets and guidances include: The Center for Disease Control (CDC); The Environmental Protection Agency (EPA); The Health Resources and Services Administration (HRSA). Available via Box.
- NOAA Heat-Index Files
Internet Archive.
- PANGAEA
An Open Access library aimed at archiving, publishing and distributing georeferenced data from earth system research. It is actively archiving NOAA data. PANGAEA guarantees long-term availability (greater than 10 years) of its content. PANGAEA is open to any project, institution, or individual scientist to use or to archive and publish data. PANGAEA is hosted by the Alfred Wegener Institute, Helmholtz Center for Polar and Marine Research (AWI) and the Center for Marine Environmental Sciences, University of Bremen (MARUM).
- Climate Change and Human Health Literature Portal (CCHHL) data dump
Here is gathered bibliographic information about 22,695 research items (journal articles etc.) as compiled by the National Institute of Environmental Health Sciences (NIEHS); part of the US federal government's National Institutes of Health (NIH) for its Climate Change and Human Health Literature Portal (CCHHL). Internet Archive.
- FEMA files
Direct link to Federal Emergency Management Agency (FEMA) archived data in DataLumos.
- Find Lost Data
This search tool provides links for data that have been downloaded from federal, state, and local agencies. Boston University School of Public Health.
- GeoCrawl
A community-led effort to crawl and archive map/GIS data. They have crawled through hundreds of known GIS servers and are seeking GIS applications that need archiving.
- Climate Program Portal
Tracks climate investments from the Infrastructure Investment and Jobs Act (IIJA) and the Inflation Reduction Act (IRA). Includes many of the latest publicly available datasets in the climate space, including emissions data, disadvantaged community designations, public investments, and climatic event information.
Public Health
- CDC Datasets on Internet Archive
CDC datasets uploaded before January 28th, 2025.
- RestoredCDC.org
"We are developing code to pull CDC pages which were archived by prior to January 20, 2025. Similar archives have been created by the End of Term (https://eotarchive.org) project and are hosted by the Wayback Machine (https://web.archive.org). The individual pages are archived, but links between them are broken and the pages are not easy to locate through web searches. Therefore, we will re-build the links between the pages, to create a site that can be navigated the same way the pre-January 21, 2025 CDC site. The only changes we will make on these pages is to add a header that indicates that this site is not a CDC website. Because of the complex navigation between pages, we will also include a button to report problems in this header. Our goal is to provide a mirror site that provides the same information and user experience as the previous CDC website. Some functionality, such as videos, was not archived and therefore will not work on our site."
- STAT News
Maintains an ongoing blog post that monitors and documents the changes in CDC data. STAT also has begun an effort to download and archive all available files from data.cdc.gov.
- Dataverse
Data uploaded by the Climate Change and Health Research Coordinating Center (CAFE)
Includes CDC's Social Vulnerability Index data. Most of what's being placed here is data focusing on health and the environment. Harvard College.
- SAMSHA
Direct link to Substance Abuse and Mental Health Administration (SAMSHA) archived data in DataLumos.
- Healthy Regions & Policies (HeRoP) Lab - U of Illinois Urbana-Champaign
Preserved datasets and guidances include: The Center for Disease Control (CDC); The Environmental Protection Agency (EPA); The Health Resources and Services Administration (HRSA). Available via Box.
- PublicData - UC Santa Barbara Letters & Science IT
Mirrored and archived public data on locally hosted git server. Includes retrieved data sets from CDC, DoE, NIH, and NOAA.
- COVID.gov
An archived version as of 4/11/25. Archived by Webrecorder who collaborated with Safeguarding Research and Culture (SRC).
- ACASignups.net
Links to archived versions of every CDC government page (Parts 1 through 15).
- Find Lost Data
This search tool provides links for data that have been downloaded from federal, state, and local agencies. Boston University School of Public Health.
- Climate Change and Human Health Literature Portal (CCHHL) data dump
Here is gathered bibliographic information about 22,695 research items (journal articles etc.) as compiled by the National Institute of Environmental Health Sciences (NIEHS); part of the US federal government's National Institutes of Health (NIH) for its Climate Change and Human Health Literature Portal (CCHHL). Internet Archive.
- American College of Obstetricians and Gynecologists (ACOG)
Hosting copies of immunization schedules and contraceptive use guidance from the CDC.
- Reproductive Rights Archive
Archived content from the U.S. Department of Justice website.
- National Center for Biotechnology Information (NCBI)
The 1000 Genomes Project is mirrored in The International Genome Sample Resource (IGSR). European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI). Provides open access.
- Resources and Links
Various individuals and organizations have worked to archive / save data from the NIH, CDC, and other websites. This page lists many of those entities.
USAID
- DHS Spatial Data Repository
Data at the country and country sub-division levels that are part of USAID's Demographic Health Survey (DHS). This collection includes geographically-linked health and demographic data from the DHS Program and the U.S. Census Bureau for mapping in a geographic information system (GIS). The data includes indicators related to: fertility, family planning, maternal and child health, gender, HIV/AIDS, literacy, malaria, nutrition, and sanitation. DataLumos.
- USAID Documents Mirror
3,000+ (possibly all) documents from the United States Agency for International Development (USAID). Internet Archive.
- US Foreign Assistance
State Department and USAID data retrieved from ForeignAssistance.gov. ICPSR.
- DHS Indicator Data
Summary data for countries, country subdivisions, and demographic categories that were generated from USAID's Demographic Health Survey (DHS). The indicators are population-level estimates that were generated from sample surveys that were conducted in over 90 low and middle income countries at various points over several decades. Almost 2,000 indicators capture information related to: fertility, family planning, maternal and child health, gender, HIV/AIDS, literacy, malaria, nutrition, and sanitation. DataLumos.
- DHS API data rescue
Data were retrieved from the DHS Program indicator data API for years 1985-2023. openICPSR.
Statistics
Popular Statistical Resources
- Data.census.gov
The Census Bureau is the leading source of quality data about the nation's people and economy. Browse, search, and map data from many Census Bureau sources.
- Data.gov
Data.gov increases the ability of the public to easily find, download, and use datasets that are generated and held by the Federal Government.
- Data.OK.gov
Data.OK.gov strives to make Oklahoma government more transparent through an unprecedented level of openness in Oklahoma government. By publishing raw datasets in different formats, you can look up statistics, build applications, conduct analysis and perform research.
- National Agricultural Statistics Service
The USDA's National Agricultural Statistics Service (NASS) conducts hundreds of surveys every year and prepares reports covering virtually every aspect of U.S. agriculture.
- Statistical Abstract of the United States
The Statistical Abstract of the U.S., published by the U.S. Census Bureau from 1878 to 2011, was the authoritative and comprehensive summary of statistics on the social, political, and economic organization of the United States. It is now published by ProQuest, and current and historical volumes are held by the library.
- CIA World Factbook
The World Factbook is a great tool for international demographics, national profiles, time zone maps, Flags of the World, country comparisons, and more.
- Social Security Research, Statistics & Policy Analysis
Detailed statistics on the number of Americans receiving Society Security benefits.
- U.S. and World Population Clock
Find current population projections using the U.S. and World Population Clock.
More Statistics by Topic Area
Business/Economy
- Bureau of Economic Analysis (BEA)
BEA strives to provide the most timely, relevant and accurate economic data to help promote a better understanding of the U.S. economy.
- Bureau of Labor Statistics (BLS)
The BLS is the principal Federal agency responsible for measuring labor market activity, working conditions and price changes in the economy. Its mission is to collect, analyze, and disseminate essential economic information to support public and private decision making.
- Economic Census
The Economic Census provides detailed information on employer businesses, including detailed data by industry, geography, and more.
- E-Stats
The U.S. Census Bureau's Internet site devoted exclusively to "Measuring the Electronic Economy."
- FRED - Federal Reserve Economic Data
FRED offers a wealth of economic data and information, including daily U.S. interest rates, monetary and business indicators, exchange rates, balance of payments regional economic data, and more.
- Statistics of U.S. Businesses
Statistics of U.S. Businesses (SUSB) is an annual series that provides national and subnational data on the distribution of economic data by enterprise size and industry.
Crime
- FBI Crime Statistics
Explore the National Incident-Based Reporting System, the Summary Reporting System, the Law Enforcement Officers Killed and Assaulted Program, and the Hate Crime Statistics Program, as well as special compilations such as Cargo Theft Report, and Human Trafficking.
- National Criminal Justice Reference Center(NCJRS)
NCJRS offers justice and drug-related information to support research, policy and program development worldwide.
- Bureau of Justice Statistics
Statistics relating to courts, corrections, crime, law enforcement and victims.
Education
- National Center for Education Statistics
The National Center for Education Statistics (NCES) is the primary federal entity for collecting and analyzing data related to education.
- Education Fast Facts
Fast Facts provides users with concise information on a range of educational issues, from early childhood to adult learning.
- Oklahoma School Report Cards
Find out how Oklahoma schools measure up.
Energy
- Energy Information Administration (EIA)
EIA provides a wide range of information and data products covering energy production, stocks, demand, imports, exports, and prices; and prepares analyses and special reports on topics of current interest.
- Oklahoma Energy Statistics
Statistics and graphs that show energy consumption trends in the major sectors of the Oklahoma economy.
- United Nations Energy Statistics
Energy statistics for the period 1950-2009 for more than 190 countries.
Health
- National Center for Health Statistics (NCHS)
NCHS provides statistical information that will guide actions and policies to improve the health of the American people.
- FastStats: Health Statistics A-Z
FastStats provides quick access to data from the National Center for Health Statistics. Topics include diseases and conditions,injuries, life stages and populations, health care and insurance, births, and deaths.
- Oklahoma Center for Health Statistics
State-level health data and statistics.
- National Center for Health Statistics
The mission of the National Center for Health Statistics (NCHS) is to provide statistical information that will guide actions and policies to improve the health of the American people.
- Stats of the States
Individual state data and rankings for key health indicators, causes of death, and birth related subjects.
- Data from Substance Abuse and Mental Health Services Administration (SAMHSA)
Relevant data for mental health and substance abuse.
- Health Insurance Statistics
Statistics on health insurance coverage from the Census Bureau.
- World Health Organization Global Health Observatory
Health-related data for the WHO's member states.
Transportation
- Bureau of Transportation Statistics
The BTS mission is to create, manage, and share transportation statistical knowledge with public and private transportation communities and the Nation.
- National Highway Traffic Safety Administration: Fatality and Injury Reporting System Tool
Census
Community Census Quick Facts
About QuickFacts: QuickFacts tables are summary profiles showing frequently requested data items from various Census Bureau programs. QuickFacts includes incorporated places with 5,000 or more inhabitants. (Currently, Metro Areas and Zip Codes are not included as QuickFact geographies.)