The IPUMS NHGIS Privacy-Protected Demonstration Data link together two versions of 2010 Census summary tables:
- Original tables from the 2010 Census Summary Files
- New tables based on experimental runs of the Census Bureau's 2020 Disclosure Avoidance System (DAS) applied to the original 2010 Census responses
The 2020 DAS represents a major change in the Census Bureau's approach to protecting privacy. The Privacy-Protected Demonstration Data facilitate assessments of the new framework, enabling a broad range of users to investigate and provide feedback about the quality of data produced by different DAS versions, including both preliminary trial versions and the final production system.
- Purpose
- Feedback and Questions
- Technical Details
- Versions
- Vintage 2019-10-29
- Vintage 2020-05-27
- Vintage 2020-09-17
- Vintage 2020-11-16
- Vintage 2021-04-28_12-2
- Vintage 2021-04-28_4-5
- Vintage 2021-06-08 (Production Parameters for 2020 Redistricting Data)
- Vintage 2022-03-16
- Vintage 2022-08-25
- Vintage 2023-04-03
- Block Discrepancies Summary for v2021-04-28 files
- Block Discrepancies Summary for v2021-06-08 files
- Citation and Use
- Credits
Purpose
To protect the confidentiality of 2020 Census respondents, the U.S. Census Bureau is using a framework termed "differential privacy". In October 2019, the Census Bureau released a demonstration data product to help users assess the impact of differential privacy on the utility and accuracy of decennial census data. This product was a differentially private version of the 2010 Decennial Census. Several assessments of the demonstration data were presented at the Workshop on 2020 Data Products (December 11-12, 2019) organized by the Committee on National Statistics. These assessments identified limitations in the differentially private data, particularly for low-population geographic units, for which there are no other sources of complete, reliable population data. Workshop participants urged the Bureau to release additional demonstration data as it works to improve utility by refining the differentially private algorithm.
In June 2020, the Census Bureau announced plans to release a Privacy-Protected Microdata File (PPMF) after each programming sprint for which the Bureau generates a corresponding set of quality metrics. The Bureau has continually modified its differentially private algorithm, and each version of the PPMF reflects new modifications. Data users may use the PPMFs to track changes in accuracy and utility, and the June 2021 PPMF, which is based on the final production system for 2020 Redistricting Data, may be used to model the error distribution in published 2020 data tables.
To make these data more user-friendly, IPUMS NHGIS has created a Privacy-Protected Summary File (PPSF) from each version of the PPMF. Our PPSF consists of tabulations where each row represents a geographic unit and each column represents a summary statistic (e.g., the count of females age 0-4).
To facilitate comparisons, we link comparable data from the PPSF and original 2010 Census Summary File 1. These linked files comprise the IPUMS NHGIS Privacy-Protected 2010 Census Demonstration Data product.
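The two steps above (tabulating microdata into a summary file, then linking it to the original counts) can be sketched roughly as follows. This is a minimal illustration, not the NHGIS production workflow: the record fields, the two age groups, the unit identifiers, and the toy records are all hypothetical, and the "original" side is tabulated from toy records only so the example runs on its own (the actual Summary File 1 counts are published tables).

```python
from collections import Counter

def age_group(age):
    # Hypothetical two-way split, used only to keep the example small.
    return "0_4" if age <= 4 else "5_plus"

def tabulate(records):
    """Turn person records into one row of counts per geographic unit."""
    table = {}
    for rec in records:
        row = table.setdefault(rec["gisjoin"], Counter())
        row["total"] += 1
        row[f"{rec['sex'].lower()}_{age_group(rec['age'])}"] += 1
    return table

def link(original, protected):
    """Pair (original, protected) counts for every unit and statistic."""
    linked = {}
    for unit in sorted(set(original) | set(protected)):
        orig_row = original.get(unit, Counter())
        prot_row = protected.get(unit, Counter())
        stats = sorted(set(orig_row) | set(prot_row))
        # Counter returns 0 for statistics missing from one side.
        linked[unit] = {s: (orig_row[s], prot_row[s]) for s in stats}
    return linked

# Toy stand-ins for the original and privacy-protected records.
original_records = [
    {"gisjoin": "A", "sex": "F", "age": 2},
    {"gisjoin": "A", "sex": "M", "age": 40},
    {"gisjoin": "B", "sex": "F", "age": 30},
]
protected_records = [
    {"gisjoin": "A", "sex": "F", "age": 2},
    {"gisjoin": "A", "sex": "F", "age": 3},
    {"gisjoin": "B", "sex": "F", "age": 30},
]

linked = link(tabulate(original_records), tabulate(protected_records))
```

In this toy case, unit `A` keeps the same total in both versions while its count of females age 0-4 differs, which is the kind of comparison the linked files are meant to support.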
Feedback and Questions
We encourage users to provide feedback about the demonstration data by emailing 2020DAS@census.gov. If you send input, please email a copy to IPUMS at nhgis+diffpriv@umn.edu.
You may also direct any comments or questions about these files to nhgis+diffpriv@umn.edu.
Technical Details
- The data for different geographic summary levels are in separate files
- The data files include standard NHGIS "GISJOIN" identifiers and NHGIS variable codes for both the original and privacy-protected data
- The data are stored in CSV (comma-separated values) files within ZIP archives
- The ZIP archives include human-readable codebooks describing the contents of the data files
- The data files use a "wide" record layout, with each data variable in a separate column
- Data for census blocks are in separate files for each state or state equivalent
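Given the layout described above (CSV files inside ZIP archives, one row per geographic unit, one column per variable), a file can be read with the Python standard library alone. The sketch below builds a tiny stand-in archive in memory so that it is self-contained. The file names, the `GISJOIN` values, the column codes `TOTPOP_SF` and `TOTPOP_DP`, and the counts are hypothetical placeholders; the codebook in each real archive lists the actual variable codes.

```python
import csv
import io
import zipfile

def read_csvs_from_zip(zip_bytes):
    """Yield (member name, list of row dicts) for each CSV in a ZIP archive."""
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for name in archive.namelist():
            if name.lower().endswith(".csv"):
                with archive.open(name) as raw:
                    text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
                    yield name, list(csv.DictReader(text))

# Build a small stand-in archive: one wide CSV plus a codebook placeholder.
sample_csv = (
    "GISJOIN,TOTPOP_SF,TOTPOP_DP\n"
    "G0100010,100,98\n"
    "G0100030,250,253\n"
)
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as archive:
    archive.writestr("example_county.csv", sample_csv)
    archive.writestr("example_codebook.txt", "Codebook placeholder\n")

# Difference between the privacy-protected and original count for each unit.
differences = {}
for name, rows in read_csvs_from_zip(buffer.getvalue()):
    for row in rows:
        differences[row["GISJOIN"]] = int(row["TOTPOP_DP"]) - int(row["TOTPOP_SF"])
```

For a downloaded archive, the same function could be called with the bytes of the file on disk. Block-level data would need one call per state file.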
Vintage 2019-10-29
Our first set of Privacy-Protected 2010 Census Demonstration Data facilitates comparisons between data from the original 2010 Summary File 1 and from the Census Bureau's 2010 Demonstration Data Products Baseline (2019-10-29). This dataset differs in format and content from later demonstration data releases. We host it on this separate page.
Vintage 2020-05-27
We derive this version of the Privacy-Protected 2010 Census Demonstration Data from the 2020-05-27 vintage of the PPMF. This file was produced by a new version of the Bureau's differentially private TopDown Algorithm (TDA). Instead of post-processing all noisy measurements at one time, the new version of TDA employs a multipass solution. For this vintage, the first pass processed total population counts and relationship to householder or residence in a type of group quarters. The second pass processed counts required for the PL 94-171 redistricting dataset. The third pass processed counts required for the Population Estimates program, and the final pass processed all remaining counts. In the multipass version of TDA, output from each pass is constrained to the counts from prior passes. For example, if we sum the counts from the 63 race categories in pass two, the sum will equal the total population count generated in pass one.
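The consistency property in the example above (the 63 race category counts summing to the total population) is something a user could check on tabulated data. The following is a minimal sketch of such a check, not part of the TDA itself. The unit identifiers and counts are invented, and the second unit is deliberately inconsistent to show what a failure looks like.

```python
N_RACE_CATEGORIES = 63

def find_inconsistent_units(units):
    """Return ids of units whose race counts do not sum to the total.

    `units` maps a unit id to (total population, list of 63 race counts).
    """
    failing = []
    for unit_id, (total, race_counts) in units.items():
        if len(race_counts) != N_RACE_CATEGORIES:
            raise ValueError(f"{unit_id}: expected {N_RACE_CATEGORIES} categories")
        if sum(race_counts) != total:
            failing.append(unit_id)
    return failing

# Invented counts: unit_a sums to its total of 17, unit_b sums to 18.
units = {
    "unit_a": (17, [10, 5, 2] + [0] * 60),
    "unit_b": (17, [10, 5, 3] + [0] * 60),
}
failing = find_inconsistent_units(units)
```

If the constraint holds as described, running a check like this on the demonstration data should return an empty list.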
Due to the particular circumstances of programming Sprint II, on which the 2020-05-27 vintage is based, this release has no housing unit Privacy-Protected Microdata File and hence no housing tables. Subsequent releases are expected to include housing data, as the first demonstration data products did.
Vintage 2020-05-27: Coverage
- Geographic levels: 18 commonly-used levels, including census blocks
- Tables: 22 tables
- P1. Total Population
- P3. Race [7 categories]
- P4. Hispanic or Latino Origin
- P5. Hispanic or Latino Origin by Race
- P6. Race (Total Races Tallied)
- P7. Hispanic or Latino Origin by Race (Total Races Tallied)
- P8. Race [63 categories]
- P9. Hispanic or Latino, and Not Hispanic or Latino by Race
- P10. Race [63 categories] for the Population 18 years and Over
- P11. Hispanic or Latino, and Not Hispanic or Latino by Race for the Population 18 years and Over
- P12. Sex by Age
- P14. Sex by Age for the Population Under 20 Years
- P42. Group Quarters Population by Group Quarters Type
- P12A. Sex by Age (White Alone)
- P12B. Sex by Age (Black or African American Alone)
- P12C. Sex by Age (American Indian and Alaska Native Alone)
- P12D. Sex by Age (Asian Alone)
- P12E. Sex by Age (Native Hawaiian and Other Pacific Islander Alone)
- P12F. Sex by Age (Some Other Race Alone)
- P12G. Sex by Age (Two or More Races)
- P12H. Sex by Age (Hispanic or Latino Origin)
- P12I. Sex by Age (White Alone, Not Hispanic or Latino)
- We plan to add more tables and possibly more levels in future versions of the files. If you have a specific request, please email it to nhgis+diffpriv@umn.edu.
Vintage 2020-05-27: Parameters
The privacy loss budget assigned to person-level counts in the 2020-05-27 vintage was 4.0, which was allocated to geographic levels and queries as follows:
GEOGRAPHIC LEVEL | Allocation fraction |
---|---|
Nation | 0.2 |
State | 0.2 |
County | 0.12 |
Tract Group | 0.12 |
Tract | 0.12 |
Block Group | 0.12 |
Block | 0.12 |

QUERY | Allocation fraction |
---|---|
Total population | 0.3 |
Relationship to Householder or Residence in Group Quarters | 0.15 |
Voting Age * Hispanic * Race | 0.29 |
Age * Sex * Hispanic * Race | 0.25 |
Detailed | 0.01 |
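To see what these fractions imply for a single query at a single geographic level, the two tables can be combined. The sketch below assumes that each (level, query) pair receives the total budget multiplied by its level fraction and its query fraction. That is our reading of the tables; the formula is not stated here. The function and variable names are illustrative.

```python
from decimal import Decimal

# Values copied from the 2020-05-27 allocation tables.
TOTAL_EPSILON = Decimal("4.0")
GEO_FRACTIONS = {
    "Nation": Decimal("0.2"),
    "State": Decimal("0.2"),
    "County": Decimal("0.12"),
    "Tract Group": Decimal("0.12"),
    "Tract": Decimal("0.12"),
    "Block Group": Decimal("0.12"),
    "Block": Decimal("0.12"),
}
QUERY_FRACTIONS = {
    "Total population": Decimal("0.3"),
    "Relationship to Householder or Residence in Group Quarters": Decimal("0.15"),
    "Voting Age * Hispanic * Race": Decimal("0.29"),
    "Age * Sex * Hispanic * Race": Decimal("0.25"),
    "Detailed": Decimal("0.01"),
}

def allocate(total, geo_fractions, query_fractions):
    """Split a total budget across every (level, query) pair.

    ASSUMPTION: a pair's share is total * level fraction * query fraction.
    """
    if sum(geo_fractions.values()) != 1 or sum(query_fractions.values()) != 1:
        raise ValueError("allocation fractions must each sum to 1")
    return {
        (level, query): total * gf * qf
        for level, gf in geo_fractions.items()
        for query, qf in query_fractions.items()
    }

budget = allocate(TOTAL_EPSILON, GEO_FRACTIONS, QUERY_FRACTIONS)
block_total_pop = budget[("Block", "Total population")]  # 4.0 * 0.12 * 0.3
```

`Decimal` is used instead of floats so that the fractions sum to exactly 1 and the per-pair shares add back up to the total of 4.0 without rounding error.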
Vintage 2020-05-27: Data Files
Vintage 2020-09-17
We derive this version of the Privacy-Protected 2010 Census Demonstration Data from the 2020-09-17 vintage of the PPMF. This file supports the production of summary tables in the P.L. 94-171 Redistricting Data. Recent changes to the operational schedule of the 2020 Decennial Census necessitate a focus on redistricting data, since that is the first release of small-area data. Like the v2020-05-27 PPMF, this file was produced using the multipass post-processing TopDown Algorithm (TDA). The Bureau also modified the TDA to produce more accurate population counts for legal and political units. In particular, the Bureau implemented a new geographic hierarchy for American Indian and Alaska Native tribal areas within states. The total population of all persons living on federally or state-recognized tribal areas within a single state is now invariant.
UPDATE. 2020/11/16: The Census Bureau has released an erratum note related to the 2020-09-17 vintage PPMF.
Vintage 2020-09-17: Coverage
- Geographic levels: 18 commonly-used levels, including census blocks
- Tables: 6 tables
- P8. Race [63 categories]
- P9. Hispanic or Latino, and Not Hispanic or Latino by Race
- P10. Race [63 categories] for the Population 18 years and Over
- P11. Hispanic or Latino, and Not Hispanic or Latino by Race for the Population 18 years and Over
- P42. Group Quarters Population by Group Quarters Type
- H3. Occupancy Status
- We plan to add more tables and possibly more levels in future versions of the files. If you have a specific request, please email it to nhgis+diffpriv@umn.edu.
Vintage 2020-09-17: Parameters
The privacy loss budget assigned to person-level counts in the 2020-09-17 vintage was 4.0, which was allocated to geographic levels and queries as follows:
GEOGRAPHIC LEVEL | Allocation fraction |
---|---|
Nation | 0.2 |
State | 0.16 |
County | 0.16 |
Tract | 0.16 |
Block Group | 0.16 |
Block | 0.16 |

QUERY | Allocation fraction |
---|---|
Total population | 0.3 |
Number of races | 0.10 |
Race | 0.10 |
Hispanic * Number of races | 0.10 |
Hispanic * Race | 0.025 |
Voting age * Number of races | 0.10 |
Voting age * Race | 0.025 |
Voting age * Number of races * Hispanic | 0.025 |
Voting age * Race * Hispanic | 0.025 |
Institution type | 0.10 |
Group quarters type | 0.075 |
Detailed | 0.025 |
The privacy loss budget assigned to housing unit counts in the 2020-09-17 vintage was 0.5. The geographic level allocation was the same as the person-level allocation, and the allocation to the Occupancy Status query was 1.0 (100%).
Vintage 2020-09-17: Data Files
Vintage 2020-11-16
We derive this version of the Privacy-Protected 2010 Census Demonstration Data from the 2020-11-16 vintage of the PPMF. This file supports the production of summary tables in the P.L. 94-171 Redistricting Data. Recent changes to the operational schedule of the 2020 Decennial Census necessitate a focus on redistricting data, since that is the first release of small-area data. Like the v2020-05-27 PPMF, this file was produced using the multipass post-processing TopDown Algorithm (TDA). The Bureau also modified the TDA to produce more accurate population counts for legal and political units. In particular, the Bureau implemented a new geographic hierarchy for American Indian and Alaska Native tribal areas within states. The total population of all persons living on federally or state-recognized tribal areas within a single state is now invariant. This release corrects an error in the code used to produce the v2020-09-17 PPMF.
Vintage 2020-11-16: Coverage
- Geographic levels: 18 commonly-used levels, including census blocks
- Tables: 6 tables
- P8. Race [63 categories]
- P9. Hispanic or Latino, and Not Hispanic or Latino by Race
- P10. Race [63 categories] for the Population 18 years and Over
- P11. Hispanic or Latino, and Not Hispanic or Latino by Race for the Population 18 years and Over
- P42. Group Quarters Population by Group Quarters Type
- H3. Occupancy Status
- We plan to add more tables and possibly more levels in future versions of the files. If you have a specific request, please email it to nhgis+diffpriv@umn.edu.
Vintage 2020-11-16: Parameters
The privacy loss budget assigned to person-level counts in the 2020-11-16 vintage was 4.0, which was allocated to geographic levels and queries as follows:
GEOGRAPHIC LEVEL | Allocation fraction |
---|---|
Nation | 0.2 |
State | 0.16 |
County | 0.16 |
Tract | 0.16 |
Block Group | 0.16 |
Block | 0.16 |

QUERY | Allocation fraction |
---|---|
Total population | 0.3 |
Number of races | 0.10 |
Race | 0.10 |
Hispanic * Number of races | 0.10 |
Hispanic * Race | 0.025 |
Voting age * Number of races | 0.10 |
Voting age * Race | 0.025 |
Voting age * Number of races * Hispanic | 0.025 |
Voting age * Race * Hispanic | 0.025 |
Institution type | 0.10 |
Group quarters type | 0.075 |
Detailed | 0.025 |
The privacy loss budget assigned to housing unit counts in the 2020-11-16 vintage was 0.5. The geographic level allocation was the same as the person-level allocation, and the allocation to the Occupancy Status query was 1.0 (100%).
Vintage 2020-11-16: Data Files
Vintage 2021-04-28_12-2
We derive this version of the Privacy-Protected 2010 Census Demonstration Data from the 2021-04-28_12-2 vintage of the PPMF. This file supports the production of summary tables in the P.L. 94-171 Redistricting Data. Recent changes to the operational schedule of the 2020 Decennial Census necessitate a focus on redistricting data, since that is the first release of small-area data. Like prior PPMF releases, this file was produced using the multipass post-processing TopDown Algorithm (TDA).
Vintage 2021-04-28_12-2: Coverage
- Geographic levels: 18 commonly-used levels, including census blocks
- Tables: 6 tables
- P8. Race [63 categories]
- P9. Hispanic or Latino, and Not Hispanic or Latino by Race
- P10. Race [63 categories] for the Population 18 years and Over
- P11. Hispanic or Latino, and Not Hispanic or Latino by Race for the Population 18 years and Over
- P42. Group Quarters Population by Group Quarters Type
- H3. Occupancy Status
- We plan to add more tables and possibly more levels in future versions of the files. If you have a specific request, please email it to nhgis+diffpriv@umn.edu.
Vintage 2021-04-28_12-2: Parameters
The privacy loss budget assigned to person-level counts in the 2021-04-28_12-2 vintage was 10.3. Details about the allocation are forthcoming.
The privacy loss budget assigned to housing unit counts in the 2021-04-28_12-2 vintage was 1.9. Details about the allocation are forthcoming.
Vintage 2021-04-28_12-2: Data Files
Vintage 2021-04-28_4-5
We derive this version of the Privacy-Protected 2010 Census Demonstration Data from the 2021-04-28_4-5 vintage of the PPMF. This file supports the production of summary tables in the P.L. 94-171 Redistricting Data. Recent changes to the operational schedule of the 2020 Decennial Census necessitate a focus on redistricting data, since that is the first release of small-area data. Like prior PPMF releases, this file was produced using the multipass post-processing TopDown Algorithm (TDA). The Bureau also modified the TDA to produce more accurate population counts for legal and political units. In particular, the Bureau implemented a new geographic hierarchy for American Indian and Alaska Native tribal areas within states. The total population of all persons living on federally or state-recognized tribal areas within a single state is now invariant.
Vintage 2021-04-28_4-5: Coverage
- Geographic levels: 18 commonly-used levels, including census blocks
- Tables: 6 tables
- P8. Race [63 categories]
- P9. Hispanic or Latino, and Not Hispanic or Latino by Race
- P10. Race [63 categories] for the Population 18 years and Over
- P11. Hispanic or Latino, and Not Hispanic or Latino by Race for the Population 18 years and Over
- P42. Group Quarters Population by Group Quarters Type
- H3. Occupancy Status
- We plan to add more tables and possibly more levels in future versions of the files. If you have a specific request, please email it to nhgis+diffpriv@umn.edu.
Vintage 2021-04-28_4-5: Parameters
The privacy loss budget assigned to person-level counts in the 2021-04-28_4-5 vintage was 4.0. Details about the allocation are forthcoming.
The privacy loss budget assigned to housing unit counts in the 2021-04-28_4-5 vintage was 0.5. Details about the allocation are forthcoming.
Vintage 2021-04-28_4-5: Data Files
Vintage 2021-06-08
Based on Production Parameters for 2020 Redistricting Data
We derive this version of the Privacy-Protected 2010 Census Demonstration Data from the 2021-06-08 vintage of the PPMF. This file supports the production of summary tables in the P.L. 94-171 Redistricting Data. The disclosure avoidance parameters used for the 2021-06-08 PPMF are the same as those used for the 2020 P.L. 94-171 Redistricting Data. The overall privacy loss budget (epsilon) for this PPMF was 19.61, with 17.14 allocated to the person tables and 2.47 allocated to the housing unit table.
Vintage 2021-06-08: Coverage
- Geographic levels: 18 commonly-used levels, including census blocks
- Tables: 6 tables
- P8. Race [63 categories]
- P9. Hispanic or Latino, and Not Hispanic or Latino by Race
- P10. Race [63 categories] for the Population 18 years and Over
- P11. Hispanic or Latino, and Not Hispanic or Latino by Race for the Population 18 years and Over
- P42. Group Quarters Population by Group Quarters Type
- H3. Occupancy Status
- We plan to add more tables and possibly more levels in future versions of the files. If you have a specific request, please email it to nhgis+diffpriv@umn.edu.
Vintage 2021-06-08: Parameters
The privacy loss budget assigned to person-level counts in the 2021-06-08 vintage was 17.14. Allocation details are available from the Census Bureau.
The privacy loss budget assigned to housing unit counts in the 2021-06-08 vintage was 2.47.
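The person-level and housing-unit budgets reported in the Parameters sections so far can be collected in one place to compare vintages. The sketch below adds the two budgets to get an overall figure. The text confirms that sum only for the 2021-06-08 vintage (17.14 + 2.47 = 19.61), so applying it to the other vintages is an assumption. The names `BUDGETS` and `combined_budget` are illustrative.

```python
from decimal import Decimal

# (person-level budget, housing-unit budget) as stated in each Parameters
# section; None marks the vintage that was released without housing data.
BUDGETS = {
    "2020-05-27": (Decimal("4.0"), None),
    "2020-09-17": (Decimal("4.0"), Decimal("0.5")),
    "2020-11-16": (Decimal("4.0"), Decimal("0.5")),
    "2021-04-28_12-2": (Decimal("10.3"), Decimal("1.9")),
    "2021-04-28_4-5": (Decimal("4.0"), Decimal("0.5")),
    "2021-06-08": (Decimal("17.14"), Decimal("2.47")),
}

def combined_budget(person, housing):
    """ASSUMPTION: person and housing budgets add to an overall figure."""
    return person if housing is None else person + housing

totals = {
    vintage: combined_budget(person, housing)
    for vintage, (person, housing) in BUDGETS.items()
}
```

Under this assumption the two 2021-04-28 vintages total 12.2 and 4.5, which appears to match the `12-2` and `4-5` suffixes in their names.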
Vintage 2021-06-08: Data Files
Vintage 2022-03-16
We derive this version of Privacy-Protected Demonstration Data from the 2022-03-16 Demonstration Data for the 2020 Census Demographic and Housing Characteristics File. This file was produced by the Bureau's differentially private TopDown Algorithm (TDA) and contains data on topics such as sex, age, race, ethnicity, household and group quarters type, and housing tenure. The initial 2022-03-16 product contained only data for person tables, but we have now added data for housing tables, which the Bureau released on April 14, 2022. These data can be compared with the 2019-10-29 and 2020-05-27 demonstration products to assess the impact of TDA modifications on the output.
We have created four types of files, though not every type is available at every geographic level. The P files, available for states down to block groups, contain data from a variety of tables. The PCT files, available for states down to census tracts, contain data from PCT12. Sex by Age, which provides counts for males and females by single years of age. The H files, available for states down to block groups, contain data from a variety of tables on housing tenure, household type, household size, and age, race, and ethnicity of the householder. The HCT files, available for states down to census tracts, contain data from HCT1, HCT2, and HCT4, which provide counts of occupied housing units by tenure, race and ethnicity of the householder, presence and age of own children, and presence and age of people under 18 years by household type.
Vintage 2022-03-16: Coverage
- Geographic levels: 20 commonly-used levels
- Tables: 80 tables
- P1. Total Population
- P3. Race [7 categories]
- P4. Hispanic or Latino Origin
- P5. Hispanic or Latino Origin by Race
- P6. Race (Total Races Tallied)
- P7. Hispanic or Latino Origin by Race (Total Races Tallied)
- P8. Race [63 categories]
- P9. Hispanic or Latino, and Not Hispanic or Latino by Race
- P10. Race [63 categories] for the Population 18 years and Over
- P11. Hispanic or Latino, and Not Hispanic or Latino by Race for the Population 18 years and Over
- P12. Sex by Age
- P13. Median Age by Sex
- P14. Sex by Age for the Population Under 20 Years
- P16. Population in Households by Age
- P43. Group Quarters Population by Sex by Age by Group Quarters Type
- P12A. Sex by Age (White Alone)
- P12B. Sex by Age (Black or African American Alone)
- P12C. Sex by Age (American Indian and Alaska Native Alone)
- P12D. Sex by Age (Asian Alone)
- P12E. Sex by Age (Native Hawaiian and Other Pacific Islander Alone)
- P12F. Sex by Age (Some Other Race Alone)
- P12G. Sex by Age (Two or More Races)
- P12H. Sex by Age (Hispanic or Latino Origin)
- P12I. Sex by Age (White Alone, Not Hispanic or Latino)
- P13A. Median Age by Sex (White Alone)
- P13B. Median Age by Sex (Black or African American Alone)
- P13C. Median Age by Sex (American Indian and Alaska Native Alone)
- P13D. Median Age by Sex (Asian Alone)
- P13E. Median Age by Sex (Native Hawaiian and Other Pacific Islander Alone)
- P13F. Median Age by Sex (Some Other Race Alone)
- P13G. Median Age by Sex (Two or More Races)
- P13H. Median Age by Sex (Hispanic or Latino)
- P13I. Median Age by Sex (White Alone, Not Hispanic or Latino)
- H10. Total Population in Occupied Housing Units
- PCT12. Sex by Age [103 age categories]
- HCT1. Tenure by Hispanic or Latino Origin of Householder by Race of Householder
- HCT2. Tenure by Presence and Age of Own Children
- HCT4. Tenure by Presence and Age of People Under 18 Years by Household Type (Excluding Householders, Spouses, and Unmarried Partners)
- P18. Household Type
- P18A. Household Type (White Alone Householder)
- P18B. Household Type (Black or African American Alone Householder)
- P18C. Household Type (American Indian and Alaska Native Alone Householder)
- P18D. Household Type (Asian Alone Householder)
- P18E. Household Type (Native Hawaiian and Other Pacific Islander Alone Householder)
- P18F. Household Type (Some Other Race Alone Householder)
- P18G. Household Type (Two or More Races Householder)
- P18H. Household Type (Hispanic or Latino Householder)
- P18I. Household Type (White Alone, Not Hispanic or Latino Householder)
- H1. Housing Units
- H2. Urban and Rural
- H3. Occupancy Status
- H4. Tenure
- H5. Vacancy Status
- H6. Race of Householder
- H7. Hispanic or Latino Origin of Householder by Race of Householder
- H13. Household Size
- H14. Tenure by Race of Householder
- H15. Tenure by Hispanic or Latino Origin of Householder
- H16. Tenure by Household Size
- H17. Tenure by Age of Householder
- H18. Tenure by Household Type by Age of Householder
- H19. Tenure by Presence of People Under 18 Years (Excluding Householders, Spouses, and Unmarried Partners)
- H16A. Tenure by Household Size (White Alone Householder)
- H16B. Tenure by Household Size (Black or African American Alone Householder)
- H16C. Tenure by Household Size (American Indian and Alaska Native Alone Householder)
- H16D. Tenure by Household Size (Asian Alone Householder)
- H16E. Tenure by Household Size (Native Hawaiian and Other Pacific Islander Alone Householder)
- H16F. Tenure by Household Size (Some Other Race Alone Householder)
- H16G. Tenure by Household Size (Two or More Races Householder)
- H16H. Tenure by Household Size (Hispanic or Latino Householder)
- H16I. Tenure by Household Size (White Alone, Not Hispanic or Latino Householder)
- H17A. Tenure by Age of Householder (White Alone Householder)
- H17B. Tenure by Age of Householder (Black or African American Alone Householder)
- H17C. Tenure by Age of Householder (American Indian and Alaska Native Alone Householder)
- H17D. Tenure by Age of Householder (Asian Alone Householder)
- H17E. Tenure by Age of Householder (Native Hawaiian and Other Pacific Islander Alone Householder)
- H17F. Tenure by Age of Householder (Some Other Race Alone Householder)
- H17G. Tenure by Age of Householder (Two or More Races Householder)
- H17H. Tenure by Age of Householder (Hispanic or Latino Householder)
- H17I. Tenure by Age of Householder (White Alone, Not Hispanic or Latino Householder)
- We will extend the P files to census blocks but do not have them prepared yet.
Vintage 2022-03-16: Parameters
The privacy loss budgets assigned to person-level and housing unit-level counts in the 2022-03-16 vintage were 20.82 and 22.77, respectively. Allocation details are available from the Census Bureau.
Vintage 2022-03-16: Data Files
* NOTE: On 2022-04-04, users alerted IPUMS NHGIS staff to errors in tables P12D - I, available in the nationwide P files. These errors were introduced by the NHGIS team when constructing the CSV files from the Census Bureau's 2022-03-16 DHC demonstration data. We re-processed the files and re-released them at 4:00 p.m. CT on 2022-04-05. If you downloaded nationwide P files on or before 2022-04-05, we strongly recommend downloading a new version.
Vintage 2022-08-25
We derive this version of Privacy-Protected Demonstration Data from the 2022-08-25 Demonstration Data for the 2020 Census Demographic and Housing Characteristics File. This file was produced by the Bureau's differentially private TopDown Algorithm (TDA) and contains data on topics such as sex, age, race, ethnicity, household and group quarters type, and housing tenure. The initial 2022-08-25 product contained only data for person tables, but we have now added data for housing tables, which the Bureau released on August 25, 2022. These data can be compared with the 2019-10-29, 2020-05-27, and 2022-03-16 demonstration products to assess the impact of TDA modifications on the output.
We have created four types of files, though not every type is available at every geographic level. The P files, available for states down to block groups, contain data from a variety of tables. The PCT files, available for states down to census tracts, contain data from PCT12. Sex by Age, which provides counts for males and females by single years of age. The H files, available for states down to block groups, contain data from a variety of tables on housing tenure, household type, household size, and age, race, and ethnicity of the householder. The HCT files, available for states down to census tracts, contain data from HCT1, HCT2, and HCT4, which provide counts of occupied housing units by tenure, race and ethnicity of the householder, presence and age of own children, and presence and age of people under 18 years by household type.
Vintage 2022-08-25: Coverage
- Geographic levels: 20 commonly-used levels
- Tables: 80 tables
- P1. Total Population
- P3. Race [7 categories]
- P4. Hispanic or Latino Origin
- P5. Hispanic or Latino Origin by Race
- P6. Race (Total Races Tallied)
- P7. Hispanic or Latino Origin by Race (Total Races Tallied)
- P8. Race [63 categories]
- P9. Hispanic or Latino, and Not Hispanic or Latino by Race
- P10. Race [63 categories] for the Population 18 years and Over
- P11. Hispanic or Latino, and Not Hispanic or Latino by Race for the Population 18 years and Over
- P12. Sex by Age
- P13. Median Age by Sex
- P14. Sex by Age for the Population Under 20 Years
- P16. Population in Households by Age
- P43. Group Quarters Population by Sex by Age by Group Quarters Type
- P12A. Sex by Age (White Alone)
- P12B. Sex by Age (Black or African American Alone)
- P12C. Sex by Age (American Indian and Alaska Native Alone)
- P12D. Sex by Age (Asian Alone)
- P12E. Sex by Age (Native Hawaiian and Other Pacific Islander Alone)
- P12F. Sex by Age (Some Other Race Alone)
- P12G. Sex by Age (Two or More Races)
- P12H. Sex by Age (Hispanic or Latino Origin)
- P12I. Sex by Age (White Alone, Not Hispanic or Latino)
- P13A. Median Age by Sex (White Alone)
- P13B. Median Age by Sex (Black or African American Alone)
- P13C. Median Age by Sex (American Indian and Alaska Native Alone)
- P13D. Median Age by Sex (Asian Alone)
- P13E. Median Age by Sex (Native Hawaiian and Other Pacific Islander Alone)
- P13F. Median Age by Sex (Some Other Race Alone)
- P13G. Median Age by Sex (Two or More Races)
- P13H. Median Age by Sex (Hispanic or Latino)
- P13I. Median Age by Sex (White Alone, Not Hispanic or Latino)
- H10. Total Population in Occupied Housing Units
- PCT12. Sex by Age [103 age categories]
- HCT1. Tenure by Hispanic or Latino Origin of Householder by Race of Householder
- HCT2. Tenure by Presence and Age of Own Children
- HCT4. Tenure by Presence and Age of People Under 18 Years by Household Type (Excluding Householders, Spouses, and Unmarried Partners)
- P18. Household Type
- P18A. Household Type (White Alone Householder)
- P18B. Household Type (Black or African American Alone Householder)
- P18C. Household Type (American Indian and Alaska Native Alone Householder)
- P18D. Household Type (Asian Alone Householder)
- P18E. Household Type (Native Hawaiian and Other Pacific Islander Alone Householder)
- P18F. Household Type (Some Other Race Alone Householder)
- P18G. Household Type (Two or More Races Householder)
- P18H. Household Type (Hispanic or Latino Householder)
- P18I. Household Type (White Alone, Not Hispanic or Latino Householder)
- H1. Housing Units
- H2. Urban and Rural
- H3. Occupancy Status
- H4. Tenure
- H5. Vacancy Status
- H6. Race of Householder
- H7. Hispanic or Latino Origin of Householder by Race of Householder
- H13. Household Size
- H14. Tenure by Race of Householder
- H15. Tenure by Hispanic or Latino Origin of Householder
- H16. Tenure by Household Size
- H17. Tenure by Age of Householder
- H18. Tenure by Household Type by Age of Householder
- H19. Tenure by Presence of People Under 18 Years (Excluding Householders, Spouses, and Unmarried Partners)
- H16A. Tenure by Household Size (White Alone Householder)
- H16B. Tenure by Household Size (Black or African American Alone Householder)
- H16C. Tenure by Household Size (American Indian and Alaska Native Alone Householder)
- H16D. Tenure by Household Size (Asian Alone Householder)
- H16E. Tenure by Household Size (Native Hawaiian and Other Pacific Islander Alone Householder)
- H16F. Tenure by Household Size (Some Other Race Alone Householder)
- H16G. Tenure by Household Size (Two or More Races Householder)
- H16H. Tenure by Household Size (Hispanic or Latino Householder)
- H16I. Tenure by Household Size (White Alone, Not Hispanic or Latino Householder)
- H17A. Tenure by Age of Householder (White Alone Householder)
- H17B. Tenure by Age of Householder (Black or African American Alone Householder)
- H17C. Tenure by Age of Householder (American Indian and Alaska Native Alone Householder)
- H17D. Tenure by Age of Householder (Asian Alone Householder)
- H17E. Tenure by Age of Householder (Native Hawaiian and Other Pacific Islander Alone Householder)
- H17F. Tenure by Age of Householder (Some Other Race Alone Householder)
- H17G. Tenure by Age of Householder (Two or More Races Householder)
- H17H. Tenure by Age of Householder (Hispanic or Latino Householder)
- H17I. Tenure by Age of Householder (White Alone, Not Hispanic or Latino Householder)
- We will extend the P files to census blocks but do not have them prepared yet.
Vintage 2022-08-25: Parameters
The privacy loss budgets assigned to person-level and housing unit-level counts in the 2022-08-25 vintage were 21.97 and 29.92, respectively. Allocation details are available from the Census Bureau.
Vintage 2022-08-25: Data Files
Vintage 2023-04-03
We derive this version of Privacy-Protected Demonstration Data from the 2010 Privacy-Protected Microdata File (PPMF) for the Redistricting Data and DHC, version 2023-04-03. This file was produced by the Bureau's differentially private TopDown Algorithm (TDA) and contains data on topics such as sex, age, race, ethnicity, household and group quarters type, and housing tenure. The 2023-04-03 PPMF was created using the TDA production settings that will be used to create the 2020 Demographic and Housing Characteristics File. These data can be compared with the 2019-10-29, 2020-05-27, 2022-03-16, and 2022-08-25 demonstration products to assess the impact of TDA modifications on the output.
We will be releasing data as we create them. The P files, available for counties down to block groups, contain data from a variety of tables related to race, ethnicity, age, and sex. The H files, available for counties down to block groups, contain data from a variety of tables related to occupancy status, housing tenure, household size, and age, race, and ethnicity of the householder.
Vintage 2023-04-03: Coverage
- Geographic levels: 9 commonly-used levels
- Tables: 34 tables
- P1. Total Population
- P3. Race [7 categories]
- P4. Hispanic or Latino Origin
- P5. Hispanic or Latino Origin by Race
- P6. Race (Total Races Tallied)
- P7. Hispanic or Latino Origin by Race (Total Races Tallied)
- P8. Race [63 categories]
- P9. Hispanic or Latino, and Not Hispanic or Latino by Race
- P10. Race [63 categories] for the Population 18 years and Over
- P11. Hispanic or Latino, and Not Hispanic or Latino by Race for the Population 18 years and Over
- P12. Sex by Age
- P14. Sex by Age for the Population Under 20 Years
- P42. Group Quarters Population by Group Quarters Type
- P12A. Sex by Age (White Alone)
- P12B. Sex by Age (Black or African American Alone)
- P12C. Sex by Age (American Indian and Alaska Native Alone)
- P12D. Sex by Age (Asian Alone)
- P12E. Sex by Age (Native Hawaiian and Other Pacific Islander Alone)
- P12F. Sex by Age (Some Other Race Alone)
- P12G. Sex by Age (Two or More Races)
- P12H. Sex by Age (Hispanic or Latino Origin)
- P12I. Sex by Age (White Alone, Not Hispanic or Latino)
- H1. Housing Units
- H3. Occupancy Status
- H4. Tenure
- H5. Vacancy Status
- H6. Race of Householder
- H7. Hispanic or Latino Origin of Householder by Race of Householder
- H13. Household Size
- H14. Tenure by Race of Householder
- H15. Tenure by Hispanic or Latino Origin of Householder
- H16. Tenure by Household Size
- H17. Tenure by Age of Householder
- H19. Tenure by Presence of People Under 18 Years (Excluding Householders, Spouses, and Unmarried Partners)
Vintage 2023-04-03: Parameters
The privacy loss budgets (epsilon) assigned to person-level and housing unit-level counts in the 2023-04-03 vintage were 26.43 and 34.33, respectively. Allocation details are available from the Census Bureau.
Vintage 2023-04-03: Data Files
- Nationwide Files (P)
- Nationwide Files (H)
Block Discrepancies in the v2021-04-28 Demonstration Data
We have created state-level summaries of differences between census block counts in the 2010 Decennial Census and the recently released 2021-04-28 vintage Privacy-Protected Microdata Files (PPMFs). The summaries will help data users assess the accuracy of the new PPMFs at the census block level. The ZIP archive contains three files. The race_by_age_differences_20210608.xlsx file summarizes block-level discrepancies for five race/ethnicity categories and three age groups. The race/ethnicity categories are:
- Total population (all races)
- White alone, non-Hispanic or Latino
- Black or African American alone or in combination
- Asian alone or in combination
- Hispanic or Latino
The age groups are:
- Total population (all ages)
- Population under 18 years of age
- Population 18 years and older
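As a rough illustration of what a state-level summary of block discrepancies can look like, the sketch below aggregates differences between demonstration data counts and Summary File 1 counts for one race/ethnicity category and age group at a time. The record keys (`state`, `sf1`, `dp`) and the chosen statistics are assumptions made for this example; the statistics actually reported in the spreadsheet are described in the accompanying PDF.

```python
from collections import defaultdict


def summarize_differences(blocks):
    """Aggregate block-level differences (demonstration minus SF1) by state.

    `blocks` is an iterable of dicts with hypothetical keys:
      'state' - state identifier
      'sf1'   - count published in the 2010 Summary File 1
      'dp'    - count tabulated from the privacy-protected microdata
    """
    totals = defaultdict(lambda: {"blocks": 0, "changed": 0, "abs_diff": 0})
    for block in blocks:
        diff = block["dp"] - block["sf1"]
        state = totals[block["state"]]
        state["blocks"] += 1
        state["changed"] += int(diff != 0)  # blocks whose count differs at all
        state["abs_diff"] += abs(diff)
    return {
        name: {
            "blocks": s["blocks"],
            "changed": s["changed"],
            "mean_abs_diff": s["abs_diff"] / s["blocks"],
        }
        for name, s in totals.items()
    }


# Made-up counts for three blocks, for illustration only.
sample = [
    {"state": "MN", "sf1": 40, "dp": 43},
    {"state": "MN", "sf1": 12, "dp": 12},
    {"state": "WI", "sf1": 5, "dp": 0},
]
summary = summarize_differences(sample)
```

Running the function once per category and age group would give one summary per combination.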
The miscellaneous_differences_20210608.xlsx file summarizes block-level discrepancies that meet the following criteria:
- Blocks that changed from greater than 50% Non-Hispanic White alone to less than 50% Non-Hispanic White alone
- Blocks with population ages 0 to 17 but no population ages 18+
- Blocks with population in Summary File 1 but no population in demonstration data
- Blocks with population in households but no occupied housing units
- Blocks with occupied housing units but no population
- Blocks with more than 15 persons per household
We also include a PDF file that provides more detail on the contents of the Excel spreadsheets.
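The criteria listed above can be expressed as simple checks on a single block's counts. The following is a minimal sketch, not the procedure used to build the spreadsheet: all dictionary keys and flag names are hypothetical, and it assumes that every criterion other than the two Summary File 1 comparisons is evaluated on the demonstration data and that "no population" refers to total population.

```python
def flag_block(sf1, dp):
    """Return the names of the discrepancy criteria a block meets.

    `sf1` and `dp` hold counts for one block from Summary File 1 and the
    demonstration data. Keys are hypothetical: 'pop', 'nhw_alone',
    'pop_under18', 'pop_18plus', 'hh_pop', 'occupied_units'.
    """
    flags = []
    # Share comparisons use integer arithmetic (2 * part vs. whole) to avoid
    # floating-point issues; blocks with zero population are skipped (assumption).
    if (sf1["pop"] > 0 and 2 * sf1["nhw_alone"] > sf1["pop"]
            and dp["pop"] > 0 and 2 * dp["nhw_alone"] < dp["pop"]):
        flags.append("nhw_majority_lost")
    if dp["pop_under18"] > 0 and dp["pop_18plus"] == 0:
        flags.append("minors_only")
    if sf1["pop"] > 0 and dp["pop"] == 0:
        flags.append("sf1_pop_only")
    if dp["hh_pop"] > 0 and dp["occupied_units"] == 0:
        flags.append("hh_pop_without_units")
    if dp["occupied_units"] > 0 and dp["pop"] == 0:
        flags.append("units_without_pop")
    if dp["occupied_units"] > 0 and dp["hh_pop"] > 15 * dp["occupied_units"]:
        flags.append("over_15_per_household")
    return flags


# Made-up counts for one block, for illustration only.
sf1_block = {"pop": 10, "nhw_alone": 8, "pop_under18": 3, "pop_18plus": 7,
             "hh_pop": 10, "occupied_units": 4}
dp_block = {"pop": 6, "nhw_alone": 2, "pop_under18": 6, "pop_18plus": 0,
            "hh_pop": 6, "occupied_units": 0}
flags = flag_block(sf1_block, dp_block)
```

A block can meet several criteria at once, which is why the function returns a list rather than a single label.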
Block Discrepancies in the v2021-06-08 Demonstration Data
We have created state-level summaries of differences between census block counts in the 2010 Decennial Census and the 2021-06-08 vintage Privacy-Protected Microdata Files (PPMFs). The summaries will help data users assess the accuracy of the new PPMFs at the census block level. The ZIP archive contains three files. The race_by_age_differences.xlsx file summarizes block-level discrepancies for five race/ethnicity categories and three age groups. The race/ethnicity categories are:
- Total population (all races)
- White alone, non-Hispanic or Latino
- Black or African American alone or in combination
- Asian alone or in combination
- Hispanic or Latino
The age groups are:
- Total population (all ages)
- Population under 18 years of age
- Population 18 years and older
The miscellaneous_differences.xlsx file summarizes block-level discrepancies that meet the following criteria:
- Blocks that changed from greater than 50% Non-Hispanic White alone to less than 50% Non-Hispanic White alone
- Blocks with population ages 0 to 17 but no population ages 18+
- Blocks with population in Summary File 1 but no population in demonstration data
- Blocks with population in households but no occupied housing units
- Blocks with occupied housing units but no population
- Blocks with more than 15 persons per household
We also include a PDF file that provides more detail on the contents of the Excel spreadsheets.
Citation and Use
Use of the IPUMS NHGIS Privacy-Protected Demonstration Data is subject to the same conditions as all NHGIS data:
- You will not redistribute the data without permission.
- You will cite the source appropriately.
In publications or research reports, we request a product-specific citation following this general form:
David Van Riper, Tracy Kugler, and Jonathan Schroeder. IPUMS NHGIS Privacy-Protected 2010 Census Demonstration Data, version YYYYMMDD [Database]. Minneapolis, MN: IPUMS. 2020.
... with YYYYMMDD replaced by the data vintage, corresponding to the PPMF version published by the Census Bureau. A complete recommended citation is also provided in the codebooks that accompany the data files.
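For users who generate citations programmatically, the substitution described above can be sketched as follows. The function name is hypothetical, and the sketch assumes a vintage label in plain YYYY-MM-DD form; labels with suffixes (such as 2021-04-28_12-2) are rejected rather than guessed at, and the codebook citation remains the authoritative version.

```python
import re


def demo_data_citation(vintage):
    """Fill the general citation form for a vintage given as 'YYYY-MM-DD'."""
    match = re.fullmatch(r"(\d{4})-(\d{2})-(\d{2})", vintage)
    if match is None:
        # Suffixed vintages are not handled; consult the codebook instead.
        raise ValueError(f"unsupported vintage label: {vintage!r}")
    version = "".join(match.groups())  # e.g. '2023-04-03' becomes '20230403'
    return (
        "David Van Riper, Tracy Kugler, and Jonathan Schroeder. "
        "IPUMS NHGIS Privacy-Protected 2010 Census Demonstration Data, "
        f"version {version} [Database]. Minneapolis, MN: IPUMS. 2020."
    )


citation = demo_data_citation("2023-04-03")
```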
Credits
Our tabulation of the Census Bureau's Privacy-Protected Microdata Files and the construction of the Privacy-Protected Demonstration Data are supported by funding from the National Science Foundation (SES-1825768) and the Alfred P. Sloan Foundation (G-2019-12589). The Minnesota Population Center has also provided key resources and support, with funding from the Eunice Kennedy Shriver National Institute of Child Health and Human Development (P2CHD041023).