What's the smallest geographic slice to browse ACS data?

Hello, new to the forum Wave

I’m exploring data in the ACS 5-year survey data. I thought I’d look at the census tract-level, but notice that in many tables the data has large margins of error, with a coefficient of variation 50% or more (screenshot).

When looking at ACS 5-year data, what’s the smallest geographic slice you look at? Seems like the zip code is where margin of error comes within a tolerable range? Of course would depend on the specific table. Tables that include multi-dimensional demographic slicing probably still have a high MOE at the zip code level.

Thanks,

  • Welcome Eric. 

    There is so much in this question, and I'm actually working on a blog post about this very topic! You're right that with more dimensions of demographic slicing, the more you should look at coarser levels of geography. For basics like total pop, total housing units, etc. you might be fine with block groups or tracts, depending on your purpose.

    I want to mention another option though: aggregation.

    Aggregate by geography:

    instead of being limited by census-defined geography, you can aggregate multiple block groups or multiple tracts together until you have a margin of error/reliability score that's acceptable to you.

    Aggregate by rows/categories:

    For example, with the age and sex breakdowns in your screenshot, could you live with 10-year age breakdowns instead of 5-year ones? Do you really just need everyone age 65+?  Do you need the breakdowns by sex, or are you really just interested in age groups? Combining these groups will also help to get the reliability up.

    Hope this helps,

    Diana

  • Hi Diana,

    thanks for your notes here! Would be curious to read your blog post :)

    Aggregating by geography makes sense.

    Regarding aggregating by rows/categories, you said

    > For example, with the age and sex breakdowns in your screenshot, could you live with 10-year age breakdowns instead of 5-year ones? Do you really just need everyone age 65+?  Do you need the breakdowns by sex, or are you really just interested in age groups? Combining these groups will also help to get the reliability up.

    How doyou aggregate by rows/categories in practice? I don't see a way to do this in the table exploration UI that data.census.gov offers.

    I guess I could aggregate by rows with code.

    The "Total Population Estimate" for 40-49 years woud be 598. In "Understanding and Using the U.S. Census Bureau’s American Community Survey" I see:

    > Margin of Error for Aggregated Count Data
    > The ACS allows the use of unique estimates called derived estimates. These are generated by aggregating
    reported estimates across geographic areas or population sub groups. Margin of error is not provided for
    aggregated estimates and therefore needs to be calculated. This is calculated by square root of the sum of
    squared margin of errors. The letter ‘c’ in the equation below represents each estimate that will be included in
    the aggregation.

    I could select the rows, for example as you suggested say 10-year age breakdowns:

    Age Total Population Estimate Margin of error Percent Population Estimate Margin of error
    40 to 44 years 313 164 10.7 4.7
    45 to 49 years 285 114 9.7 3.9

    So presumably the margin of error would be the square of (164^2 + 114^2), which equals 199. Aggregating the Percent Population Estimate would be 20.4 with a margin of error of 6.1.

    Thanks for pointing me in the right direction here! Aggregating by row seems like a real valuable method.

  • As an FYI the calculations (approximate) for various ways of aggregating data (combining categories or geographies) are in the ACS General Handbook chapter 8  here

    https://www.census.gov/content/dam/Census/library/publications/2020/acs/acs_general_handbook_2020_ch08.pdf

    As a general matter. unless you are doing only a small number of calculations, you will need a statistical package to do your calculations.  The R statistics package https://cran.r-project.org/ is a free open source package. The free add on package "tidycensus" can handle downloading and calculations. walker-data.com/.../

  • Here is a link to the full ACS handbook for data users;  https://www.census.gov/content/dam/Census/library/publications/2020/acs/acs_general_handbook_2020.pdf

    Understanding and Using American Community Survey Data What All Data Users Need to Know

  • Thanks Eric and David. I agree with both of you.

    Here is the blog post I mentioned, it's now live. It talks specifically about aggregating tracts together: https://www.esri.com/arcgis-blog/products/arcgis-living-atlas/analytics/acs-summarization-app/ 

    (Not included in here but in discussions for a future blog is the part about aggregating groups, like it seems you're doing with age groups. This is the best way to go for people who are committed to finer geography levels but are okay with combining groups.)

  • As someone new and not into stats etc. I pulled some key measures ACS1 by county and found most had data (not -99999999 ). However, there were some key race measures (Black and Hispanic for example) with a lot of -999999. I think there actually was a state that was missing one or more races for the ACS1.  I think S2701_C01_017E and S2701_C01_023E stood out the most. I assume that the ACS5 will resolve most of these issues at the county and lower levels. I'm going to compare ACS1 and ACS5 for the 900 counties in ACS1 to see what the differences are. I will use around 70 measures. When I'm done, I'll paste the 70 measures ACS1 vs ACS5 in a discussion here (just for reference in case anyone wants to see it). I could also do something with margin or errors, but for all the Public Health websites and more, they just take whatever the data values are without regard to margin of error, so I think I will too. No heavy lifting, just using the published numbers. Not that I could if I tried, not in your league but I'm Ok with that.

    These are a few of the measures and what percent of the 900 counties have data  for ACS1  (not -9999999). So later perhaps 4 columns with percent with data and the actual numbers by diff ACS1 vs ACS5   Top level, no mix and match (by Age by Race, by Poverty)

    Estimate!!Total!!Total population 100%
    Estimate!!Total!!Total population!!SELECTED AGE CATEGORIES!!18 years and over 100%
    Estimate!!Total!!Total population!!SELECTED AGE CATEGORIES!!65 years and over 100%
    Estimate!!Total!!Total population!!SUMMARY INDICATORS!!Median age (years) 100%
    Estimate!!Percent!!Total population!!AGE!!Under 5 years 100%
    Estimate!!Percent!!Total population!!SELECTED AGE CATEGORIES!!Under 18 years 100%
    Estimate!!Female!!Total population 100%
    Estimate!!Total!!HOUSEHOLDS!!Total households 100%
    Estimate!!Total!!FAMILIES!!Total families 100%
    Estimate!!Total!!HOUSEHOLDS!!Average household size 100%
    Estimate!!Total!!FAMILIES!!Average family size 100%
    Estimate!!Total!!Total households!!HOUSING TENURE!!Renter-occupied housing units 100%
    Estimate!!Households!!Total!!Less than $10,000 100%
    Estimate!!Households!!Total!!$200,000 or more 100%
    Estimate!!Households!!Median income (dollars) 100%
    Estimate!!Households!!Mean income (dollars) 100%
    Estimate!!Families!!Total!!Less than $10,000 100%
    Estimate!!Families!!Total!!$200,000 or more 100%
    Estimate!!Families!!Median income (dollars) 100%
    Estimate!!Families!!Mean income (dollars) 100%
    Estimate!!Percent occupied housing units!!Occupied housing units!!HOUSEHOLD SIZE!!3-person household 100%
    Estimate!!Percent occupied housing units!!Occupied housing units!!HOUSEHOLD SIZE!!4-or-more-person household 100%
    Estimate!!Percent below poverty level!!Population for whom poverty status is determined 100%
    Estimate!!Percent below poverty level!!Population for whom poverty status is determined!!AGE!!Under 18 years 100%
    Estimate!!Percent below poverty level!!Population for whom poverty status is determined!!AGE!!Under 18 years!!Under 5 years 100%
    Estimate!!Total!!Civilian noninstitutionalized population!!NATIVITY AND U.S. CITIZENSHIP STATUS!!Foreign born 98%
    Estimate!!Total!!Civilian noninstitutionalized population!!RACE AND HISPANIC OR LATINO ORIGIN!!Black or African American alone 65%
    Estimate!!Total!!Civilian noninstitutionalized population!!RACE AND HISPANIC OR LATINO ORIGIN!!Hispanic or Latino (of any race) 80%
    Estimate!!Total!!Civilian noninstitutionalized population!!RACE AND HISPANIC OR LATINO ORIGIN!!White alone, not Hispanic or Latino 98%
    Estimate!!Total!!Civilian noninstitutionalized population!!DISABILITY STATUS!!With a disability 100%
    Estimate!!Percent Uninsured!!Civilian noninstitutionalized population 100%
    Estimate!!Percent Uninsured!!Civilian noninstitutionalized population!!AGE!!Under 6 years 100%
    Estimate!!Percent Uninsured!!Civilian noninstitutionalized population!!AGE!!Under 19 years 100%
    Estimate!!Percent!!Population 3 years and over enrolled in school!!Kindergarten to 12th grade 100%
    Estimate!!Percent!!Population 3 years and over enrolled in school!!College, undergraduate 100%
    Estimate!!Percent!!Population 3 years and over enrolled in school!!Graduate, professional school 100%
    Estimate!!Percent in public school!!Population 3 years and over enrolled in school!!Kindergarten to 12th grade 99%
    Estimate!!Percent in public school!!Population 3 years and over enrolled in school!!College, undergraduate 99%
    Estimate!!Percent in public school!!Population 3 years and over enrolled in school!!Graduate, professional school 99%
    Estimate!!Percent!!AGE BY EDUCATIONAL ATTAINMENT!!Population 18 to 24 years!!Bachelor's degree or higher 100%
    Estimate!!Percent!!AGE BY EDUCATIONAL ATTAINMENT!!Population 25 years and over!!Bachelor's degree or higher 100%
    Estimate!!Labor Force Participation Rate!!Population 16 years and over 97%
    Estimate!!Unemployment rate!!Population 16 years and over 97%
    Estimate!!Total!!Population 16 to 64 years!!Mean usual hours worked for workers 100%
    Estimate!!Total!!Civilian employed population 16 years and over 98%
    Estimate!!Total!!Civilian employed population 16 years and over!!Service occupations: 98%
    Estimate!!Percent households receiving food stamps/SNAP!!Households 100%
    Estimate!!Total!!POVERTY STATUS IN THE PAST 12 MONTHS!!Civilian population 18 years and over for whom poverty status is determined 100%
    Estimate!!Percent Veterans!!Civilian population 18 years and over 100%