What's the smallest geographic slice to browse ACS data?

eric.andrew.lewis@gmail.com over 2 years ago

Hello, new to the forum

I’m exploring data in the ACS 5-year survey data. I thought I’d look at the census tract-level, but notice that in many tables the data has large margins of error, with a coefficient of variation 50% or more (screenshot).

When looking at ACS 5-year data, what’s the smallest geographic slice you look at? Seems like the zip code is where margin of error comes within a tolerable range? Of course would depend on the specific table. Tables that include multi-dimensional demographic slicing probably still have a high MOE at the zip code level.

Thanks,

Diana Lavery over 2 years ago

Welcome Eric.

There is so much in this question, and I'm actually working on a blog post about this very topic! You're right that with more dimensions of demographic slicing, the more you should look at coarser levels of geography. For basics like total pop, total housing units, etc. you might be fine with block groups or tracts, depending on your purpose.

I want to mention another option though: aggregation.

Aggregate by geography:

instead of being limited by census-defined geography, you can aggregate multiple block groups or multiple tracts together until you have a margin of error/reliability score that's acceptable to you.

Aggregate by rows/categories:

For example, with the age and sex breakdowns in your screenshot, could you live with 10-year age breakdowns instead of 5-year ones? Do you really just need everyone age 65+? Do you need the breakdowns by sex, or are you really just interested in age groups? Combining these groups will also help to get the reliability up.

Hope this helps,

Diana
Cancel
Up 0 Down

Reply

Cancel
eric.andrew.lewis@gmail.com over 2 years ago in reply to Diana Lavery

Hi Diana,

thanks for your notes here! Would be curious to read your blog post :)

Aggregating by geography makes sense.

Regarding aggregating by rows/categories, you said

> For example, with the age and sex breakdowns in your screenshot, could you live with 10-year age breakdowns instead of 5-year ones? Do you really just need everyone age 65+? Do you need the breakdowns by sex, or are you really just interested in age groups? Combining these groups will also help to get the reliability up.

How doyou aggregate by rows/categories in practice? I don't see a way to do this in the table exploration UI that data.census.gov offers.

I guess I could aggregate by rows with code.

The "Total Population Estimate" for 40-49 years woud be 598. In "Understanding and Using the U.S. Census B ureau’s American Community Survey" I see:

> Margin of Error for Aggregated Count Data
> The ACS allows the use of unique estimates called derived estimates. These are generated by aggregating
reported estimates across geographic areas or population sub groups. Margin of error is not provided for
aggregated estimates and therefore needs to be calculated. This is calculated by square root of the sum of
squared margin of errors. The letter ‘c’ in the equation below represents each estimate that will be included in
the aggregation.

I could select the rows, for example as you suggested say 10-year age breakdowns:

Age Total Population Estimate Margin of error Percent Population Estimate Margin of error

40 to 44 years 313 164 10.7 4.7

45 to 49 years 285 114 9.7 3.9

So presumably the margin of error would be the square of (164^2 + 114^2), which equals 199. Aggregating the Percent Population Estimate would be 20.4 with a margin of error of 6.1.

Thanks for pointing me in the right direction here! Aggregating by row seems like a real valuable method.
Cancel
Up 0 Down

Reply

Cancel
David Dorer over 2 years ago in reply to eric.andrew.lewis@gmail.com

As an FYI the calculations (approximate) for various ways of aggregating data (combining categories or geographies) are in the ACS General Handbook chapter 8 here

https://www.census.gov/content/dam/Census/library/publications/2020/acs/acs_general_handbook_2020_ch08.pdf

As a general matter. unless you are doing only a small number of calculations, you will need a statistical package to do your calculations. The R statistics package https://cran.r-project.org/ is a free open source package. The free add on package "tidycensus" can handle downloading and calculations. walker-data.com/.../
Cancel
Up 0 Down

Reply

Cancel
David Dorer over 2 years ago in reply to David Dorer

Here is a link to the full ACS handbook for data users; https://www.census.gov/content/dam/Census/library/publications/2020/acs/acs_general_handbook_2020.pdf

Understanding and Using American Community Survey Data What All Data Users Need to Know
Cancel
Up 0 Down

Reply

Cancel
Diana Lavery over 2 years ago in reply to eric.andrew.lewis@gmail.com

Thanks Eric and David. I agree with both of you.

Here is the blog post I mentioned, it's now live. It talks specifically about aggregating tracts together: https://www.esri.com/arcgis-blog/products/arcgis-living-atlas/analytics/acs-summarization-app/

(Not included in here but in discussions for a future blog is the part about aggregating groups, like it seems you're doing with age groups. This is the best way to go for people who are committed to finer geography levels but are okay with combining groups.)
Cancel
Up 0 Down

Reply

Cancel

tomlaheyh over 2 years ago

As someone new and not into stats etc. I pulled some key measures ACS1 by county and found most had data (not -99999999 ). However, there were some key race measures (Black and Hispanic for example) with a lot of -999999. I think there actually was a state that was missing one or more races for the ACS1. I think S2701_C01_017E and S2701_C01_023E stood out the most. I assume that the ACS5 will resolve most of these issues at the county and lower levels. I'm going to compare ACS1 and ACS5 for the 900 counties in ACS1 to see what the differences are. I will use around 70 measures. When I'm done, I'll paste the 70 measures ACS1 vs ACS5 in a discussion here (just for reference in case anyone wants to see it). I could also do something with margin or errors, but for all the Public Health websites and more, they just take whatever the data values are without regard to margin of error, so I think I will too. No heavy lifting, just using the published numbers. Not that I could if I tried, not in your league but I'm Ok with that.

These are a few of the measures and what percent of the 900 counties have data for ACS1 (not -9999999). So later perhaps 4 columns with percent with data and the actual numbers by diff ACS1 vs ACS5 Top level, no mix and match (by Age by Race, by Poverty)

Estimate!!Total!!Total population	100%
Estimate!!Total!!Total population!!SELECTED AGE CATEGORIES!!18 years and over	100%
Estimate!!Total!!Total population!!SELECTED AGE CATEGORIES!!65 years and over	100%
Estimate!!Total!!Total population!!SUMMARY INDICATORS!!Median age (years)	100%
Estimate!!Percent!!Total population!!AGE!!Under 5 years	100%
Estimate!!Percent!!Total population!!SELECTED AGE CATEGORIES!!Under 18 years	100%
Estimate!!Female!!Total population	100%
Estimate!!Total!!HOUSEHOLDS!!Total households	100%
Estimate!!Total!!FAMILIES!!Total families	100%
Estimate!!Total!!HOUSEHOLDS!!Average household size	100%
Estimate!!Total!!FAMILIES!!Average family size	100%
Estimate!!Total!!Total households!!HOUSING TENURE!!Renter-occupied housing units	100%
Estimate!!Households!!Total!!Less than $10,000	100%
Estimate!!Households!!Total!!$200,000 or more	100%
Estimate!!Households!!Median income (dollars)	100%
Estimate!!Households!!Mean income (dollars)	100%
Estimate!!Families!!Total!!Less than $10,000	100%
Estimate!!Families!!Total!!$200,000 or more	100%
Estimate!!Families!!Median income (dollars)	100%
Estimate!!Families!!Mean income (dollars)	100%
Estimate!!Percent occupied housing units!!Occupied housing units!!HOUSEHOLD SIZE!!3-person household	100%
Estimate!!Percent occupied housing units!!Occupied housing units!!HOUSEHOLD SIZE!!4-or-more-person household	100%
Estimate!!Percent below poverty level!!Population for whom poverty status is determined	100%
Estimate!!Percent below poverty level!!Population for whom poverty status is determined!!AGE!!Under 18 years	100%
Estimate!!Percent below poverty level!!Population for whom poverty status is determined!!AGE!!Under 18 years!!Under 5 years	100%
Estimate!!Total!!Civilian noninstitutionalized population!!NATIVITY AND U.S. CITIZENSHIP STATUS!!Foreign born	98%
Estimate!!Total!!Civilian noninstitutionalized population!!RACE AND HISPANIC OR LATINO ORIGIN!!Black or African American alone	65%
Estimate!!Total!!Civilian noninstitutionalized population!!RACE AND HISPANIC OR LATINO ORIGIN!!Hispanic or Latino (of any race)	80%
Estimate!!Total!!Civilian noninstitutionalized population!!RACE AND HISPANIC OR LATINO ORIGIN!!White alone, not Hispanic or Latino	98%
Estimate!!Total!!Civilian noninstitutionalized population!!DISABILITY STATUS!!With a disability	100%
Estimate!!Percent Uninsured!!Civilian noninstitutionalized population	100%
Estimate!!Percent Uninsured!!Civilian noninstitutionalized population!!AGE!!Under 6 years	100%
Estimate!!Percent Uninsured!!Civilian noninstitutionalized population!!AGE!!Under 19 years	100%
Estimate!!Percent!!Population 3 years and over enrolled in school!!Kindergarten to 12th grade	100%
Estimate!!Percent!!Population 3 years and over enrolled in school!!College, undergraduate	100%
Estimate!!Percent!!Population 3 years and over enrolled in school!!Graduate, professional school	100%
Estimate!!Percent in public school!!Population 3 years and over enrolled in school!!Kindergarten to 12th grade	99%
Estimate!!Percent in public school!!Population 3 years and over enrolled in school!!College, undergraduate	99%
Estimate!!Percent in public school!!Population 3 years and over enrolled in school!!Graduate, professional school	99%
Estimate!!Percent!!AGE BY EDUCATIONAL ATTAINMENT!!Population 18 to 24 years!!Bachelor's degree or higher	100%
Estimate!!Percent!!AGE BY EDUCATIONAL ATTAINMENT!!Population 25 years and over!!Bachelor's degree or higher	100%
Estimate!!Labor Force Participation Rate!!Population 16 years and over	97%
Estimate!!Unemployment rate!!Population 16 years and over	97%
Estimate!!Total!!Population 16 to 64 years!!Mean usual hours worked for workers	100%
Estimate!!Total!!Civilian employed population 16 years and over	98%
Estimate!!Total!!Civilian employed population 16 years and over!!Service occupations:	98%
Estimate!!Percent households receiving food stamps/SNAP!!Households	100%
Estimate!!Total!!POVERTY STATUS IN THE PAST 12 MONTHS!!Civilian population 18 years and over for whom poverty status is determined	100%
Estimate!!Percent Veterans!!Civilian population 18 years and over	100%

Age	Total Population Estimate	Margin of error	Percent Population Estimate	Margin of error
40 to 44 years	313	164	10.7	4.7
45 to 49 years	285	114	9.7	3.9