1 vs 5 year estimates regarding annual difference in median income

We are trying to calculate the difference in median annual income at the county level between those who are high school graduates vs. those who are less than high school graduates.  There are both 1 year and 5 year estimates from the ACS available.  The goal is to provide a yearly estimate of the difference in income between the two groups (e.g., “in 2013, the annual difference in median income between high school graduates and those with less than a high school education was $X”).  With 1 Year estimates, we could make a statement like that.  With 5 year estimates, the statement would have to be more like, “in 2013-2017, the annual difference in median income between high school graduates and those with less than a high school education was $X”.

 

Understanding that the 5 year estimates are more reliable because the samples are larger and also have smaller margins of error, what would be the better source to calculate the difference by year (i.e., 2013, 2014)?  The 1 year estimates are exactly that, even though the margins of error are larger.  The 5 year estimates share overlapping years, so they are not a true indicator of the difference in a single year.  Since we are looking at county level data, the 1 year estimates include a large sample size, but in comparing the 1 Year and 5 Year data, the actual median values are not wildly different, but the margins of error are very different.  Even though we would ideally like to make a statement about differences by year, would it be better to use the 5 year estimates since they are more reliable? 

Parents
  • Thank you all for the responses! I have some additional follow up information as well as questions:

    1. We are using data from Shelby County, TN, which has a population of over 900,000. So it seems that using 1 year data is acceptable.

    2. The purpose of our study is not to determine whether the difference in median income between high school graduates and non-graduates is getting bigger or smaller over time, but to determine the cumulative economic impact of having a high school diploma vs. not over time. In other words, what is the median difference in income (in dollars) in 2013, 2014, 2015, 2016, and 2017 between those with a high school diploma vs. not. Then over the entire timeframe from 2013-17, what is the cumulative difference in income between high school graduates and non-graduates (adding the differences for the individual years). So, we’d like to be able to say something to the effect, “The annual difference in median income between high school graduates and non-graduates in 2013 was $8,046” (actual data, but from the 5 year estimate). Then make a similar statement for each individual year 2014-17, with the ultimate statement something to effect of, “Between 2013 and 2017 the cumulative difference in median income between high school graduates and non-graduates was $34,663” (actual data, but from the 5 year estimate). The point would be to show how having a high school diploma vs. not affects income, and how that difference accumulates over time. We also want to say how that effect for the county is compounded based on the number of high school graduates each year (e.g., if there were 500 graduates in Shelby County in 2013, that is a cumulative difference for the county of $4,023,000 ($8,046 x 500)).

    3. Do you typically report a range of values for the median based on the margin of error. And for our example, whether they would report a range of values for the difference between high school graduates and non-graduates based on the margin of error.
    Thank you!
Reply
  • Thank you all for the responses! I have some additional follow up information as well as questions:

    1. We are using data from Shelby County, TN, which has a population of over 900,000. So it seems that using 1 year data is acceptable.

    2. The purpose of our study is not to determine whether the difference in median income between high school graduates and non-graduates is getting bigger or smaller over time, but to determine the cumulative economic impact of having a high school diploma vs. not over time. In other words, what is the median difference in income (in dollars) in 2013, 2014, 2015, 2016, and 2017 between those with a high school diploma vs. not. Then over the entire timeframe from 2013-17, what is the cumulative difference in income between high school graduates and non-graduates (adding the differences for the individual years). So, we’d like to be able to say something to the effect, “The annual difference in median income between high school graduates and non-graduates in 2013 was $8,046” (actual data, but from the 5 year estimate). Then make a similar statement for each individual year 2014-17, with the ultimate statement something to effect of, “Between 2013 and 2017 the cumulative difference in median income between high school graduates and non-graduates was $34,663” (actual data, but from the 5 year estimate). The point would be to show how having a high school diploma vs. not affects income, and how that difference accumulates over time. We also want to say how that effect for the county is compounded based on the number of high school graduates each year (e.g., if there were 500 graduates in Shelby County in 2013, that is a cumulative difference for the county of $4,023,000 ($8,046 x 500)).

    3. Do you typically report a range of values for the median based on the margin of error. And for our example, whether they would report a range of values for the difference between high school graduates and non-graduates based on the margin of error.
    Thank you!
Children
No Data