Regression Analysis using ACS 5 year estimates

Hello!

 

I am grad student at NC State working with a fellow student on a project involving ArcGIS and ACS 5-year estimate data. We would like to construct a model using ordinary least squares (and eventually geographically weighted regression) with various ACS variables as independent variables. I am unsure how to incorporate the MOEs included in ACS data into these models.

 

Any help for a newbie at using survey data to construct models accurately (particularly in ArcGIS) is much appreciated!

 

Thanks,

Jacob

Parents
  • Hi Jacob,

    The first thought that comes to mind is to use the MOEs (or calculate them, if you're combining estimates) as a "reliability check" and come up with some cut off point (the difference between the upper bound and lower bound has to be less than 10% of the estimate, 12%, ??? - you can decide on what makes sense for your particular variables and geography levels you're planning to use). If the estimates pass that reliability check, then you can use them in your models. If not, treat as missing data (basically set all those cases to unknown).

    If you're finding that there are a lot of estimates that don't pass your reliability cut off point, there are some things you can try:
    -work with a "courser" geography-level (e.g. county subdivisions instead of tracts)
    -combine categories/estimates (e.g., age 65-74 and age 75-84 become age 65-84)
    -work with the 5-year ACS file instead of the 1-year ACS file. Not as recent data, but often more accurate (lower MOEs). It sounds as if you're already working with the 5-year estimates, so this might not help you out here.

    Good luck with your project!

    -Diana
Reply
  • Hi Jacob,

    The first thought that comes to mind is to use the MOEs (or calculate them, if you're combining estimates) as a "reliability check" and come up with some cut off point (the difference between the upper bound and lower bound has to be less than 10% of the estimate, 12%, ??? - you can decide on what makes sense for your particular variables and geography levels you're planning to use). If the estimates pass that reliability check, then you can use them in your models. If not, treat as missing data (basically set all those cases to unknown).

    If you're finding that there are a lot of estimates that don't pass your reliability cut off point, there are some things you can try:
    -work with a "courser" geography-level (e.g. county subdivisions instead of tracts)
    -combine categories/estimates (e.g., age 65-74 and age 75-84 become age 65-84)
    -work with the 5-year ACS file instead of the 1-year ACS file. Not as recent data, but often more accurate (lower MOEs). It sounds as if you're already working with the 5-year estimates, so this might not help you out here.

    Good luck with your project!

    -Diana
Children
No Data