De-identify ZCTA based data

I'm trying to merge zip codes so the the combined area has a population of 20,000 or more.  I'm trying to de-identify a data set that has zip codes by merging adjacent ZCTAs.

If you have any ideas let me know.   Perhaps someone has already done this and they have a list.

Parents Reply Children
  • Dear Stas,

    I noticed the R package but I haven't had a chance to look into it yet.  Thanks for letting me know about it.  I just got my R program working and it seems to be serviceable. I ran it on all the Massachusetts zip codes. There are 540 of them. I aggregated the ZCTAs so that the groups have a population of 10,000 or more.  It took maybe 3 or 4 minutes.  I merged the polygons that touch a selected polygon.  If there are multiple touching polygons I take the one with the smallest distance between centroids.  I haven't tried a selection criterion based on the populations of the touching polygons.  Merging on distance gives a range for the populations of 10,000 to about 219,000 for Massachusetts.   Merging using populations might lower the upper limit on the resulting populations. I'll have to try it.

    Thanks again,