Top-coded values

Has anyone ever attempted to "fill-in" the top-coded values for the 5 year ACS PUMS data? For example, in 2009, all of the house values (variable: valp) greater than 4 million dollars in Hawaii get cutoff and it's impossible to tell if the observation you're looking at is a 50 million dollar home or a 4.01 million dollar home - since both look the same. The top-coded cutoff values are different for every year by state combination and I haven't had much luck trying to break through the top-coded value. I've mostly been running linear regressions on the data and am just trying to get a reasonable estimate for each observation that is top-coded. Any insight would be greatly appreciated.

[Updated on 2/24/2015 3:08 PM]
Parents
  • John,

    Thanks for your response. The set of households that I've been trying to model are the households with tenure equal to either '1' or '2', meaning that the property is owned free and clear or is mortgaged. This is the owner-occupied set and excludes renters, group quarters, and vacant units.

    You are correct - depending on the year, the top-coding values for each state are different. For example, in 2010, Alaska is top-coded at $800k but in 2014, Alaska is top-coded at $825k. Each record that is top-coded reports the average value of all the top-coded records within that state.
Reply
  • John,

    Thanks for your response. The set of households that I've been trying to model are the households with tenure equal to either '1' or '2', meaning that the property is owned free and clear or is mortgaged. This is the owner-occupied set and excludes renters, group quarters, and vacant units.

    You are correct - depending on the year, the top-coding values for each state are different. For example, in 2010, Alaska is top-coded at $800k but in 2014, Alaska is top-coded at $825k. Each record that is top-coded reports the average value of all the top-coded records within that state.
Children
No Data