The 2025 ACS Data Users Conference will be held on May 29, 2025 (virtual day) and June 3, 2025 (in-person in DC)
American Community Survey Data Users Group
Search for topics, people, or keywords
Sign Up
Log in
Site
Search for topics, people, or keywords
User
Home
Discussion Forum
ACS Resources
Webinars
Conferences
Steering Committee
More
Cancel
Home
Discussion Forum
ACS Resources
Webinars
Conferences
About
More
Cancel
Details
Views
7617 views
Replies
8 replies
Subscribers
541 subscribers
Users
0 members are here
Labels
topcode
5 year estimates
PUMS
ACS
top code
Related Census Bureau Resources
Related Forum Threads
Top-coded values
ultimate
over 9 years ago
Has anyone ever attempted to "fill-in" the top-coded values for the 5 year ACS PUMS data? For example, in 2009, all of the house values (variable: valp) greater than 4 million dollars in Hawaii get cutoff and it's impossible to tell if the observation you're looking at is a 50 million dollar home or a 4.01 million dollar home - since both look the same. The top-coded cutoff values are different for every year by state combination and I haven't had much luck trying to break through the top-coded value. I've mostly been running linear regressions on the data and am just trying to get a reasonable estimate for each observation that is top-coded. Any insight would be greatly appreciated.
[Updated on 2/24/2015 3:08 PM]
Parents
Jeffrey Naas
over 9 years ago
You need a secondary source of data to distribute the top-coded data. Your results will vary according to how large (and representative) the secondary dataset is. I used to use proprietary rental listing databases to further break out rental units in the top category according to bedroom size and square footage with some degree of success. You might see if you can get some historical MLS databases (or something similar) and try to create some distributions based on price per square foot or some other parameters.
Cancel
Up
0
Down
Reply
Cancel
Reply
Jeffrey Naas
over 9 years ago
You need a secondary source of data to distribute the top-coded data. Your results will vary according to how large (and representative) the secondary dataset is. I used to use proprietary rental listing databases to further break out rental units in the top category according to bedroom size and square footage with some degree of success. You might see if you can get some historical MLS databases (or something similar) and try to create some distributions based on price per square foot or some other parameters.
Cancel
Up
0
Down
Reply
Cancel
Children
No Data