Uneducated Physical Therapists

I'm working on a project that compares individuals' educational attainment in ACS data against the BLS typical education needed for each occupation, in order to create an "underemployment" variable. While doing this, I noticed a high number of physical therapists with the maximum negative value, meaning no high school education while practicing as a physical therapist, an occupation that requires a professional degree. A physical therapist aide is 'OCCP' = 3620 while a physical therapist is 3610, so I'm wondering whether ACS retroactively fixes these types of errors. For instance, there is a 29-year-old in Georgia who apparently completed no schooling at all (SCHL = 1). There are 35 of these physical therapists, 5 chiropractors, 2 judicial law clerks, and an audiologist. I also weighted some occupations that are aggregated in ACS but not in BLS, so the "underemployment" variable takes fractional values that are almost as low for "Lawyers, and judges, magistrates, and other judicial workers" and for "other life scientists". My dataset is the 2018-2022 5-year national population files (a, b, c, and d), combined.

  • The "flag" variables are typically the same variable name as the one you're using, preceded by an F. For example, the imputation (or "allocation") flag for OCCP is FOCCP. For highest education (SCHL) it's FSCHLP. So you could try running your analysis excluding any cases where FOCCP = 1 or FSCHLP = 1 and see if you still get mismatches between education and occupation.
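As a minimal sketch of that exclusion, assuming the person records have already been loaded as dictionaries with integer-coded flags (the sample rows and the helper name `is_reported` are hypothetical, not part of the PUMS files):

```python
# Hypothetical person records; real rows would come from the PUMS person files.
# Assumption: flag columns are integers (a CSV reader may return strings instead).
people = [
    {"OCCP": 3610, "SCHL": 1,  "FOCCP": 1, "FSCHLP": 0},  # occupation allocated
    {"OCCP": 3610, "SCHL": 23, "FOCCP": 0, "FSCHLP": 0},  # both reported
    {"OCCP": 3620, "SCHL": 16, "FOCCP": 0, "FSCHLP": 1},  # education allocated
    {"OCCP": 3620, "SCHL": 16, "FOCCP": 0, "FSCHLP": 0},  # both reported
]


def is_reported(row):
    """True when neither occupation nor education carries an allocation flag."""
    return row["FOCCP"] != 1 and row["FSCHLP"] != 1


# Keep only the cases where both OCCP and SCHL were reported, not allocated.
reported = [row for row in people if is_reported(row)]
print(len(people), "records before,", len(reported), "after excluding allocated cases")
```

The education/occupation comparison can then be rerun on `reported` to see which mismatches survive.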

  • Thank you for your reply! That seems to solve much of the issue, but there are still many unexplained observations. My 'underemployed' variable is EDTAINMENT (for which I just take the top five values of SCHL and put them on the BLS scale) minus the BLS typical required education for the occupation.

    Example: Required professional or doctoral degree = 5. Educational attainment less than high school = 0. 'underemployed' = 0 - 5 = -5 in that scenario. 
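The sign convention in that example can be written out as a short sketch. It assumes both inputs are already on the same 0 to 5 scale; the function name and arguments are illustrative, and the mapping from SCHL onto that scale is not shown here.

```python
def underemployed(attainment, required):
    """Attainment minus typical requirement, both on the same 0-5 scale.

    Negative values mean less education than the occupation typically requires.
    """
    return attainment - required


# Less than high school (0) in an occupation requiring a
# professional or doctoral degree (5), as in the example above.
print(underemployed(0, 5))
```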



    Some stats for reference:

    mean of 'FOCCP' when 'underemployed' < -4
    0.8864541832669323 i.e. 88.65% of observations

    mean of 'FOCCP' when 'underemployed' >= -4
    0.15477719436066595 i.e. 15.48% of observations

    mean of 'FSCHLP' when 'underemployed' < -4
    0.6633466135458167 i.e. 66.33% of observations

    mean of 'FSCHLP' when 'underemployed' >= -4
    0.07940061918797757 i.e. 7.94% of observations

    Now I feel somewhat alarmed by how much imputation there is...
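For reference, the flag shares quoted above are plain (unweighted) means of a 0/1 flag within each group. A minimal sketch of that calculation, using made-up rows and a hypothetical helper `flag_share`:

```python
def flag_share(rows, flag, predicate):
    """Unweighted mean of a 0/1 flag over rows whose 'underemployed' value passes predicate."""
    selected = [row[flag] for row in rows if predicate(row["underemployed"])]
    return sum(selected) / len(selected) if selected else float("nan")


# Made-up rows for illustration only; these are not the actual ACS records.
rows = [
    {"underemployed": -5, "FOCCP": 1, "FSCHLP": 1},
    {"underemployed": -5, "FOCCP": 1, "FSCHLP": 1},
    {"underemployed": -5, "FOCCP": 1, "FSCHLP": 0},
    {"underemployed": -5, "FOCCP": 0, "FSCHLP": 0},
    {"underemployed": -2, "FOCCP": 1, "FSCHLP": 0},
    {"underemployed": 0,  "FOCCP": 0, "FSCHLP": 0},
    {"underemployed": 1,  "FOCCP": 0, "FSCHLP": 0},
    {"underemployed": 3,  "FOCCP": 0, "FSCHLP": 0},
]

for flag in ("FOCCP", "FSCHLP"):
    low = flag_share(rows, flag, lambda u: u < -4)
    rest = flag_share(rows, flag, lambda u: u >= -4)
    print(f"{flag}: {low:.2%} when underemployed < -4, {rest:.2%} otherwise")
```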