OCC1990 is a modified version of the 1990 Census Bureau occupational classification scheme. OCC1990 provides researchers with a consistent classification of occupations using the 1990 coding scheme as its starting point. It spans the period from 1950 forward.
Following is a list of the extracted variables, including those for the years of 1980, 1990, 2000, 2010, and 2017:
2017 2010 2000 1990 1980 Variable Variable Label Type Codes acs acs 5pct 5pct 5pct ---------- ----------------------------------------- ---- ----- ---- ---- ---- ---- ---- YEAR Census year [preselected].................. H codes X X X X X DATANUM Data set number [preselected].............. H codes X X X X X SERIAL Household serial number [preselected]...... H codes X X X X X CBSERIAL Original CB household serial number [pre].. H codes X X . . . HHWT Household weight [preselected]............. H codes X X X X X GQ Group quarters status [preselected]........ H codes X X X X X PERNUM Person number in sample unit [preselected]. P codes X X X X X PERWT Person weight [preselected]................ P codes X X X X X STATEFIP State (FIPS code).......................... H codes X X X X X COUNTYFIP County (FIPS code)......................... H codes X X X X X MET2013 Metropolitan area (2013 OMB delineations).. H codes X X X . . PUMA Public Use Microdata Area.................. H codes X X X X . SEX Sex........................................ P codes X X X X X AGE Age........................................ P codes X X X X X RACE Race....................................... P codes X X X X X HISPAN Hispanic origin............................ P codes X X X X X BPL Birthplace................................. P codes X X X X X CITIZEN Citizenship status......................... P codes X X X X X YRIMMIG Year of immigration........................ P codes X X X X X EDUC Educational attainment..................... P codes X X X X X DEGFIELD Field of degree............................ P codes X X . . . DEGFIELD2 Field of degree (2)........................ P codes X X . . . EMPSTAT Employment status.......................... P codes X X X X X OCC Occupation................................. P codes X X X X X OCC1990 Occupation, 1990 basis..................... P codes X X X X X CLASSWKR Class of worker............................ P codes X X X X X WKSWORK2 Weeks worked last year, intervalled........ P codes X X X X X INCWAGE Wage and salary income..................... P codes X X X X XDue to memory constraints, the current online version of the app only includes 2010 and 2017 at present. This may be expanded in the future. Still, since the online app currently includes just two years, this document will focus mainly on looking at a single year's data.
From the variables listed above, RACE, HISPAN, BPL, EDUC, DEGFIELD, DEGFIELD2, EMPSTAT, and CLASSWKR each include an additional detailed variable with the same name with a D added. For example RACE will have a detailed version named RACED. Hence, the 28 variable selected for 2017 above will actually result in 36 variables, including the 8 detailed variables. Following are detailed descriptions of all of these variables:
p YEAR : int sample year: currently 2014 to 2017 p DATANUM : int particular sample from which the case is drawn in a given year. See DATANUM Codes p SERIAL : int identifying number unique to each household record in a given sample. p CBSERIAL : num unique, original identification number assigned to each household record in a given sample by the Census Bureau. p HHWT : int indicates how many households in the U.S. population are represented by a given household in an IPUMS sample. p GQ : int classifies all housing units as a vacant units (0), households (1-2), or group quarters (3-5). See GQ Codes p PERNUM : int numbers all persons within each household consecutively in the order in which they appear on the original census or survey form. p PERWT : int indicates how many persons in the U.S. population are represented by a given person in an IPUMS sample. # STATEFIP : int state where the household was located, using the FIPS coding scheme. See STATEFIP Codes # COUNTY : int county where the household was located, using the ICPSR coding scheme (renamed COUNTYFIP). See COUNTY Codes # MET2013 : int metro area where the household was located, using the 2013 definitions for metropolitan statistical areas (MSAs) from the OMB. See MET2013 Codes # PUMA : int identifies the Public Use Microdata Area (PUMA) where the housing unit was located. See PUMA Codes # SEX : int 1 = male, 2 = female # AGE : int age in years, 0 = Less than 1 year old, 96 = maximum in 2017 # RACE : int race, 1 = White, 2 = Black, 3 = American Indian or Alaskan Native, 4 = Chinese, 5 = Japanese, 6 = Other Asian or Pacific Islander, 7 = Other race, 8 = Two major races, 9 = Three or more major races a RACED : int detailed race, 100 to 996, see RACE Codes # HISPAN : int Hispanic origin, 0 = Not Hispanic, 1 = Mexican, 2 = Puerto Rican, 3 = Cuban, 4 = Other, 9 = Not Reported a HISPAND : int detailed Hispanic origin, 000 to 900, see HISPAN Codes # BPL : int U.S. state or territory or the foreign country where the person was born (188 categories). See BPL Codes a BPLD : int U.S. state or territory or the foreign country where the person was born (572 categories). See BPL Codes # CITIZEN : int citizenship status: 0 = N/A (Born in U.S.), 1 = Born abroad of American parents, 2 = Naturalized citizen, 3 = Not a citizen # YRIMMIG : int year in which a foreign-born person entered the United States. See YRIMMIG Codes # EDUC : int respondents' educational attainment, as measured by the highest year of school or degree completed (12 categories). See EDUC Codes a EDUCD : int respondents' educational attainment, as measured by the highest year of school or degree completed (44 categories). See EDUC Codes # DEGFIELD : int field of degree, 00 = N/A, 11 = Agriculture, ... 21 = Computer and Information Sciences, ... 24 = Engineering, 25 = Engineering Technologies, ... 37 = Mathematics and Statistics, ..., 62 = Business, 64 = History a DEGFIELDD: int detailed field of degree, 0000 to 6403, see DEGFIELD Codes # DEGFIELD2: int second field of study, same codes as DEGFIELD above a DEGFIELD2D:int detailed second field of study, same codes as DEGFIELDD above # EMPSTAT : int employment status: 0 = N/A, 1 = Employed, 2 = Unemployed, 3 = Not in labor force a EMPSTATD : int employment status: 00 = N/A, 10 = At work, 12 = Has job, not working, 14 = Armed forces--at work, 15 = Armed forces--with job but not at work, 20 = Unemployed, 30 = Not in Labor Force # OCC : int person's primary occupation, coded into a contemporary census classification scheme. See OCC Codes and ACS Occupation Codes # OCC1990 : int person's primary occupation, coded to be comparable back to 1950. See OCC1990 Codes and 1990 Occupation Codes # CLASSWKR : int worker class: 0 = N/A, 1 = Self-employed, 2 = Works for wages a CLASSWKRD: int worker class: 00 = N/A, 13 = Self-employed, not incorporated, 14 = Self-employed, incorporated, 22 = Wage, private, 23 = Wage at non-profit, 25 = Federal government, 27 = State government, 28 = Local government, 29 = Unpaid family worker # WKSWORK2 : int number of weeks worked the previous year: 0 = N/A (or Missing), 1 = 1-13 weeks, 2 = 14-26 weeks, 3 = 27-39 weeks, 4 = 40-47 weeks, 5 = 48-49 weeks, 6 = 50-52 weeks # INCWAGE : int total pre-tax wage and salary income in current dollars for the previous year. See INCWAGE Codes c STATE : chr Two-character state abbreviation from https://www2.census.gov/geo/docs/reference/state.txt. c COUNTY : chr County name and two-character state abbreviation from https://www2.census.gov/geo/docs/reference/codes/files/national_county.txt. c METRO : chr Metropolitan area name from https://www2.census.gov/programs-surveys/metro-micro/geographies/reference-files/2017/delineation-files/list2.xls. # = selected, p = preselected, a = added automatically, c = created from lookupsThe # symbol denotes the variables that were manually selected and the p and a symbols that were automatically preselected or selected in response to the manual selections. The last three variables marked with the symbol c were created by using the STATEFIP, COUNTY, and MET2013 to look up the text representations of states, counties, and metropolitan areas in the indicated files.
When the acs90 application first starts, it defaults to the following screen:
As can be seen, the screen has four tabs (Output, Plot, Map, and Usage) and is initially set to the Output tab. It displays a table that shows the education level in 2017 of workers with the occupation "Computer software developers (229)" in all counties of California with 5000 or more workers. The format and contents of the table are determined by the default input in the left sidepanel:
Input Label Default Description -------------------------- ---------- ----------- First Year 2017 First year to include in data Last Year 2017 Last year to include in data Search STATE CA 2-character state abbreviation (blank indicates entire U.S.) COUNTY Optional pattern to match in COUNTY METRO Optional pattern to match in METRO Units Percent Count, Percent, or Percent in group (first of two selected groups) Geography COUNTY STATE, COUNTY, METRO, NATION Occupation Computer software developers (229) - OCC1990 occupation. See OCC1990 Codes and 1990 Occupation Codes Employment status Employed All, Employed, Unemployed, In labor force Group by (select up to 2) EDUCATION Up to two groups for grouping data. Each group is actually a recode for one variable with defined labels for each category within the group. Sort column 4 The column by which to sort the rows. Decimal Places 1 The number of decimals used in percent units. Ascending/Descending Descending The order in which to sort the column specified in "Sort column". Minimum Count 5000 Minimum count to include in the results. Max Hdr Width 40 Max Geo Width 40 Max Col Width 40 Maximum Total Width 240 Maximum Total Rows 900 Range field1 Minimum range1 0 Maximum range1 0 Range field2 Minimum range2 0 Maximum range2 0 Ignore URL ParametersThe inputs labelled 'Search STATE', 'COUNTY', and 'METRO' can be used to filter these fields. Hence, the CA in the 'Select STATE' input causes only metro areas in California to be displayed. The data is also filtered by the selection of 5000 in the input labelled 'Minimum Count'. This causes only those metro areas with 5,000 or more of the specified workers to be listed.
The input labelled 'Occupation' can be set to the major occupation groups shown at this link which are in the data. The data for the acs90 application is currently limited to this subset of occupations due to memory issues on the server but may be expanded if possible. The occupation select list also includes some subgroups of these major groups such as "Computer software developers (229). That is what the occupation is set to by default.
The input labelled "Employment status' can be set to filter the workers shown. It can be currently set to 'All', 'Employed', 'Unemployed', and 'In labor force'. This last selection includes all workers, including those not in the labor force.
The input labelled 'Group by' is the key way of selecting the columns for the table. In this case, the columns are obtained by grouping the workers by their education level (EDUCATION). The categories for EDUCATION are "Bachelor-", Masters", and "Phd+" to indicate workers with a Bachelor's degree or less, a Master's degree, or a Phd degree or higher. The third column with the header 'Count' will always contain the total count for the columns to the right of it.
In the default output, the fourth column and up contain percentages of the total count for each column. Hence, these numbers will add up to 100 percent, discounting any round-off error. This output of percents is set by the input labelled 'Units' which is set to 'Percent'. Setting this to 'Percent in group' will have a different affect if the 'Group by' input contains more than one selection. Selecting 'Count' for units will change the output of these columns to the actual counts. Then, their sum will add up to the total count shown in the third column.
The input labelled 'Sort column' specifies the column by which the rows should be sorted. Since it's set to 4, the rows are sorted by the fourth column. Setting it to a minus number counts the columns from the right. Hence, a setting it to -1 causes the rows to be sorted by the last column. The radio buttons labelled 'Ascending' and 'Descending' beneath this input will cause the sorting to be ascending or descending, respectively. Finally, the input labelled 'Decimal places' will set the number of decimals to use when the units is in percents. For units of 'Count', no decimal places are required.
Below the table are shown the URL parameters which can be used to obtain this page, along with the inputs. Since no parameters are listed for the short format, this page can be obtained via a URL https://econdata.shinyapps.io/acs17/. The long format lists all of the parameters and can be copied to make a record of all of the inputs, even the defaults. However, this link could likewise be obtained using these parameters. Note: The parameters may not have been updated to include all possible parameters but will likely be updated to include them soon.
Selecting the Plot tab, will display the following plot:
Currently, the "Plot Type" can be set to "Bar Plot" or "Line Graph".
Selecting the Map tab, will display the following plot:
This map corresponds to the following data shown on the Output tab:
Year COUNTY Count Bachelor. Masters Phd. 1 2017 Riverside County CA 5,659 89.8 8.7 1.5 2 2017 Los Angeles County CA 46,009 73.4 22.8 3.7 3 2017 Sacramento County CA 9,603 72.8 23.7 3.5 4 2017 San Francisco County CA 25,559 71.5 22.7 5.8 5 2017 Orange County CA 29,277 69.7 26.0 4.2 6 2017 San Diego County CA 27,569 63.9 32.6 3.5 7 2017 Contra Costa County CA 13,197 56.8 38.1 5.1 8 2017 San Mateo County CA 19,108 53.4 40.8 5.8 9 2017 Alameda County CA 39,393 53.0 39.6 7.3 10 2017 Santa Clara County CA 90,412 42.4 48.3 9.3As can be seen, Riverside County ia red since 89.8 percent of its software developers have Bachelor's degrees or less. Los Angeles, Sacramento, San Francisco, and Orange counties are orange since the percentage of their software developers with a Bachelor's degree or less is 73.4, 72.8, 71.5, and 69.7, respectively. San Diego County is yellow-green since its percentage is 63.9 percent and Contra Costa, San Mateo, and Alameda counties are green with percentages of 56.8, 53.4, and 53.0, respectively. Finally, Santa Clara County is light blue since only 42.4 of its software developers have Bachelor's degrees or less.
You can look at counties with just 1,000 or more software developers by going to the Output tab and changing "Minimum Count" to 1000. This will expand the number of counties from 10 to 25 (including (NA) for records for which no county was specified). Clicking the "Ascending" radio button will then display the following rows, sorted from the lowest to highest values of Bachelor's degrees or less:
Year COUNTY Count Bachelor. Masters Phd. 1 2017 Santa Clara County CA 90,412 42.4 48.3 9.3 2 2017 Santa Cruz County CA 2,342 46.9 33.5 19.6 3 2017 Alameda County CA 39,393 53.0 39.6 7.3 4 2017 San Mateo County CA 19,108 53.4 40.8 5.8 5 2017 Stanislaus County CA 1,075 55.0 34.8 10.2 6 2017 Contra Costa County CA 13,197 56.8 38.1 5.1 7 2017 San Joaquin County CA 2,148 58.7 38.9 2.4 8 2017 San Diego County CA 27,569 63.9 32.6 3.5 9 2017 San Bernardino County CA 4,067 65.5 23.9 10.6 10 2017 Marin County CA 1,579 67.7 32.3 0.0 11 2017 Orange County CA 29,277 69.7 26.0 4.2 12 2017 San Francisco County CA 25,559 71.5 22.7 5.8 13 2017 Santa Barbara County CA 2,208 71.9 13.9 14.2 14 2017 Sacramento County CA 9,603 72.8 23.7 3.5 15 2017 Los Angeles County CA 46,009 73.4 22.8 3.7 16 2017 Yolo County CA 1,016 73.9 26.1 0.0 17 2017 San Luis Obispo County CA 2,394 76.4 23.6 0.0 18 2017 Ventura County CA 3,917 80.9 15.7 3.4 19 2017 (NA) 2,529 82.2 15.3 2.5 20 2017 El Dorado County CA 1,205 86.4 13.6 0.0 21 2017 Fresno County CA 1,481 86.7 13.3 0.0 22 2017 Placer County CA 2,372 89.0 9.3 1.7 23 2017 Riverside County CA 5,659 89.8 8.7 1.5 24 2017 Sonoma County CA 2,689 94.1 5.9 0.0 25 2017 Kern County CA 1,011 100.0 0.0 0.0Going back to the Map tab then shows the following map:
As can be seen, more counties are now colored. Also, the range of the color legend has now changed since the percentages now range from 100.0 to 42.4 percent rather than from 89.8 to 42.4 percent as before. As a result, Riverside County is now brown instead of red and it is Kern County that is red.
One problem with the above map and table is that they are focused on the percentage of software developers who have a Bachelor's degree or less rather than a Master's degree or more. This is because the program is currently coded to focus on any one category of a currently defined grouping and the EDUCATION grouping has the categories Bachelor (and below), Masters, and Phd (and above). It would likely be clearer and more positive to focus on the latter. This was accomplished by creating a new grouping called Masters with the categories Bachelor and below and Masters and above. Then switching to the Output tab, setting "Group by" to MASTERS and changing "Sort column" to 5 (to select the Masters or above category) will output the following table:
Year COUNTY Count Bachelor.and.below Masters.and.above 1 2017 Santa Clara County CA 90,412 42.4 57.6 2 2017 Santa Cruz County CA 2,342 46.9 53.1 3 2017 Alameda County CA 39,393 53.0 47.0 4 2017 San Mateo County CA 19,108 53.4 46.6 5 2017 Stanislaus County CA 1,075 55.0 45.0 6 2017 Contra Costa County CA 13,197 56.8 43.2 7 2017 San Joaquin County CA 2,148 58.7 41.3 8 2017 San Diego County CA 27,569 63.9 36.1 9 2017 San Bernardino County CA 4,067 65.5 34.5 10 2017 Marin County CA 1,579 67.7 32.3 11 2017 Orange County CA 29,277 69.7 30.3 12 2017 San Francisco County CA 25,559 71.5 28.5 13 2017 Santa Barbara County CA 2,208 71.9 28.1 14 2017 Sacramento County CA 9,603 72.8 27.2 15 2017 Los Angeles County CA 46,009 73.4 26.6 16 2017 Yolo County CA 1,016 73.9 26.1 17 2017 San Luis Obispo County CA 2,394 76.4 23.6 18 2017 Ventura County CA 3,917 80.9 19.1 19 2017 (NA) 2,529 82.2 17.8 20 2017 El Dorado County CA 1,205 86.4 13.6 21 2017 Fresno County CA 1,481 86.7 13.3 22 2017 Placer County CA 2,372 89.0 11.0 23 2017 Riverside County CA 5,659 89.8 10.2 24 2017 Sonoma County CA 2,689 94.1 5.9 25 2017 Kern County CA 1,011 100.0 0.0Switching to the Map tab will then display the following map:
This is likely much more clear as it focuses on the percentage of software developers with a Master's degree and above. In any case, the precise appearance of the prior maps is affected by the values of the inputs following inputs in the left sidepanel:
Input Label Default Description ------------------ ---------------------------------------------- ----------- Min Pop 200000 Minimum population of cities to display. Lowering this will increase the number of cities displayed. Long Offset 0.5 This will set the approximate offset to the left of the city dot that the city name will appear (in units of longitude). Cities to Skip Oakland,Long Beach This default for California suppressed the display of Oakland and Long Beach which overwrote other city names. Cities to Show Santa Barbara,Santa Rosa,San Rafael,Santa Cruz This default for California adds additional cities which have less than the population specified by "Min Pop" Map Colors lightblue1,green,yellow,orange,red This sets the range of colors used. Include State(s) True(checked) This will cause the entire state (or states for a U.S. map) to display. If unchecked, only the specified counties are displayed.Going back to the Output tab, blanking out the "Search State" field, and returning to the Map tab will display the following map of the entire United States:
As can be seen, most of the counties are colored white. This is because they contain less than 1000 software developers as determined by the survey. Fortunately, it's also possible to look at the percentages by state. Go back to the Output tab and change "Geography" to STATE. This will display a table of 49 of the 50 states plus Washington D.C. To get the missing two states, change "Minimum Count" to 500. The following table of all 50 states plus Washing D.C. will display:
Year STATE Count Bachelor. Masters Phd. 1 2017 ME 3,541 92.3 7.7 0.0 2 2017 MS 4,546 92.3 4.9 2.8 3 2017 AK 806 91.9 8.1 0.0 4 2017 ID 5,281 90.6 9.4 0.0 5 2017 VT 2,563 89.4 10.6 0.0 6 2017 ND 1,623 88.8 11.2 0.0 7 2017 NV 6,256 87.5 9.6 3.0 8 2017 UT 27,182 85.3 12.0 2.7 9 2017 IA 14,940 85.0 12.7 2.3 10 2017 SD 1,739 82.9 17.1 0.0 11 2017 MO 24,758 82.6 16.4 1.0 12 2017 KY 14,512 82.2 17.1 0.6 13 2017 LA 7,395 81.0 16.5 2.5 14 2017 AZ 28,209 80.2 18.7 1.1 15 2017 MN 40,479 80.1 19.6 0.3 16 2017 SC 14,240 79.2 18.9 1.8 17 2017 AR 9,064 78.5 19.1 2.4 18 2017 WI 29,884 78.3 20.1 1.6 19 2017 OR 25,863 77.4 18.7 3.9 20 2017 AL 16,830 76.4 21.4 2.2 21 2017 OK 9,088 76.2 21.7 2.1 22 2017 NE 10,468 74.8 24.3 0.9 23 2017 IN 20,973 74.8 25.0 0.2 24 2017 TN 22,638 74.6 23.2 2.2 25 2017 WY 877 74.5 0.0 25.5 26 2017 CT 18,030 74.1 21.6 4.3 27 2017 RI 5,858 74.0 23.6 2.3 28 2017 PA 62,516 73.9 23.6 2.5 29 2017 OH 44,869 73.9 24.2 2.0 30 2017 NC 54,871 72.8 24.3 2.9 31 2017 FL 73,765 72.6 24.1 3.3 32 2017 KS 14,361 71.7 25.1 3.2 33 2017 NH 13,140 71.6 26.2 2.3 34 2017 WV 2,847 71.4 28.6 0.0 35 2017 GA 51,081 70.7 28.3 1.1 36 2017 HI 2,935 70.1 25.5 4.4 37 2017 IL 66,867 69.5 26.3 4.2 38 2017 CO 50,474 69.4 27.9 2.7 39 2017 TX 133,351 68.2 29.0 2.8 40 2017 NY 92,198 68.0 28.4 3.6 41 2017 MT 2,241 67.5 20.9 11.6 42 2017 MI 41,988 66.5 32.6 0.9 43 2017 MD 58,240 66.0 30.7 3.4 44 2017 NM 5,047 64.3 29.5 6.2 45 2017 WA 94,231 63.5 31.4 5.1 46 2017 MA 70,723 62.6 30.2 7.2 47 2017 VA 78,219 60.7 35.3 3.9 48 2017 CA 340,950 60.2 33.8 6.0 49 2017 NJ 70,269 57.1 39.1 3.9 50 2017 DE 3,662 56.0 36.6 7.3 51 2017 DC 6,818 49.4 47.6 2.9As can be seen Alaska (AK) and Wyoming (WY) had 806 and 877 software developers in 2017, respectively. Going back to the Map tab will display the following map of the entire United States:
As can be seen, all of the states are colored according to the percentage of its software developers who have a Bachelor's degree or less. Of course, Alaska and Wyoming would have been white if "Minimum Count" had been left set to 1,000. Still, it does seem to be more useful to look at a relatively small population, like software developers, by state when looking at the entire United States.