The Emsi compensation model provides occupational wage data with skill and certification premiums. It combines percentile wage data provided from Emsi’s LMI-based government data with observations culled from worker profiles.
The compensation model combines wage data from two distinct sources. The backbone for all Emsi occupational wage data is the Bureau of Labor Statistics’ Occupational Employment Statistics (OES) dataset. This set is updated annually, and provides percentile earnings data for occupations at the metro level throughout the United States. In cases where percentile earnings data are suppressed, Emsi first unsuppresses the data.
Worker profile data is used to supplement OES data by providing wage observations that can be tied to skills and certifications (such granularity does not exist in the OES dataset). Resumes are sourced from Emsi’s third-party provider. Over 27 million resumes from this pool contain wage information.
The OES dataset is updated annually in May, and Emsi’s third-party provider of profile data also provides updates roughly once a year. Therefore, Emsi’s compensation model is updated annually.
The compensation model is designed to provide wage estimates for occupations with certain skill or keyword distinctions. When an occupation is requested with no special skills or keywords, normal Emsi OES-based earnings are returned, and no wage data from profiles is incorporated.
When a user requests an occupation with additional skill or keyword filters (such as a welder with GMAW and FCAW welding skills), a wage curve is created using OES percentile data with profile earnings data plotted along the OES base curve as available. At least 100 profiles matching the user’s filters must be present; otherwise a message is returned notifying the user that there were not enough profiles to create a valid sample.
- Creation of Base National Wage Curves
We first create a base national wage curve for each occupation. These are the wage curves from which all user-requested wage curves will ultimately be derived.
For each occupation, OES provides national percentile estimates at the 10th, 25th, 50th, 75th, and 90th percentiles. The observations for that occupation from profiles (all with different listed skills and keywords) are laid along the OES wage curve in the correct percentile ranges.
Ideally, observations are equally distributed along the wage curve (i.e. 25% of observations fall between the 25th and 50th percentiles). In reality, there tend to be groups and gaps along the wage curve. To counteract the uneven distribution of observations along the wage curve, each observation is weighted. The records in ranges with fewer observations are weighted more heavily, and records in ranges with more observations are weighted less heavily. This method of weighting unevenly-distributed samples is known as poststratification.
- Processing User Requests
The compensation model first maps from job titles to SOCs (if necessary). A national wage curve is then built using just the profiles that incorporate the filters requested by the user (e.g. the welder with GMAW and FCAW welding skills). An overall curve for welders already exists (as outlined in the previous section), and along that curve are instances of resumes that contain GMAW and FCAW as skills. At least 100 of them must be present to form a valid sample. A wage curve is built using the wage information contained in those profiles. This curve is returned to the user.
If a more regionalized wage curve is requested, estimates are generated using OES wage ratios between the nation and the requested region. Observations are not limited by location because of sample size requirements; this allows all relevant wage observations to be used.
The ratios are built by comparing the national and regional OES wage curves for the occupation in question. That ratio is then applied to the 10th, 25th, 50th, 75th, and 90th percentiles on the national curve to create regionalized 10th, 25th, 50th, 75th, and 90th percentiles.
For regional requests involving skills or keywords, the national wage curve incorporating observations that include the skills and keywords is built as outlined above. The regional ratio is then applied to this national estimate, producing a regional estimate.
- Minimum Wage Adjustment
Given the lag in official wage data, it is possible to see results that are lower than a state’s current minimum wage. OES is a three-year rolling survey with an additional year’s lag before publishing, and the observations Emsi incorporates into the compensation model can be from as far back as 2006. Because of these lags, users may sometimes see results (usually in 10th percentile earnings) that are lower than current minimum wage.
Emsi sets minimum wage floors by state using minimum wage data from the United States Department of Labor. In order to match OES data, we use minimum wage floors that correspond to the latest OES year available. For example, the latest OES earnings year available for Emsi’s 2019.1 datarun was 2017. Therefore, during that datarun, Emsi compensation model wage data were for 2017, and 2017 minimum wage laws were used as floors for the earnings data.
- Creation of Base National Wage Curves