H-1b Visa Prediction Models
Buisiness question: Foreign worker with specialized knowledge, what would the annual wage look like?
Predict annual wage
The list of columns:
First model: Linear Regression model with feature PREVAILING_WAGE_RATE_ANNUAL
improvement: using gradient boosting
with 1 feature: PREVAILING_WAGE_RATE_ANNUAL
From this above graph, we see the increasing of prevailing wage generally increases annual wage
with 2 features: PREVAILING_WAGE_RATE_ANNUAL, SOC_CODE
(SOC_CODE: Occupational code associated with the job being requested for certification, as classified by the Standard Occupational Classification (SOC) System.)
for example: the occupation for code 17-2061 is Computer Hardware Engineers
Explain individual predictions with shapley value plots
NAICS_CODE: Industry code associated with the employer requesting permanent labor condition, as classified by the North American Industrial Classification System (NAICS).
Here is an example graph that help us understand how the algorithm made it’s classification.
Predict prevailing wage level
Buisiness question: Foreign workers with level I salary have lower chance of drawing the lottery, what are the prevailing wage level for each filing?
Feature Importance for Random Forest:
Feature Importance for XGBoost: