Path: blob/master/Data Science using Python/5 Employee_Appraisal_Regression_Analysis.ipynb
3371 views
Employee Appraisal Regression Analysis
This notebook explores a dataset with employee appraisal data and builds a Linear Regression model.
Features used:
Rating
Behavior
Experience
Band (encoded as Band_Num)
Target:
Appraisal
Dataset Overview
Encode Categorical Feature (Band)
Visuals
Data Preparation
Train Linear Regression Model
Linear equation for Salary Prediction
Appraisal = 10.2(rating) + 5(Exp) +1.5(be)+3(B_num) + 0.081
K-Fold Cross-Validation
K-Fold Cross Validation is a technique used to evaluate the performance of a machine learning model by splitting the dataset into K equal parts (folds):
The model is trained on K-1 folds and validated on the remaining fold.
This process is repeated K times, each time using a different fold as the validation set.
The final performance is the average of all K validation scores.
Why use it?
It helps reduce overfitting.
Provides a more robust estimate of model performance.
Ensures that every data point is used for both training and validation.
Insights & Conclusion
Experience and Rating positively impact Appraisal.
Appraisal is somewhat influenced by Band.
Linear Regression performs reasonably well.
K-Fold helps validate the model across different subsets.
Quick Practice:
Check Model Performance taling two paramters, Three parameters,
Vary test and train Split
Name your Model as Appraisalsys