What's My Premium is a health insurance premium prediction model that uses machine learning to predict the cost of health insurance based on a number of factors. This code compares the cross value score and train/test accuracies of different ML models--Linear Regression, Support Vector Regression, Random Forest, Gradient Boost, XGBoost--to identify the best model for prediction. The models are trained on a Kaggle dataset of 1338 entries.
Dataset can be found here. Check out the original setup inspiration here.
11/10/2023: currently under development
- exploratory data analysis (EDA) with dataset
- data preprocessing
- train and validate model for best random state number
- linear regression model
- support vector regression model
- random forest model
- gradient boosting model
- XGBoost model
- integrate with the Federal Marketplace API
- create web app front
- deploy to webpage
This project is licensed under the MIT License.