This demo involves predicting customer churn for a telecommunications company.
- Using Jupyter Notebooks to create and score a model
- Use DSX specific features like asset sharing, collaboration
- Save and retrieve models using Cloud Object Storage
- Deploy the final model in a Watson Machine Learning repository
- Access deployed model through an API call to WML
- Deploy a web app on Bluemix that uses the accessed model
- Python, through Jupyter Notebooks
- Spark, to allow for scaleable modelling through SparkML
- Data Visualization through IBM Pixiedust
- Random Forests Classifier, trained using SparkML pipelines
- Response variable is Customer Churn
- Features include age, income, gender, mobile data usage, payment method, etc.
Notebooks
Predict Customer Churn.ipynb- trains the initial modelPredict Churn - Score New Data.ipynb- performs testing on unseen dataPredict Churn - Deploy to WML.ipynb- interfaces with Watson Machine Learning to deploy the trained and scored model
Data Assets
churn.csv- churn, by customer IDcustomer.csv- customer information, including IDcustomer_churn.csv- a combined data table with customer information and churnnew_customer_churn_data.csv- unseen data, with only customer information, but no churn value