The Orange Telecom's Churn Dataset, which consists of cleaned customer activity data (features), along with a churn label specifying whether a customer canceled the subscription, will be used to develop our predictive model. The churn-80 and churn-20 datasets can be downloaded from the following links, respectively:
- https://bml-data.s3.amazonaws.com/churn-bigml-80.csv
- https://bml-data.s3.amazonaws.com/churn-bigml-20.csv
However, as more data is often desirable for developing ML models, let's use the larger set (that is, churn-80) for training and cross-validation purposes, and the smaller set (that is, churn-20) for final testing and model performance evaluation.
Note that the latter set is only used to evaluate ...