CIS 9660 - Group 2 Project Proposal
2026-03-16
CIS 9660 - Data Mining
Proposal Due: March 20, 2026
Presentation: March 16, 2026
The Big Question
Can we predict whether a telecom customer will churn based on their contract type, monthly charges, and service usage?
| Dimension | Focus |
|---|---|
| ๐ Domain | E-commerce & Retail - Telecom Customer Retention |
| ๐ฏ Task Type | Binary Classification (Churn: Yes / No) |
| ๐งช Primary Method | Logistic Regression with Interaction Terms |
๐ kaggle.com/datasets/blastchar/telco-customer-churn
| Property | Value |
|---|---|
| Observations | 7,043 customers |
| Variables | 21 features |
| Target | Churn (Yes / No) |
Demographics
Gender, Senior Citizen, Partner, Dependents
Services
Phone, Internet, Streaming, Security
Account
Contract, Tenure, Monthly/Total Charges, Payment Method
Churn - Binary: Yes / No
Contract - Month-to-month, One year, Two yearMonthlyCharges - Continuous (USD)InternetService - DSL, Fiber Optic, Nonetenure - Months with companyTotalChargesPaymentMethodPaperlessBillingOnlineSecurity, TechSupportStreamingTV, StreamingMoviesCan we predict customer churn from
contract type and monthly charges?
Build a model that accurately classifies customers as likely to churn or stay
Does the interaction between InternetService ร tenure significantly affect churn probability?
๐ฆ Boxplots
Compare MonthlyCharges and tenure distributions across churn classes
๐ต Scatterplots
Explore relationships between continuous variables (MonthlyCharges, tenure, TotalCharges) and churn
๐ Bar Charts
Churn rates segmented by ContractType, InternetService, and PaymentMethod
Tools:
pandas,matplotlib,seaborn
Baseline model - ContractType + MonthlyCharges + InternetService + tenure
Interaction term - ContractType ร InternetService to test synergy effects on churn probability
Model evaluation - Confusion matrix, Accuracy, Precision, Recall, F1-Score, ROC-AUC
Cross-validation - k-fold CV to assess generalizability
Tools:
scikit-learn,statsmodels
| Metric | Target |
|---|---|
| Accuracy | โฅ 80% |
| ROC-AUC | โฅ 0.80 |
| Recall (Churn=Yes) | โฅ 75% |
Recall prioritized - cost of missing a churner is high
Send personalized offers - e.g., upgrade a month-to-month user to a discounted annual contract
Restructure pricing tiers or offer loyalty discounts to high-charge, high-risk customers
Retaining customers longer directly increases Customer Lifetime Value
High churn among fiber optic users may signal service quality issues to investigate
| Milestone | Date | Deliverable |
|---|---|---|
| ๐ฃ๏ธ In-Class Presentation | March 16, 2026 | Slides (email by March 15) |
| ๐ Project Proposal | March 20, 2026 | 1โ2 page writeup (Brightspace) |
| ๐ป Final Presentation | May 11, 2026 | 10-min talk + Q&A |
| ๐ Final Report | May 15, 2026 | 4โ5 page report + Python code + data |
| โญ Peer Evaluation | May 15, 2026 | 0โ5 scale per teammate |
Dataset: Telco Customer Churn (Kaggle)
7,043 obs ร 21 variables
Question: Can contract type & monthly charges predict churn?
Method: Logistic Regression + Interaction Terms
ContractType ร InternetService
Goal: Help marketing teams proactively retain at-risk customers
CIS 9660 ยท Group 2 Project ยท Spring 2026