Customer Churn Prediction Analysis

CIS 9660 - Group 2 Project Proposal

2026-03-16

Team Members

Our Team

  • Harrison Cabe
  • Raรบl J. Solรก Navarro
  • Samuel Spitzer
  • Victor Murra Schott

Course

CIS 9660 - Data Mining

Proposal Due: March 20, 2026
Presentation: March 16, 2026

Project Overview

The Big Question

Can we predict whether a telecom customer will churn based on their contract type, monthly charges, and service usage?


Dimension Focus
๐Ÿ“Š Domain E-commerce & Retail - Telecom Customer Retention
๐ŸŽฏ Task Type Binary Classification (Churn: Yes / No)
๐Ÿงช Primary Method Logistic Regression with Interaction Terms

Dataset

Kaggle: Telco Customer Churn

๐Ÿ”— kaggle.com/datasets/blastchar/telco-customer-churn

Property Value
Observations 7,043 customers
Variables 21 features
Target Churn (Yes / No)

Feature Groups

Demographics
Gender, Senior Citizen, Partner, Dependents

Services
Phone, Internet, Streaming, Security

Account
Contract, Tenure, Monthly/Total Charges, Payment Method

Key Variables

๐ŸŽฏ Target

Churn - Binary: Yes / No

๐Ÿ“Œ Primary Predictors

  • Contract - Month-to-month, One year, Two year
  • MonthlyCharges - Continuous (USD)
  • InternetService - DSL, Fiber Optic, None
  • tenure - Months with company

๐Ÿ“Ž Additional Features

  • TotalCharges
  • PaymentMethod
  • PaperlessBilling
  • OnlineSecurity, TechSupport
  • StreamingTV, StreamingMovies

Research Question


Can we predict customer churn from
contract type and monthly charges?


๐Ÿ”ฎ Predictive

Build a model that accurately classifies customers as likely to churn or stay

๐Ÿ”ฌ Inferential

Does the interaction between InternetService ร— tenure significantly affect churn probability?

Methods: EDA

Step 1 - Exploratory Data Analysis

๐Ÿ“ฆ Boxplots
Compare MonthlyCharges and tenure distributions across churn classes

๐Ÿ”ต Scatterplots
Explore relationships between continuous variables (MonthlyCharges, tenure, TotalCharges) and churn

๐Ÿ“Š Bar Charts
Churn rates segmented by ContractType, InternetService, and PaymentMethod


Tools: pandas, matplotlib, seaborn

Methods: Logistic Regression

Step 2 - Binary Logistic Regression

  1. Baseline model - ContractType + MonthlyCharges + InternetService + tenure

  2. Interaction term - ContractType ร— InternetService to test synergy effects on churn probability

  3. Model evaluation - Confusion matrix, Accuracy, Precision, Recall, F1-Score, ROC-AUC

  4. Cross-validation - k-fold CV to assess generalizability


Tools: scikit-learn, statsmodels

Expected Findings

๐Ÿ“ˆ Hypotheses

  • Month-to-month contract customers will have significantly higher churn rates
  • Higher MonthlyCharges correlates with increased churn risk
  • Fiber optic + short tenure customers may show elevated churn (interaction effect)

๐Ÿ“ Model Performance Targets

Metric Target
Accuracy โ‰ฅ 80%
ROC-AUC โ‰ฅ 0.80
Recall (Churn=Yes) โ‰ฅ 75%

Recall prioritized - cost of missing a churner is high

Practical Implications


๐ŸŽฏ Targeted Retention

Send personalized offers - e.g., upgrade a month-to-month user to a discounted annual contract

๐Ÿ’ฐ Pricing Strategy

Restructure pricing tiers or offer loyalty discounts to high-charge, high-risk customers

๐Ÿ“ˆ CLV Optimization

Retaining customers longer directly increases Customer Lifetime Value

๐Ÿ”ง Service Improvement

High churn among fiber optic users may signal service quality issues to investigate

Project Timeline

Milestone Date Deliverable
๐Ÿ—ฃ๏ธ In-Class Presentation March 16, 2026 Slides (email by March 15)
๐Ÿ“‹ Project Proposal March 20, 2026 1โ€“2 page writeup (Brightspace)
๐Ÿ’ป Final Presentation May 11, 2026 10-min talk + Q&A
๐Ÿ“„ Final Report May 15, 2026 4โ€“5 page report + Python code + data
โญ Peer Evaluation May 15, 2026 0โ€“5 scale per teammate

Summary


Dataset: Telco Customer Churn (Kaggle)
7,043 obs ร— 21 variables

Question: Can contract type & monthly charges predict churn?

Method: Logistic Regression + Interaction Terms
ContractType ร— InternetService

Goal: Help marketing teams proactively retain at-risk customers