YAML Metadata
Warning:
empty or missing yaml metadata in repo card
(https://huggingface.co/docs/hub/model-cards#model-card-metadata)
Clustering Algorithms for Customer Segmentation
This repository contains a comprehensive implementation of various clustering algorithms to perform customer segmentation on a synthetic dataset. The project explores K-Means, Hierarchical Clustering, DBSCAN, and Gaussian Mixture Models (GMM) to identify distinct customer groups based on age and income.
Project Structure
implementation.ipynb: The main Jupyter notebook containing the entire analysis, from data generation to model evaluation and visualization.data/: Contains the syntheticcustomer_data.csvfile.models/: Stores the trained clustering models and the data scaler.results/: Includes the algorithm comparison, detailed analysis, and experiment summary.visualizations/: Contains the output plots, such as the elbow method analysis and cluster comparisons.
Features
- Data Generation: A synthetic customer dataset is generated with clear cluster structures for effective model training and evaluation.
- Multiple Algorithms: Implements and compares four popular clustering algorithms:
- K-Means
- Hierarchical Clustering
- DBSCAN
- Gaussian Mixture Models (GMM)
- Model Evaluation: Uses the elbow method and silhouette scores to determine the optimal number of clusters and evaluate performance.
- Comprehensive Visualization: Generates plots to visualize the clusters, compare algorithm performance, and analyze the optimal 'k'.
How to Use
- Clone the repository:
git clone https://github.com/GruheshKurra/ClusteringAlgorithms.git - Install dependencies:
pip install -r requirements.txt - Run the notebook:
Open and run the
implementation.ipynbnotebook in a Jupyter environment to see the full analysis.
License
This project is licensed under the MIT License.
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support