About
This project is part of the Udacity Machine Learning Engineer Nanodegree (https://www.udacity.com/course/machine-learning-engineer-nanodegree-by-google--nd009).
Data
The dataset used in this project is included as customers.csv. More information about it is available on the UCI Machine Learning Repository page.
Project Summary
Created customer segments for a wholesale distributor in Lisbon, Portugal, using unsupervised learning techniques in Python.
Steps involved
- applied preprocessing techniques such as feature scaling and outlier detection
- interpreted data points that had been scaled, transformed, or reduced by PCA
- analyzed PCA dimensions and constructed a new feature space
- clustered a set of data to find hidden patterns in the dataset
- assessed the information provided by the clusters and put it to practical use
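
The steps above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: it uses synthetic spending data instead of customers.csv, and plain NumPy stand-ins for each step. The log transform as the scaling step, Tukey's 1.5 x IQR rule for outliers, k-means as the clustering algorithm, and all function names (tukey_outlier_mask, pca, kmeans) are assumptions made for the example.

```python
import numpy as np


def tukey_outlier_mask(X, k=1.5):
    """Flag rows with at least one feature outside Tukey's fences (assumed rule)."""
    q1, q3 = np.percentile(X, [25, 75], axis=0)
    step = k * (q3 - q1)
    return ((X < q1 - step) | (X > q3 + step)).any(axis=1)


def pca(X, n_components):
    """Project X onto its top principal components via SVD."""
    mean = X.mean(axis=0)
    _, S, Vt = np.linalg.svd(X - mean, full_matrices=False)
    explained = (S ** 2) / np.sum(S ** 2)  # explained variance ratio
    components = Vt[:n_components]
    projected = (X - mean) @ components.T
    return projected, components, mean, explained[:n_components]


def assign(X, centers):
    """Return the index of the nearest center for every row of X."""
    dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
    return dists.argmin(axis=1)


def kmeans(X, k, n_iter=100, seed=0):
    """Plain Lloyd's k-means; a stand-in for whichever algorithm the project used."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        labels = assign(X, centers)
        new_centers = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return assign(X, centers), centers


# Synthetic annual spending for two hypothetical customer groups, three categories.
rng = np.random.default_rng(42)
group_a = rng.lognormal(mean=[9.0, 7.0, 7.0], sigma=0.3, size=(60, 3))
group_b = rng.lognormal(mean=[7.0, 9.0, 9.0], sigma=0.3, size=(60, 3))
spending = np.vstack([group_a, group_b])

# 1. Preprocessing: log-scale the skewed features, then drop outlier rows.
log_data = np.log(spending)
clean = log_data[~tukey_outlier_mask(log_data)]

# 2. PCA: build a two-dimensional feature space.
reduced, components, mean, explained = pca(clean, n_components=2)

# 3. Clustering in the reduced space.
labels, centers = kmeans(reduced, k=2)

# 4. Interpretation: map cluster centers back to approximate spending amounts.
centers_spending = np.exp(centers @ components + mean)
print("explained variance ratio:", explained)
print("approximate spending at cluster centers:\n", centers_spending)
```

Mapping the cluster centers back through the inverse PCA projection and the exponential gives a representative spending profile per segment, which is one way to read the clusters in the original units. Because only two components are kept, the recovered values are approximations.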
