ML- SMS Spam/Ham Clustering

A Machine Learning classic starter project using Python libraries to cluster a data set of 'sms' messages into 'spam' and 'ham' using k-means.

The dataset is a collection of 5,574 SMS meesages taken from UCI Machine Learning repository, need to be tagged as "spam" and "ham".

The whole pipeline conists of the following steps:

Although there are multiple methods for solving the problem, tfidf approach is employed here to obtain high prediction accuracy.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
dataset		dataset
results		results
README.md		README.md
spam_detector.py		spam_detector.py

Provide feedback