Classifies SMS/email messages as spam or legitimate using Machine Learning and NLP techniques.
- Accuracy: 85%
- Dataset: Kaggle SMS Spam Collection (5,574 messages)
Python | scikit-learn | NLP | pandas | Matplotlib
- Raw text messages are cleaned and preprocessed
- TF-IDF vectorization converts text to numerical features
- Trained classifier predicts spam vs ham
- Confusion matrix shows model performance
pip install -r requirements.txt python spam_detection.py
Input: "Congratulations! You've won a free travel voucher to Paris. Click here to claim." Output: Is SPAM