SMS Spam Detection App 📱

A machine learning-powered web application that classifies SMS messages as spam or not using NLP techniques and the Multinomial Naive Bayes algorithm. This project includes full model training, evaluation, and a user-friendly Streamlit interface.

📂 Dataset

Source: Kaggle - UCI SMS Spam Collection Dataset
Description: A set of SMS labeled messages as spam or not.

⚙️ Features

Data cleaning and preprocessing
Exploratory Data Analysis (EDA)
Text tokenization using NLTK
Vectorization using TF-IDF
Model comparison using multiple classifiers
Final model: Multinomial Naive Bayes
Evaluation metrics: Accuracy, Precision, Confusion Matrix
Streamlit web app for user interaction

🚀 How to Run

Clone the repo

git clone https://github.com/Mozeel-V/spam-detection.git
cd spam-detection

Create a Conda Environment(Optional)

conda create -n spamguard
conda activate spamguard

Install dependencies
```
pip install -r requirements.txt
```
Run the app
```
streamlit run app.py
```

Preview of the app can be accessed from here

📁 Project Structure

📦 spam-detection/
├── app.py                  # Streamlit app
├── model.pkl               # Trained Naive Bayes model
├── vectorizer.pkl          # TF-IDF vectorizer
├── spam.csv                # Original dataset
├── spam_utf8.csv           # UTF-8 converted dataset
├── spam-detection.ipynb    # Training and EDA notebook
├── requirements.txt        # Python dependencies
├── LICENSE                 # MIT open-source license
└── README.md               # Contains basic info about the project

🧠 Model Insights

The dataset was vectorized using TF-IDF to capture term importance.
Multiple classifiers were tested (e.g. Logistic Regression, SVM).
Multinomial Naive Bayes gave the best results on precision and accuracy.
The model was saved as model.pkl and used directly in the app.

🛠 Tech Stack

Python, Pandas, Scikit-learn, NLTK
TF-IDF Vectorizer
Streamlit (for frontend)

📄 License

This project is licensed under the MIT License.

🤝 Contributions

Feel free to fork, raise issues, or submit PRs to improve this project!

📝 Author

Mozeel Vanwani | IIT Kharagpur CSE

Email: [[email protected]]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SMS Spam Detection App 📱

📂 Dataset

⚙️ Features

🚀 How to Run

📁 Project Structure

🧠 Model Insights

🛠 Tech Stack

📄 License

🤝 Contributions

📝 Author

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.devcontainer		.devcontainer
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
model.pkl		model.pkl
requirements.txt		requirements.txt
spam-detection.ipynb		spam-detection.ipynb
spam.csv		spam.csv
spam_utf8.csv		spam_utf8.csv
vectorizer.pkl		vectorizer.pkl

License

Mozeel-V/spam-detection

Folders and files

Latest commit

History

Repository files navigation

SMS Spam Detection App 📱

📂 Dataset

⚙️ Features

🚀 How to Run

📁 Project Structure

🧠 Model Insights

🛠 Tech Stack

📄 License

🤝 Contributions

📝 Author

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages