[go: nahoru, domu]

Skip to content

tristaneljed/Email-Spam-Detector

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Email Spam Detector

An implementation with Python and Scikit-learn

Spam detection is a beginner’s example of document classification task which involves classifying an email as spam or non-spam (a.k.a. ham) mail.

Steps

  1. Preparing the text data.
  2. Creating word dictionary.
  3. Feature extraction process.
  4. Training the classifier.
  5. Running predictions.

Dataset

In this turotial, I am extracting data from the publicly available mail corpus Ling-spam corpus.

Instalation

  • Unzip the data folders in the same location.
  • Run the code and enjoy!