An ensemble approach applied to classify spam e-mails

Kuo Ching Ying, Shih Wei Lin, Zne Jung Lee*, Yen Tim Lin

*Corresponding author for this work

Research output: Contribution to journalJournal Article peer-review

21 Scopus citations

Abstract

Spam e-mails, known as unsolicited e-mail messages, have become an increasing problem for information security. The intrusion of spam e-mails persecute the users and waste the network resources. Traditionally, machine learning and statistical filtering systems are used to filter out spam e-mails. However, there is no unique method can be successfully applied to classify spam e-mails. It is necessary to apply multiple approaches to detect spam and effectively filter out the increasing volumes of spam e-mails. In this paper, an ensemble approach, based on decision tree, support vector machine and back-propagation network, is applied to classify spam e-mails. The proposed approach is based on the characteristics of the spam e-mails. The spam e-mails are categorized into 14 features and then the ensemble approach is performed to classify them. From simulation results, the proposed ensemble approach outperforms other approaches for two test datasets.

Original languageEnglish
Pages (from-to)2197-2201
Number of pages5
JournalExpert Systems with Applications
Volume37
Issue number3
DOIs
StatePublished - 15 03 2010
Externally publishedYes

Keywords

  • Back-propagation network
  • Decision tree
  • E-mail
  • Ensemble
  • Spam
  • Support vector machine

Fingerprint

Dive into the research topics of 'An ensemble approach applied to classify spam e-mails'. Together they form a unique fingerprint.

Cite this