Dimensionality Reduction Applied to Spam Filtering using Bayesian Classifiers

Authors

  • Tiago A. Almeida
  • Akebo Yamakami

DOI:

https://doi.org/10.5335/rbca.2013.1317

Keywords:

Redução de dimensionalidade, Filtragem de spams, Aprendizagem de máquina

Abstract

In recent years, e-mail spam has become an increasingly important problem with a big economic impact in society. Fortunately, there are different approaches able to automatically detect and remove most of these messages, and the best-known ones are based on Bayesian decision theory. However, the most of these probabilistic approaches have the same difficulty: the high dimensionality of the feature space. Many term selection methods have been proposed in the literature. In this paper, we revise the most popular methods used as term selection techniques with seven different versions of Naive Bayes spam filters.

Downloads

Download data is not yet available.

Published

2011-04-11

Issue

Section

Tutorial

How to Cite

[1]
2011. Dimensionality Reduction Applied to Spam Filtering using Bayesian Classifiers. Brazilian Journal of Applied Computing. 3, 1 (Apr. 2011), 16–29. DOI:https://doi.org/10.5335/rbca.2013.1317.