Modelos evolutivos baseados em grânulos e nuvens de dados para classificação online de spam

Pouças, Ricardo de Paula

Use this identifier to cite or link to this item: http://repositorio.ufla.br/jspui/handle/1/39264

Title:	Modelos evolutivos baseados em grânulos e nuvens de dados para classificação online de spam [Evolving models based on granules and data clouds for online spam classification]
Authors:	Leite, Daniel Furtado; Gouvêa Junior, Maury Meirelles; Rodríguez, Demóstenes Zegarra
Keywords:	Detecção de spam; Sistemas inteligentes evolutivos; Sistemas Fuzzy; Agrupamento incremental; Nuvem de dados; Spam detection; Evolving intelligent methods; Fuzzy systems; Incremental clustering; Data clouds
Issue date:	11-Feb-2020
Publisher:	Universidade Federal de Lavras
Citation:	POUÇAS, R. de P. Modelos evolutivos baseados em grânulos e nuvens de dados para classificação online de spam. 2020. 101 p. Dissertação (Mestrado em Engenharia de Sistemas e Automação)-Universidade Federal de Lavras, Lavras, 2017.
Abstract:	Sending and receiving e-mail has become a concern because the tool is used to disseminate malicious code aimed at damaging computer systems or stealing information. Sending a message without the user's permission is called spam. Several techniques exist to disseminate spam. They exploit either the content of the message or some weakness of the classification system that intercepts messages. Classification systems able to self-adapt over time are rare. Adaptation is needed because spam varies over time as a consequence of the application of several message-masking techniques. Moreover, classification models that handle large volumes of data with low computational cost are of interest. Evolving Intelligent Systems are able to adapt their parameters and structure in response to changes in a stream of data extracted from e-mails. This work uses TEDA (Typicality and Eccentricity based Data Analytics) and FBeM (Fuzzy Set-Based Evolving Modeling) for online unsupervised classification of spam. TEDA is based on the concepts of data clouds, eccentricity and typicality. Unlike conventional clusters, TEDA clouds do not have a specific geometric shape. FBeM uses fuzzy granular objects to summarize information extracted from a data stream. FBeM is based on the concept of coverage (granulation) of the data space. Its rules are linguistically interpretable, which makes them useful to support decision making. TEDA and FBeM are compared in terms of classification error, processing speed and parsimony. For dimensionality reduction, ACO (Ant Colony Optimization) is employed. ACO is inspired by the intelligent behavior of ants. The feature selection problem is represented as a graph, where the optimum path minimizes an objective function and suggests the most discriminative features for spam classification. A dataset of 25745 samples (7830 spam and 17915 legitimate e-mails) was created. Each sample is described by 711 features extracted from an e-mail server.
URI:	http://repositorio.ufla.br/jspui/handle/1/39264
Appears in collections:	Engenharia de Sistemas e automação (Dissertações)
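
The abstract's notions of eccentricity and typicality can be illustrated with a minimal sketch. It assumes the commonly published TEDA formulation with Euclidean distance, in which the eccentricity of a sample x after k samples is 1/k + ||mu - x||^2 / (k * var), with the mean and variance updated recursively, and typicality is 1 minus eccentricity. The class name `TedaCloud` and its methods are hypothetical names chosen for this illustration; this is not the dissertation's implementation, and it handles a single data cloud only.

```python
class TedaCloud:
    """Recursive statistics for one data cloud (illustrative, single cloud only)."""

    def __init__(self):
        self.k = 0                # number of samples seen so far
        self.mean = None          # recursive mean vector
        self.mean_sq_norm = 0.0   # recursive mean of squared sample norms

    def update(self, x):
        """Absorb sample x and return its eccentricity under the updated statistics."""
        self.k += 1
        if self.mean is None:
            self.mean = [float(v) for v in x]
        else:
            self.mean = [m + (v - m) / self.k for m, v in zip(self.mean, x)]
        sq_norm = sum(v * v for v in x)
        self.mean_sq_norm += (sq_norm - self.mean_sq_norm) / self.k
        return self.eccentricity(x)

    def variance(self):
        """Scalar variance: mean squared norm minus squared norm of the mean."""
        return self.mean_sq_norm - sum(m * m for m in self.mean)

    def eccentricity(self, x):
        """Eccentricity of x with respect to the current statistics."""
        var = self.variance()
        if var <= 1e-12:
            # Assumption: with zero spread, only the 1/k term remains.
            return 1.0 / self.k
        dist_sq = sum((m - v) ** 2 for m, v in zip(self.mean, x))
        return 1.0 / self.k + dist_sq / (self.k * var)

    def typicality(self, x):
        """Typicality is the complement of eccentricity."""
        return 1.0 - self.eccentricity(x)
```

Under this formulation the eccentricities of all absorbed samples sum to 2, so a sample far from the others takes a large share of that total and a correspondingly low typicality. No geometric cluster shape is assumed; only the running mean and variance are stored.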

Files in this item:

File	Description	Size	Format
DISSERTAÇÃO_Modelos evolutivos baseados em grânulos e nuvens de dados para classificação online de spam.pdf		2.25 MB	Adobe PDF
