info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
info:eu-repo/semantics/publishedVersion
Evaluation and Comparison of Concept Based and N-Grams Based Text Clustering Using SOM
Carregando...
Notas
Data
Orientadores
Editores
Coorientadores
Membros de banca
Título da Revista
ISSN da Revista
Título de Volume
Editor
Universidade Federal de Lavras
Faculdade, Instituto ou Escola
Departamento
Programa de Pós-Graduação
Agência de fomento
Tipo de impacto
Áreas Temáticas da Extenção
Objetivos de Desenvolvimento Sustentável
Dados abertos
Resumo
Abstract
Descrição
With the great and rapidly growing number of documents available in digital form (Internet, library, CD-Rom…), the automatic classification of texts has become a significant research field and a fundamental task in document processing. This paper deals with unsupervised classification of textual documents also called text clustering using Self-Organizing Maps of Kohonen in two new situations: a conceptual representation of texts and a representation based on n-grams, instead of a representation based on words. The effects of these combinations are examined in several experiments using 4 measurements of similarity. The Reuters-21578 corpus is used for evaluation. The evaluation was done by using the F-measure and the entropy.
Área de concentração
Agência de desenvolvimento
Palavra chave
Marca
Objetivo
Procedência
Impacto da pesquisa
Resumen
Palavras-chave
ISBN
DOI
Citação
AMINE, A.; ELBERRICHI, Z.; SIMONET, M.; MALKI, M. Evaluation and Comparison of Concept Based and N-Grams Based Text Clustering Using SOM. INFOCOMP Journal of Computer Science, Lavras, v. 7, n. 1, p. 27-35, Mar. 2008.
