Evaluation and Comparison of Concept Based and N-Grams Based Text Clustering Using SOM
| dc.creator | Amine, Abdelmalek | |
| dc.creator | Elberrichi, Zakaria | |
| dc.creator | Simonet, Michel | |
| dc.creator | Malki, Mimoun | |
| dc.date | 2008-03-01 | |
| dc.date.accessioned | 2017-08-01T21:08:39Z | |
| dc.date.available | 2017-08-01T21:08:39Z | |
| dc.date.issued | 2017-08-01 | |
| dc.description | With the great and rapidly growing number of documents available in digital form (Internet, library, CD-Rom…), the automatic classification of texts has become a significant research field and a fundamental task in document processing. This paper deals with unsupervised classification of textual documents also called text clustering using Self-Organizing Maps of Kohonen in two new situations: a conceptual representation of texts and a representation based on n-grams, instead of a representation based on words. The effects of these combinations are examined in several experiments using 4 measurements of similarity. The Reuters-21578 corpus is used for evaluation. The evaluation was done by using the F-measure and the entropy. | |
| dc.format | application/pdf | |
| dc.identifier | http://www.dcc.ufla.br/infocomp/index.php/INFOCOMP/article/view/203 | |
| dc.identifier.citation | AMINE, A.; ELBERRICHI, Z.; SIMONET, M.; MALKI, M. Evaluation and Comparison of Concept Based and N-Grams Based Text Clustering Using SOM. INFOCOMP Journal of Computer Science, Lavras, v. 7, n. 1, p. 27-35, Mar. 2008. | |
| dc.identifier.uri | https://repositorio.ufla.br/handle/1/14967 | |
| dc.publisher | Universidade Federal de Lavras | |
| dc.relation | http://www.dcc.ufla.br/infocomp/index.php/INFOCOMP/article/view/203/188 | |
| dc.source | INFOCOMP; Vol 7 No 1 (2008): March, 2008; 27-35 | |
| dc.source | 1982-3363 | |
| dc.source | 1807-4545 | |
| dc.subject | Text clustering | |
| dc.subject | Self-Organizing Maps of Kohonen | |
| dc.subject | N-grams | |
| dc.subject | Concept | |
| dc.subject | Similarity | |
| dc.subject | Reuters21578 | |
| dc.title | Evaluation and Comparison of Concept Based and N-Grams Based Text Clustering Using SOM | |
| dc.type | info:eu-repo/semantics/article | |
| dc.type | info:eu-repo/semantics/publishedVersion |
