Data clustering based on principal curves

Moraes, Elson Claudio Correa; Ferreira, Danton Diego; Vitor, Giovani Bernardes; Barbosa, Bruno Henrique Groenner

Article

Publisher

Springer

Abstract

In this contribution we present a new method for data clustering based on principal curves. Principal curves are a nonlinear generalization of principal component analysis and may also be regarded as continuous versions of 1-D self-organizing maps. The proposed method uses the k-segment algorithm to extract the principal curves. It then divides the principal curves into two or more curves, according to the number of clusters defined by the user. Next, the distance between the data points and the generated curves is calculated, and each point is classified according to the smallest distance found. The method was applied to nine databases with different dimensionality and number of classes. The results were compared with three clustering algorithms: the k-means algorithm and the 1-D and 2-D self-organizing map algorithms. Experiments show that the method is suitable for clusters with elongated and spherical shapes and that it achieved significantly better results on some data sets than the other clustering algorithms used in this work.
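
The final assignment step described in the abstract (each point goes to the curve at the smallest distance) can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the principal curves have already been extracted and split, and that each one is represented as a polyline (an ordered array of vertices), which matches the piecewise-linear output of a k-segment style algorithm. The function names, the two hand-written curves, and the sample points are all hypothetical.

```python
import numpy as np


def point_to_segment_distance(p, a, b):
    """Euclidean distance from point p to the line segment from a to b."""
    ab = b - a
    denom = float(np.dot(ab, ab))
    if denom == 0.0:
        # Degenerate segment: both end points coincide.
        return float(np.linalg.norm(p - a))
    # Project p onto the line through a and b, then clamp to the segment.
    t = np.clip(np.dot(p - a, ab) / denom, 0.0, 1.0)
    return float(np.linalg.norm(p - (a + t * ab)))


def point_to_polyline_distance(p, vertices):
    """Smallest distance from p to any segment of a polyline."""
    return min(
        point_to_segment_distance(p, vertices[i], vertices[i + 1])
        for i in range(len(vertices) - 1)
    )


def assign_to_nearest_curve(points, curves):
    """Label each point with the index of the closest polyline."""
    labels = [
        int(np.argmin([point_to_polyline_distance(p, c) for c in curves]))
        for p in points
    ]
    return np.array(labels)


# Hypothetical input: two already-split curves, one per cluster.
curves = [
    np.array([[0.0, 0.0], [1.0, 0.0], [2.0, 0.5]]),
    np.array([[0.0, 3.0], [1.0, 3.5], [2.0, 3.0]]),
]
points = np.array([[0.5, 0.2], [1.5, 3.1], [1.8, 0.6], [0.1, 2.7]])

labels = assign_to_nearest_curve(points, curves)
print(labels)
```

The sketch deliberately leaves out curve extraction and the rule for dividing the curve into sub-curves, since the abstract does not specify them.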

Citation

MORAES, E. C. C. et al. Data clustering based on principal curves. Advances in Data Analysis and Classification, [S.l.], v. 14, p. 77-96, 2020.

URI

https://repositorio.ufla.br/handle/1/39543
https://link.springer.com/article/10.1007/s11634-019-00363-w

Collections

DEG - Articles published in journals
DAT - Articles published in journals
