Please use this identifier to cite or link to this item: http://repositorio.ufla.br/jspui/handle/1/36752
Full metadata record
DC Field | Value | Language
dc.creator | Nascimento, Moysés | -
dc.creator | Silva, Fabyano Fonseca e | -
dc.creator | Sáfadi, Thelma | -
dc.creator | Nascimento, Ana Carolina Campana | -
dc.creator | Ferreira, Talles Eduardo Maciel | -
dc.creator | Barroso, Laís Mayara Azevedo | -
dc.creator | Azevedo, Camila Ferreira | -
dc.creator | Guimarães, Simone Eliza Faccione | -
dc.creator | Serão, Nick Vergara Lopes | -
dc.date.accessioned | 2019-09-09T19:11:19Z | -
dc.date.available | 2019-09-09T19:11:19Z | -
dc.date.issued | 2017-07-17 | -
dc.identifier.citation | NASCIMENTO, M. et al. Independent Component Analysis (ICA) based-clustering of temporal RNA-seq data. PLoS One, [S.l.], v. 12, n. 7, 2017. DOI: 10.1371/journal.pone.0181195. | pt_BR
dc.identifier.uri | http://repositorio.ufla.br/jspui/handle/1/36752 | -
dc.description.abstract | Gene expression time series (GETS) analysis aims to characterize sets of genes according to their longitudinal patterns of expression. Because of the large number of genes evaluated in GETS analysis, a useful strategy for summarizing biological functional processes and regulatory mechanisms is to cluster genes that present similar expression patterns over time. Traditional clustering methods usually ignore the challenges in GETS, such as the lack of data normality and the small number of temporal observations. Independent Component Analysis (ICA) is a statistical procedure that uses a transformation to convert raw time series data into sets of values of independent variables, which can be used for cluster analysis to identify sets of genes with similar temporal expression patterns. ICA allows clustering of short series of distribution-free data while accounting for the dependence between subsequent time points. Using simulated and real temporal RNA-seq data sets (the real data comprising four libraries of two pig breeds at 21, 40, 70 and 90 days of gestation), we present a methodology (ICAclust) that jointly considers independent component analysis (ICA) and a hierarchical method for clustering GETS. We compare ICAclust results with those obtained for K-means clustering. ICAclust presented, on average, an absolute gain of 5.15% over the best K-means scenario. Considering the worst scenario for K-means, the gain was 84.85% when compared with the best ICAclust result. For the real data set, genes were grouped into six distinct clusters with 89, 51, 153, 67, 40, and 58 genes, respectively. In general, the six clusters presented very distinct expression patterns. Overall, the proposed two-step clustering method (ICAclust) performed well compared to K-means, a traditional method used for cluster analysis of temporal gene expression data. In ICAclust, genes with similar expression patterns over time were clustered together. | pt_BR
dc.language | en_US | pt_BR
dc.publisher | PLOS | pt_BR
dc.rights | open access | pt_BR
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | *
dc.source | PLoS One | pt_BR
dc.subject | Gene expression | pt_BR
dc.subject | Simulation | pt_BR
dc.subject | Modeling | pt_BR
dc.subject | Clustering algorithms | pt_BR
dc.subject | Statistical data | pt_BR
dc.subject | RNA sequencing | pt_BR
dc.subject | Principal component analysis | pt_BR
dc.subject | RNA synthesis | pt_BR
dc.subject | Swine | pt_BR
dc.subject | Independent component analysis | pt_BR
dc.title | Independent Component Analysis (ICA) based-clustering of temporal RNA-seq data | pt_BR
dc.type | Article | pt_BR
Appears in Collections: DES - Artigos publicados em periódicos

Files in This Item:
File | Description | Size | Format
ARTIGO_Independent Component Analysis (ICA) based-clustering of temporal RNA-seq data.pdf | | 3.07 MB | Adobe PDF


This item is licensed under a Creative Commons License.