Please use this identifier to cite or link to this item:
http://repositorio.ufla.br/jspui/handle/1/59309
Title: | Transformada de wavelet discreta não decimada para o agrupamento de genomas de vírus das famílias coronaviridae e paramyxoviridae |
Other Titles: | Undecimated discrete wavelet transform for the clustering of virus genomes of the coronaviridae and paramyxoviridae families |
Authors: | Sáfadi, Thelma Ferreira, Leila Maria Guimarães, Paulo Henrique Sales Alencar, Airlane Pereira Herval , Ana Paula Festucci de |
Keywords: | Transformada de Wavelet Genomas virais Agrupamento de dados Coronaviridae Paramyxoviridae Vírus Análise de dados genômicos Viroses respiratórias Wavelet Transform Viral Genomes Data Clustering Viruses Genomic data analysis |
Issue Date: | 31-Aug-2023 |
Publisher: | Universidade Federal de Lavras |
Citation: | ERNESTO, Dulcídia Carlos Guezimane. Transformada de wavelet discreta não decimada para o agrupamento de genomas de vírus das famílias coronaviridae e paramyxoviridae. 2024. 111p. Tese (Doutorado em Estatística e Experimentação Agropecuária) - Universidade Federal de Lavras, 2023. |
Abstract: | This work aimed to implement two forms of analysis of sequence similarities of two virus families, under the wavelet domain. Wavelets are commonly used when working with an extensive non-stationary database. The wavelet transform technique works with data in real time, and allows the time series to be decomposed into levels, thus allowing at each level of decomposition to increase the level of detail in the series, and thus observe details omissions, which cannot be observed in the original series. After decomposing the GC content of each of the sequences under study, two different forms of grouping were implemented in order to verify sequences with some level of similarity. Cluster analysis was carried out using penalized regression in the domain of lasso, ridge and elastic net penalties, and on the other hand, we also used the Hurst exponent implemented through 5 different techniques, namely: peng method, R analysis /S, aggregate variance, differentiated aggregate variance and by the method of absolute moments. At the end of the study, it can be concluded that weaker variants of the Coronaviridae family are associated with strains of the Paramyxoviridae family. And on the other hand, the elastic net (from the 1st to the 3rd level), the absolute moments method and the differentiated aggregate variance method performed better in relation to the other methodologies. |
URI: | http://repositorio.ufla.br/jspui/handle/1/59309 |
Appears in Collections: | Estatística e Experimentação Agropecuária - Doutorado (Teses) |
Files in This Item:
This item is licensed under a Creative Commons License