Please use this identifier to cite or link to this item:
http://repositorio.ufla.br/jspui/handle/1/33255
Title: | Evaluation of genome similarities: a wavelet-domain approach |
Other Titles: | Avaliação da similaridade de genomas: uma abordagem pelo domínio de ondaletas |
Authors: | Sáfadi, Thelma Silva, Alessandra Querino da Bueno Filho, Júlio Sílvio de Souza Nascimento, Moysés Lima, Renato Ribeiro de |
Keywords: | Transformada não-decimada de ondaletas Mycobacterium tuberculosis Método de variância agregada Agrupamento de genomas Non-decimated wavelet transform Method of aggregate variance Clustering of genomes |
Issue Date: | 22-Mar-2019 |
Publisher: | Universidade Federal de Lavras |
Citation: | FERREIRA, L. M. Evaluation of genome similarities: a wavelet-domain approach. 2018. 89 p. Tese (Doutorado em Estatística e Experimentação Agropecuária)-Universidade Federal de Lavras, Lavras, 2019. |
Abstract: | The wavelets arised to solve the problems when you work with non-stationary data, signals contaminated with noise, large data volume, detection of self-similarity, separation of components in a signal, among others. The technique called the “wavelet transform” corresponds to one of its main characteristics, because the data (signal, image or function) can be decomposed in the frequency domain as well as in the time domain. The low frequencies (larger scales) correspond to a global information, which generally extends over the analyzed data, while the high frequencies (reduced scales) correspond to more detailed information, which lasts a relatively short period of time. The present work was divided in the presentation of three different genome cluster analysis techniques using wavelets. These techniques were employed in ten sequences of the Mycobacterium tuberculosis genome. The first technique used to grouping the of genomes was the use of energy (variance). This energy was obtained by summing the detail coefficients by the square of each level of decomposition (five levels) of the original signal by means of the Daubechies wavelet with four null moments. As a result, the formation of 3 distinct groups was found. The second technique approached the junction of wavelets with the methodology Elastic net. In this analysis, after obtaining the levels of decomposition using wavelets, the Elastic net was applied at each level, where it was possible to verify the formation of the groups. The results showed that levels 4 and 5 were the ones that presented the best formation of the groups, being found three different groups. The third technique involved the combination of wavelets with the Hurst exponent. From the results obtained of the levels of decomposition by wavelets, using the same configurations of the first and second techniques previously described, the Hurst exponent was calculated for each level of decomposition, using five methods of estimation of the Hurst exponent. Each method presented different group formations, but the method that presented the similar results according to the two previous techniques was the method of aggregate variance. |
URI: | http://repositorio.ufla.br/jspui/handle/1/33255 |
Appears in Collections: | Estatística e Experimentação Agropecuária - Doutorado (Teses) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
ARTIGO_Evaluation of genome similarities - a wavelet-domain approach.pdf | 4,89 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.