Please use this identifier to cite or link to this item: http://repositorio.ufla.br/jspui/handle/1/10285
Title: Caracterização de sítios polimórficos e sequências repetitivas, e estabelecimento de coleção nuclear de caiaué [Elaeis oleifera (Kunth) Cortés]
Authors: Souza Júnior, Manoel Teixeira
Formighieri, Eduardo Fernandes
Alves, Alexandre Alonso
Laviola, Bruno Galveas
Keywords: Palma de óleo
Palm oil
Biologia computacional
Bioinformatics
Genotipagem por sequenciamento
Genotyping-by-sequencing (GBS)
Conservação genética
Genetic conservation
Issue Date: 26-Aug-2015
Citation: FERREIRA FILHO, J. A. Caracterização de sítios polimórficos e sequências repetitivas, e estabelecimento de coleção nuclear de caiaué [Elaeis oleifera (Kunth) Cortés]. 2015. 112 p. Dissertação (Mestrado em Biotecnologia Vegetal) - Universidade Federal de Lavras, Lavras, 2015.
Abstract: The objectives of this study were to characterize polymorphic sites and repetitions and establish a core collection for American oil palm (Elaeis oleifera). The genome draft used in this study had a 130X coverage by Illumina Hiseq 2000 and was compared with the publicly available draft of E. oleifera, as well as with the also publicly available genome of E. guineensis, through Nucmer software. In silico search was made to identify regions of tandem repeats and transposable elements in this genome draft. A bank of sequences, generated by DArTSeq platform for genotypes of E. oleifera, was mapped against the public E. guineensis genome using BWA software. The SAMtools software package was used to identify SNPs. The gene models of date palm (Phoenix dactylifera) were mapped on the genome of American oil palm. For the design of core collections, we used the strategy of maximizing the diversity (M) with 500 loci SNPs markers based on genotyping by sequencing. 68.24 and 72.83% of the draft analyzed was aligned against the E. oleifera genomes and E. guineensis, respectively. A total of 328,879 and 618,284 of tandem repeats and transposable elements loci were identified, respectively. It was possible to characterize 17,412/2,370 PAVs/SNPs, and 25,203 gene models, with single position in the genome. Core collections models were obtained with 37, 55, 109, 127, 138, 276, 26, and 16 individuals. As a result of the optimal adjustment of the validated parameters maintained while taking the least number of accessions, the model of 109 individuals (20% of entire collection) was chosen as the ideal to establish the core collection of E. oleifera. The draft of E. oleifera generated by Embrapa sampled much of the genomes to which it was compared, representing much of this highly complex genome with an affordable cost of sequencing technology. More than half (55%) of the draft consists of repetitions, especially retrotransposons. The identification of these regions rich on repetitive sequences will contribute to adjustments in the strategy to generate to further sequence this genome. The set of PAVs/SNPs mapped markers provide a substantially uniform coverage throughout the genome and gene regions of E. guineensis. The core collection model generated in this study will allow an improvement of the strategy to more efficiently conserve the germoplasm of American oil palm.
URI: http://repositorio.ufla.br/jspui/handle/1/10285
Appears in Collections:Biotecnologia Vegetal - Mestrado (Dissertações)

Files in This Item:
File Description SizeFormat 
DISSERTACAO_Caracterização de sítios polimórficos e sequências repetitivas,.pdf1,58 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.