A apresentação está carregando. Por favor, espere

A apresentação está carregando. Por favor, espere

Identification of new expressed sequence tags covering putative splicing regions in Drosophila melanogaster. Marco Aurélio Valtas Cunha 1, Valeria Valente.

Apresentações semelhantes


Apresentação em tema: "Identification of new expressed sequence tags covering putative splicing regions in Drosophila melanogaster. Marco Aurélio Valtas Cunha 1, Valeria Valente."— Transcrição da apresentação:

1 Identification of new expressed sequence tags covering putative splicing regions in Drosophila melanogaster. Marco Aurélio Valtas Cunha 1, Valeria Valente 1, Josane de Freitas Sousa 1, Nadia Monesi 2, Rafaela Martins Maia 1, Daniela Dover de Araujo 1, Wilson Araujo da Silva Jr 3, Marco Antonio Zago 3, Waleska K. Martins 4, Luis F. L. Reis 4, Emmanuel Dias Neto 4, Sandro José de Souza 4, Andrew John George Simpson 4, Ricardo Guelerman Pinheiro Ramos 1, Enilza Maria Espreafico 1, Maria Luisa Paçó-Larson 1 1 Departamento de Biologia Celular, Molecular e de Bioagentes Patogênicos, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto 14049-900, SP, Brazil, 2 Departamento de Análises Clínicas, Toxicológicas e Bromológicas, Faculdade de Ciências Farmacêuticas de Ribeirão, Universidade de São Paulo, CEP 14040-903, Ribeirão Preto, Brasil, 3 Departamento de Clínica Médica, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto 3900 14049-900, SP, Brazil, 4 Ludwig Institute for Cancer Research, Rua Professor Antonio Prudente, 109, São Paulo 01509-010, SP, Brazil. ORESTES Genomic Scaffolds Genomic Scaffolds split against genomic Full-length cDNAs Full-length cDNAs do not match full-lenght cDNA do not match full-lenght cDNA split against cDNA split against cDNA dbEST new isoform of characterized cDNA new isoform of characterized cDNA newly validated newly validated new isoform of non-characterized genes new isoform of non-characterized genes Figure 1. The ORESTES were used as queries in BLASTs against the genomic scaffold sequences, full-length cDNAs and dbEST databases. This analysis resulted in the identification of 88 (non-redundant) new expressed sequence tags covering putative splicing regions: 2 corresponding to new isoforms of characterized genes, 7 corresponding to new isoforms of predicted but non-characterized genes and 79 validating the gene region for the first time. Representative diagrams of sequence alignments between expressed and genomic sequences flanking novel splicing sites identified by ORESTES Abstract One important lesson learned from projects on complex genomes is that the definition of genes from genome sequences alone using computational predictions is subject to serious limitations. For instance, untranslated regions are refractory to this kind of analysis and intron/exon structures are poorly predicted by these means. Therefore, the immediate challenge in the post-genomic era is to identify transcribed regions in the genome. In order to generate sequence information on D.melanogaster transcriptome, we have generated over 10.000 ESTs, using the Open Reading Frame Expressed Sequence Tags (ORESTES) methodology. Expressed sequence tag (EST) analysis is an important tool for identifying transcription units but is also subject to errors. EST information can be misleading if, for example, it derives from immature mRNA, genomic contamination or spurious transcription of intergenic regions. The presence of a splice site upon alignment with the genomic sequence is a criterion that increases the EST data confidence. We have used this criterion to analyze the D. melanogaster ORESTES databank. Of 9,157 ORESTES (after excluding sequences derived from rRNA, mtDNA and yeast DNA), 2,702 have shown interrupted alignment with the genomic sequence. To identify among these the ones not yet represented in public databases, the 2,702 ORESTES were searched by BLAST against a set of Drosophila full-length cDNA sequences and ESTs present in the NCBI databanks. Using a Pearl script that we developed, complemented by visual inspection of the alignments, we have identified 88 ORESTES representing new expressed sequences in Drosophila. Among those, two are possibly derived from new isoforms of characterized genes (coding for calreticulin and elongation factor-2); seven match existing ESTs, but show a different splicing pattern and therefore could be derived from different isoforms of uncharacterized genes. The genomic DNA surrounding these novel ORESTES are being analyzed by examining the sequence alignments with Drosophila full-length cDNAs or ESTs present in the NCBI databanks, and BDGP/CG GadFly predicted genes, in an attempt to define in silico new transcription units. This information will lay the basis for the experimental validation of new transcripts. Financial Support: Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) and Ludwig Institute for Cancer Research. Figure 2. Genomic sequences flanking the ORESTES alignment regions were used as queries in BLAST searches against a set of Drosophila protein-coding sequences compiled by Ashburner et al. (http: //www.fruitfly.org/sequence/download.htm/), Drosophila ESTs present in the dbEST database, ORESTES database and the predicted transcripts compiled in the GadFly (release 2) database. genomic sequence, ESTs from dbEST, ORESTES, predicted transcripts. Annotated functions of the predicted genes are: (A) actin biding, (B) cell adhesion, (C) enzyme, (D) transporter, (E) unknown, (F) polynucleotide adenylyltransferase, (G) phosphatase. (H) shows an unnanotated region. A B C D E F G H Analysis flowchart of the ORESTES which split upon alignment with the Drosophila genomic sequence


Carregar ppt "Identification of new expressed sequence tags covering putative splicing regions in Drosophila melanogaster. Marco Aurélio Valtas Cunha 1, Valeria Valente."

Apresentações semelhantes


Anúncios Google