Please use this identifier to cite or link to this item: http://hdl.handle.net/10773/27687
Title: Statistical, computational and visualization methodologies to unveil gene primary structure features
Author: Pinheiro, M.
Afreixo, V.
Moura, G.
Freitas, A.
Santos, M. A. S.
Oliveira, J. L.
Keywords: Bioinformatics software
Codon context
Codon bias
Contingency tables
Residual analysis
Cluster analysis
Issue Date: 2006
Publisher: Schattauer
Abstract: Gene sequence features such as codon bias, codon context, and codon expansion (e.g. trinucleotide repeats) can be better understood at the genomic scale level by combining statistical methodologies with advanced computer algorithms and data visualization through sophisticated graphical interfaces. This paper presents the ANACONDA system, a bioinformatics application for gene primary structure analysis. Codon usage tables using absolute metrics and software for multivariate analysis of codon and amino acid usage are available in public databases. However, they do not provide easy computational and statistical tools to carry out detailed gene primary structure analysis on a genomic scale. We propose the usage of several statistical methods--contingency table analysis, residual analysis, multivariate analysis (cluster analysis)--to analyze the codon bias under various aspects (degree of association, contexts and clustering). The developed solution is a software application that provides a user-guided analysis of codon sequences considering several contexts and codon usage on a genomic scale. The utilization of this tool in our molecular biology laboratory is focused on particular genomes, especially those from Saccharomyces cerevisiae, Candida albicans and Escherichia coli. In order to illustrate the applicability and output layouts of the software these species are herein used as examples. The statistical tools incorporated in the system are allowing to obtain global views of important sequence features. It is expected that the results obtained will permit identification of general rules that govern codon context and codon usage in any genome. Additionally, identification of genes containing expanded codons that arise as a consequence of erroneous DNA replication events will permit uncovering new genes associated with human disease.
Peer review: yes
URI: http://hdl.handle.net/10773/27687
DOI: 10.1267/METH06020163
ISSN: 0026-1270
Appears in Collections:CESAM - Artigos
DETI - Artigos
DBio - Artigos
DMat - Artigos
IEETA - Artigos

Files in This Item:
File Description SizeFormat 
Pinheiro et al. 2006.pdf1.37 MBAdobe PDFView/Open


FacebookTwitterLinkedIn
Formato BibTex MendeleyEndnote Degois 

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.