Utilization of different methodologies for the characterization of Hb Hasharon heterozygotes |
Genet. Mol. Res. 5 (1): 1-6 (2006)
|
|
|
Authors |
|
A.R. Chinelato-Fernandes, C.F. Mendiburu and C.R. Bonini-Domingos |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. Hb Hasharon has an electrophoretic mobility similar to that of Hb S in cellulose acetate and a mobility between Hb S and C at acid pH. In high-performance liquid chromatography, Hb Hasharon shows a distinct chromatographic profile and retention time. The origin of this variant is a mutation in codon 47 (GAC ® CAC) of the a2-globin gene, resulting in the replacement of asparagine by histidine during the translation process. Ten blood samples from individuals suspected of being Hb Hasharon carriers were analyzed. In addition to classic laboratory tests and high-performance liquid chromatography, molecular analysis by polymerase chain reaction with restriction fragment length polymorphism designed in the laboratory was performed to confirm this mutation. The study of these cases showed that a combination of classical and molecular methodologies is necessary in the diagnosis of hemoglobinopathies for a correct hemoglobin mutant identification. The accurate identification of hemoglobin variants is essential for genetic counseling and choice of therapy.
|
The effect of simulated censored data on estimates of heritability of longevity in the Thoroughbred racing industry |
Genet. Mol. Res. 5 (1): 7-15 (2006)
|
|
|
Authors |
|
Eleanor M. Burns, Richard M. Enns and Dorian J. Garrick |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. We examined the impact of censored data on estimates of heritability of longevity. Longevity, defined as the length of productive racing life of an individual, is influenced by many factors. A simulated data set, modelled on the Irish Thoroughbred industry, was used to estimate heritabilities of longevity. Several scenarios representing various levels of censoring of performance data were created. The heritability of longevity was estimated for each scenario and compared to the estimated heritability of 0.120 for the complete data set. It was found that the estimates of heritability (0.107, 0.106, 0.082) were biased downwards with (10, 20, and 25%, respectively) censoring of data from poor-performing animals. We found that for a complete reporting it is necessary to reduce bias in the estimation of heritability of longevity.
|
Frequencies of phenylalanine hydroxylase mutations I65T, R252W, R261Q, R261X, IVS10nt11, V388M, R408W, Y414C, and IVS12nt1 in Minas Gerais, Brazil |
Genet. Mol. Res. 5 (1): 16-23 (2006)
|
|
|
Authors |
|
Luciana Lara dos Santos, Myrian de Castro Magalhães, Adriana de Oliveira Reis, Ana Lúcia Pimenta Starling, José Nélio Januário, Cleusa Graça da Fonseca, Marcos José Burle de Aguiar and Maria Raquel Santos Carvalho |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. In order to determine the phenylketonuria (PKU) mutation spectrum in the population of Minas Gerais State, Brazil, 78 unrelated PKU patients found by the neonatal screening program from 1993 to 2003 were tested for nine phenylalanine hydroxylase mutations. These mutations were selected due to their high frequencies in other Brazilian populations and in Portugal, where the largest contingent of the Caucasian component of the Brazilian population originated from. The most frequent mutations were V388M (21%), R261Q (16%), IVS10nt11 (13.4%), I65T (5.7%), and R252W (5%). The frequencies of the other four mutations (R261X, R408W, Y414C, and IVS12nt1) did not reach 2%. By testing these nine mutations, we were able to identify 64% of the PKU alleles in our sample. V388M frequency was higher than in any other known population and almost three times larger than that observed in Portugal, probably reflecting genetic drift. The mutation profile, as well as the relative frequency of the different mutations, suggest that the Minas Gerais population more closely resembles that of Portugal than do the other Brazilian populations that have already been tested.
|
Mining ORESTES no-match database: can we still contribute to cancer transcriptome? |
Genet. Mol. Res. 5 (1): 24-32 (2006)
|
|
|
Authors |
|
Rogério da Silva Fonseca, Dirce Maria Carraro and Helena Brentani |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. The Human Cancer Genome Project generated about 1 million expressed sequence tags by the ORESTES method, principally with the aim of obtaining data from cancer. Of this total, 341,680 showed no similarity with sequences in the public transcript databases, referred to as “no-match”. Some of them represent low abundance or difficult to detect human transcripts, but part of these sequences represent genomic contamination or immature mRNA. We performed a bioinformatics pipeline to determine the novelty of ORESTES “no-match” datasets from prostate or breast tissues. We started with 14,908 clusters mapped on the human genome. A total of 2226 clusters originating from more than two libraries or singletons with gaps upon genome alignment were selected. Ninety-four clusters with canonical splice sites representing the most stringent criteria to be considered a gene were subjected to manual inspection regarding genomic hits. Of the manually inspected clusters, 49.6% contained new sequences where 42.2% were probable low-expression alternative forms of the characterized genes and 7.4% unpredicted genes. RT-PCR followed by sequencing was performed to validate the largest spliced sequence from 8 clusters, resulting in the confirmation of five sequences as true human transcript fragments. Some of them were differentially expressed between tumor and normal tissue by an in silico analysis. We can conclude that after clean up of the no-match dataset, we still have about 939 new exons and 165 unpredicted genes that could complete the prostate or breast transcriptome.
|
The time has come: a new scene for PKU treatment |
Genet. Mol. Res. 5 (1): 33-44 (2006)
|
|
|
Authors |
|
Luciana Lara dos Santos, Myrian de Castro Magalhães, José Nélio Januário, Marcos José Burle de Aguiar and Maria Raquel Santos Carvalho |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. Phenylketonuria (PKU) is one of the few genetic diseases in which mental retardation can be prevented. Hence, diagnosis and treatment must be established early. PKU treatment consists of a phenylalanine-restricted diet supplemented with a phenylalanine-free mixture of amino acids. However, it is difficult to adhere to this diet. In the last decade, a better comprehension of the biochemistry, genetics and molecular basis of the disease, as well as the need for easier treatment, led to the development of several new therapeutic strategies for PKU. In the present study, we evaluated these new therapeutic options in terms of theoretical basis, methodologies, efficacy, and costs.
|
Micronucleus test on gas station attendants |
Genet. Mol. Res. 5 (1): 45-54 (2006)
|
|
|
Authors |
|
Cibelem Iribarrem Benites, Lílian Lund Amado, Rita Aloma Packeiser Vianna and Maria da Graça Martino-Roth |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. In the present study, the micronucleus test was applied in exfoliated cells of buccal mucosa to assess the mutagenicity risk associated with occupational exposure for gas station attendants. For each individual, 2000 exfoliated buccal cells were analyzed for micronucleus frequency. A highly significant difference was found between exposed and control groups. Likewise, a significant difference was found between these groups regarding the frequency of binucleated and broken egg cells. To determine whether smoking, alcohol habit, age, gender, or working time could exert any additional effect, we determined the frequency of micronuclei and binucleated and broken egg cells amongst exposed and control individuals. The results allowed us to conclude that the individuals studied belong to a risk group and should periodically undergo biological monitoring and appropriate care.
|
Increase in mitochondrial DNA quantity and impairment of oxidative phosphorylation in bovine fibroblast cells treated with ethidium bromide for 15 passages in culture |
Genet. Mol. Res. 5 (1): 55-62 (2006)
|
|
|
Authors |
|
Marcos Roberto Chiaratti and Flávio Vieira Meirelles |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. Bovine fetal fibroblast cells were treated with ethidium bromide at a low concentration for 15 passages in culture to determine its effect on mitochondrial DNA copy number and on cell metabolism. Mitochondrial membrane potential and lactate production were estimated in order to characterize cell metabolism. In addition, mitochondrial DNA ND5 in proportion to a nuclear gene (luteinizing hormone receptor) was determined at the 1st, 2nd, 3rd, 10th, and 15th passages using semi-quantitative PCR amplification. Treated cells showed a lower mitochondrial membrane potential and higher levels of lactate production compared with control cells. However, the mitochondrial DNA/nuclear DNA ratio was higher in treated cells compared with control cells at the 10th and 15th passages. This ratio changed between the 3rd and 10th passages. Despite a clear impairment in mitochondrial function, ethidium bromide treatment did not lead to mitochondrial DNA depletion. It is possible that in response to a lower synthesis of ATP, due to an impairment in oxidative phosphorylation, treated cells develop a mechanism to resist the ethidium bromide effect on mtDNA replication, resulting in an increase in mitochondrial DNA copy number.
|
A novel polymorphic Alu insertion embedded in a LINE 1 retrotransposon in the human X chromosome (DXS225): identification and worldwide population study |
Genet. Mol. Res. 5 (1): 63-71 (2006)
|
|
|
Authors |
|
Rinaldo Wellerson Pereira, Simone Silva dos Santos and Sérgio Danilo Junho Pena |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. We describe a novel polymorphic Alu insertion (DXS225) on the human X chromosome (Xq21.3) embedded into an L1 retrotransposon. The DXS225 polymorphism was genotyped in 684 males from the CEPH Human Genome Diversity Panel. This insertion was found in all regions of the globe, suggesting that it took place before modern humans spread from Africa ca. 100,000 years ago. However, only one Amerindian population (Karitiana) showed this insertion allele, which may have been introduced by European admixture. Thus, it appears likely that the Alu insertion was absent from pre-Columbian America. Analysis of molecular variance worldwide demonstrated that 92.2% of the genetic variance was concentrated within populations. DXS225 is flanked by two microsatellites (DXS8114 and DXS1002), which are 86 kb apart and are in very strong linkage disequilibrium. The combination of a unique event polymorphism on the X chromosome in linkage disequilibrium with two rapidly evolving microsatellites should provide a useful tool for studies of human evolution.
|
Plant microsatellite genotyping with 4-color fluorescent detection using multiple-tailed primers |
Genet. Mol. Res. 5 (1): 72-78 (2006)
|
|
|
Authors |
|
Alexandre Missiaggia and Dario Grattapaglia |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. We extended the concept of fluorescent microsatellite genotyping with a single-universal tailed primer to the simultaneous use of three different tailed primers to allow multiplexed 4-color detection for medium throughput genotyping of plant species. The method was tested on Eucalyptus DNA samples using three forward primer sequences of human microsatellite markers labeled with different fluorescent dyes. The robustness of the method was tested for the simultaneous detection and genetic analysis of microsatellites in a genetic mapping experiment. This method allows reliable and cost-effective genotyping with the same level of multiplexing attained in regular microsatellite fluorescent detection assays. Besides the enhanced quality of the genotypic data provided by the fluorescent detection method when compared to colorimetric ones, the economy brought about by this method becomes greater with an increasing number of microsatellite markers. This method has been particularly useful for genotyping populations of several tropical tree species addressing community-wide population genetics and conservation questions.
|
Ion-exchange chromatography used to isolate a spermadhesin-related protein from domestic goat (Capra hircus) seminal plasma |
Genet. Mol. Res. 5 (1): 79-87 (2006)
|
|
|
Authors |
|
Dárcio Ítalo Alves Teixeira, Luciana Magalhães Melo, Carlos Alberto de Almeida Gadelha, Rodrigo Maranguape Silva da Cunha, Carlos Bloch Jr., Gandhi Rádis-Baptista, Benildo Sousa Cavada and Vicente José de Figueirêdo Freitas |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. Mammalian seminal plasma contains among others, proteins called spermadhesins, which are the major proteins of boar and stallion seminal plasma. These proteins appear to be involved in capacitation and sperm-egg interaction. Previously, we reported the presence of a protein related to spermadhesins in goat seminal plasma. In the present study, we have further characterized this protein, and we propose ion-exchange chromatography to isolate this seminal protein. Semen was obtained from four adult Saanen bucks. Seminal plasma was pooled, dialyzed against distilled water and freeze-dried. Lyophilized proteins were loaded onto an ion-exchange chromatography column. Dialyzed-lyophilized proteins from the main peak of DEAE-Sephacel were applied to a C2/C18 column coupled to an RP-HPLC system, and the eluted proteins were lyophilized for electrophoresis. The N-terminal was sequenced and amino acid sequence similarity was determined using CLUSTAL W. Additionally, proteins from DEAE-Sephacel chromatography step were dialyzed and submitted to a heparin-Sepharose high-performance liquid chromatography. Goat seminal plasma after ion-exchange chromatography yielded 6.47 ± 0.63 mg (mean ± SEM) of the major retained fraction. The protein was designated BSFP (buck seminal fluid protein). BSFP exhibited N-terminal sequence homology to boar, stallion and bull spermadhesins. BSFP showed no heparin-binding capabilities. These results together with our previous data indicate that goat seminal plasma contains a protein that is structurally related to proteins of the spermadhesin family. Finally, this protein can be efficiently isolated by ion-exchange and reverse-phase chromatography.
|
Thesis Abstract Electrophoretic and chromatographic profile for “S-like” hemoglobin (Perfil eletroforético e cromatográfico das hemoglobinas "S-like") |
Genet. Mol. Res. 5 (1): 88 (2006)
|
|
|
Authors |
|
Luciana de Souza Ondei |
|
|
| Full Text HTML | Full Text PDF |
|
Thesis Abstract Hemoglobin and HFE gene polymorphisms in Brazilian populations in regions endemic for malaria (Estudo do polimorfismo de hemoglobinas e do gene HFE em populações brasileiras de áreas endêmicas de malária) |
Genet. Mol. Res. 5 (1): 89-90 (2006)
|
|
|
Authors |
|
Felipe Rafael Torres |
|
|
| Full Text HTML | Full Text PDF |
|
Letter to the Editor Are genetically modified crops compatible with sustainable agriculture? |
Genet. Mol. Res. 5 (1): 91-92 (2006)
|
|
|
Authors |
|
Nagib M.A. Nassar |
|
|
| Full Text HTML | Full Text PDF |
|
X-meeting - 1st International Conference of the AB3C Caxambu, MG, Brazil, October 4-7, 2005 |
| Genet. Mol. Res. 5 (1): 93-283 (2006) |
|
|
- Guest Editor
- Sandro José de Souza, Ludwig Institute for Cancer Research
- Associated Editor from GMR
- Vasco Azevedo, UFMG
- Associated Guest Editors
- Beatriz Stransky, UFRJ
- Darren Natale, Georgetown University
- Glória Franco, UFMG
- Guillherme de Oliveira, Fiocruz
- Helaine Carrer, USP
- Igor Polikarpov, USP
- João Eduardo Ferreira, USP
- Nalvo de Almeida Jr., UFMS
- Wilson Araújo da Silva Jr., USP
- Genet. Mol. Res. 5 (1): 93-283 (2006)
|
|
- Introduction
- Genet. Mol. Res. 5 (1) 2006
|
|
|
|
| Full Text HTML | Full Text PDF
|
SpotWhatR: a user-friendly microarray data analysis system |
Genet. Mol. Res. 5 (1): 93-107 (2006)
|
|
|
Authors |
|
Tie Koide, Silvia M. Salem-Izacc, Suely L. Gomes and Ricardo Z.N. Vêncio |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. SpotWhatR is a user-friendly microarray data analysis tool that runs under a widely and freely available R statistical language (http://www.r-project.org) for Windows and Linux operational systems. The aim of SpotWhatR is to help the researcher to analyze microarray data by providing basic tools for data visualization, normalization, determination of differentially expressed genes, summarization by Gene Ontology terms, and clustering analysis. SpotWhatR allows researchers who are not familiar with computational programming to choose the most suitable analysis for their microarray dataset. Along with well-known procedures used in microarray data analysis, we have introduced a stand-alone implementation of the HTself method, especially designed to find differentially expressed genes in low-replication contexts. This approach is more compatible with our local reality than the usual statistical methods. We provide several examples derived from the Blastocladiella emersonii and Xylella fastidiosa Microarray Projects. SpotWhatR is freely available at http://blasto.iq.usp.br/~tkoide/SpotWhatR, in English and Portuguese versions. In addition, the user can choose between “single experiment” and “batch processing” versions.
|
Gene Class Expression: analysis tool of Gene Ontology terms with gene expression data |
Genet. Mol. Res. 5 (1): 108-114 (2006)
|
|
|
Authors |
|
Gislaine S.P. Pereira, Rodrigo M. Brandão, Silvana Giuliatti, Marco A. Zago and Wilson A. Silva Jr. |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. Serial analysis of gene expression (SAGE) technology produces large sets of interesting genes that are difficult to analyze directly. Bioinformatics tools are needed to interpret the functional information in these gene sets. We present an interactive web-based tool, called Gene Class, which allows functional annotation of SAGE data using the Gene Ontology (GO) database. This tool performs searches in the GO database for each SAGE tag, making associations in the selected GO category for a level selected in the hierarchy. This system provides user-friendly data navigation and visualization for mapping SAGE data onto the gene ontology structure. This tool also provides graphical visualization of the percentage of SAGE tags in each GO category, along with confidence intervals and hypothesis testing.
|
GenoMycDB: a database for comparative analysis of mycobacterial genes and genomes |
Genet. Mol. Res. 5 (1): 115-126 (2006)
|
|
|
Authors |
|
Marcos Catanho, Daniel Mascarenhas, Wim Degrave and Antonio Basílio de Miranda |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. Several databases and computational tools have been created with the aim of organizing, integrating and analyzing the wealth of information generated by large-scale sequencing projects of mycobacterial genomes and those of other organisms. However, with very few exceptions, these databases and tools do not allow for massive and/or dynamic comparison of these data. GenoMycDB (http://www.dbbm.fiocruz.br/GenoMycDB) is a relational database built for large-scale comparative analyses of completely sequenced mycobacterial genomes, based on their predicted protein content. Its central structure is composed of the results obtained after pair-wise sequence alignments among all the predicted proteins coded by the genomes of six mycobacteria: Mycobacterium tuberculosis (strains H37Rv and CDC1551), M. bovis AF2122/97, M. avium subsp. paratuberculosis K10, M. leprae TN, and M. smegmatis MC2 155. The database stores the computed similarity parameters of every aligned pair, providing for each protein sequence the predicted subcellular localization, the assigned cluster of orthologous groups, the features of the corresponding gene, and links to several important databases. Tables containing pairs or groups of potential homologs between selected species/strains can be produced dynamically by user-defined criteria, based on one or multiple sequence similarity parameters. In addition, searches can be restricted according to the predicted subcellular localization of the protein, the DNA strand of the corresponding gene and/or the description of the protein. Massive data search and/or retrieval are available, and different ways of exporting the result are offered. GenoMycDB provides an on-line resource for the functional classification of mycobacterial proteins as well as for the analysis of genome structure, organization, and evolution.
|
Building multiple sequence alignments with a flavor of HSSP alignments |
Genet. Mol. Res. 5 (1): 127-137 (2006)
|
|
|
Authors |
|
Roberto Hiroshi Higa, Sergio Aparecido Braga da Cruz, Paula Regina Kuser, Michel Eduardo Beleza Yamagishi, Renato Fileto, Stanley Robson de Medeiros Oliveira, Ivan Mazoni, Edgard Henrique dos Santos and Adauto Luiz Mancini and Goran Neshich |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. Homology-derived secondary structure of proteins (HSSP) is a well-known database of multiple sequence alignments (MSAs) which merges information of protein sequences and their three-dimensional structures. It is available for all proteins whose structure is deposited in the PDB. It is also used by STING and JavaProtein Dossier to calculate and present relative entropy as a measure of the degree of conservation for each residue of proteins whose structure has been solved and deposited in the PDB. However, if the STING and JavaProtein Dossier are to provide support for analysis of protein structures modeled in computers or being experimentally solved but not yet deposited in the PDB, then we need a new method for building alignments having a flavor1 of HSSP alignments (myMSAr). The present study describes a new method and its corresponding databank (SH2QS - database of sequences homologue to the query [structure-having] sequence). Our main interest in making myMSAr was to measure the degree of residue conservation for a given query sequence, regardless of whether it has a corresponding structure deposited in the PDB. In this study, we compare the measurement of residue conservation provided by corresponding alignments produced by HSSP and SH2QS. As a case study, we also present two biologically relevant examples, the first one highlighting the equivalence of analysis of the degree of residue conservation by using HSSP or SH2QS alignments, and the second one presenting the degree of residue conservation for a structure modeled in a computer, which , as a consequence, does not have an alignment reported by HSSP.
|
BayBoots: a model-free Bayesian tool to identify class markers from gene expression data |
Genet. Mol. Res. 5 (1): 138-142 (2006)
|
|
|
Authors |
|
Ricardo Z.N. Vêncio, Diogo F.C. Patrão, Cassio S. Baptista, Carlos A.B. Pereira and Bianca Zingales |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. One of the goals of gene expression experiments is the identification of differentially expressed genes among populations that could be used as markers. For this purpose, we implemented a model-free Bayesian approach in a user-friendly and freely available web-based tool called BayBoots. In spite of a common misunderstanding that Bayesian and model-free approaches are incompatible, we merged them in the BayBoots implementation using the Kernel density estimator and Rubin’s Bayesian Bootstrap. We used the Bayes error rate (BER) instead of the usual P values as an alternative statistical index to rank a class marker’s discriminative potential, since it can be visualized by a simple graphical representation and has an intuitive interpretation. Subsequently, Bayesian Bootstrap was used to assess BER’s credibility. We tested BayBoots on microarray data to look for markers for Trypanosoma cruzi strains isolated from cardiac and asymptomatic patients. We found that the three most frequently used methods in microarray analysis: t-test, non-parametric Wilcoxon test and correlation methods, yielded several markers that were discarded by a time-consuming visual check. On the other hand, the BayBoots graphical output and ranking was able to automatically identify markers for which classification performance was consistent. BayBoots is available at: http://www.vision.ime.usp.br/~rvencio/BayBoots.
|
New strategy to detect single nucleotide polymorphisms |
Genet. Mol. Res. 5 (1): 143-153 (2006)
|
|
|
Authors |
|
Miguel Galves, José Augusto Amgarten Quitzau and Zanoni Dias |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. A great effort has been made to identify and map a large set of single nucleotide polymorphisms. The goal is to determine human DNA variants that contribute most significantly to population variation in each trait. Different algorithms and software packages, such as PolyBayes and PolyPhred, have been developed to address this problem. We present strategies to detect single nucleotide polymorphisms, using chromatogram analysis and consensi of multiple aligned sequences. The algorithms were tested using HIV datasets, and the results were compared with those produced by PolyBayes and PolyPhred using the same dataset. Our algorithms produced significantly better results than these two software packages.
|
A comparative analysis of highly conserved sex-determining genes between Apis mellifera and Drosophila melanogaster |
Genet. Mol. Res. 5 (1): 154-168 (2006)
|
|
|
Authors |
|
Alexandre S. Cristino, Adriana Mendes do Nascimento, Luciano da Fontoura Costa and Zilá Luz Paulino Simões |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. A comparison of the most conserved sex-determining genes between the fruit fly, Drosophila melanogaster, and the honey bee, Apis mellifera, was performed with bioinformatics tools developed for computational molecular biology. An initial set of protein sequences already described in the fruit fly as participants of the sex-determining cascade was retrieved from the Gene Ontology database (http://www.geneontology.org/) and aligned against a database of protein sequences predicted from the honey bee genome. The doublesex (dsx) gene is considered one of the most conserved sex-determining genes among metazoans, and a male-specific partial cDNA of putative A. mellifera dsx gene (Amdsx) was identified experimentally. The theoretical predictions were developed in the context of sequence similarity. Experimental evidence indicates that dsx is present in embryos and larvae, and that it encodes a transcription factor widely conserved in metazoans, containing a DM DNA-binding domain implicated in the regulation of the expression of genes involved in sexual phenotype formation.
|
Analysis of slipped sequences in EST projects |
Genet. Mol. Res. 5 (1): 169-181 (2006)
|
|
|
Authors |
|
Christian Baudet and Zanoni Dias |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. Slippage is an important sequencing problem that can occur in EST projects. However, very few studies have addressed this. We propose three new methods to detect slippage artifacts: arithmetic mean method, geometric mean method, and echo coverage method. Each method is simple and has two different strategies for processing sequences: suffix and subsequence. Using the 291,689 EST sequences produced in the SUCEST project, we performed comparative tests between our proposed methods and the SUCEST method. The subsequence strategy is better than the suffix strategy, because it is not anchored at the end of the sequence, so it is more flexible to find slippage at the beginning of the EST. In a comparison with the SUCEST method, the advantage of our methods is that they do not discard the majority of the sequences marked as slippage, but instead only remove the slipped artifact from the sequence. Based on our tests the echo coverage method with subsequence strategy shows the best compromise between slippage detection and ease of calibration.
|
Exploring molecular networks using MONET ontology |
Genet. Mol. Res. 5 (1): 182-192 (2006)
|
|
|
Authors |
|
João Paulo Müller da Silva, Ney Lemke, José Carlos Mombach, José Guilherme Camargo de Souza, Marialva Sinigaglia and Renata Vieira |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. The description of the complex molecular network responsible for cell behavior requires new tools to integrate large quantities of experimental data in the design of biological information systems. These tools could be used in the characterization of these networks and in the formulation of relevant biological hypotheses. The building of an ontology is a crucial step because it integrates in a coherent framework the concepts necessary to accomplish such a task. We present MONET (molecular network), an extensible ontology and an architecture designed to facilitate the integration of data originating from different public databases in a single- and well-documented relational database, that is compatible with MONET formal definition. We also present an example of an application that can easily be implemented using these tools.
|
Predicting enzyme class from protein structure using Bayesian classification |
Genet. Mol. Res. 5 (1): 193-202 (2006)
|
|
|
Authors |
|
Luiz C. Borro, Stanley R.M. Oliveira, Michel E.B. Yamagishi, Adaulto L. Mancini, José G. Jardine, Ivan Mazoni, Edgard H. dos Santos, Roberto H. Higa, Paula R. Kuser and Goran Neshich |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. Predicting enzyme class from protein structure parameters is a challenging problem in protein analysis. We developed a method to predict enzyme class that combines the strengths of statistical and data-mining methods. This method has a strong mathematical foundation and is simple to implement, achieving an accuracy of 45%. A comparison with the methods found in the literature designed to predict enzyme class showed that our method outperforms the existing methods.
|
DBCollHIV: a database system for collaborative HIV analysis in Brazil |
Genet. Mol. Res. 5 (1): 203-215 (2006)
|
|
|
Authors |
|
Luciano V. Araújo, Marcelo A. Soares, Suelene M. Oliveira, Pedro Chequer, Amilcar Tanuri, Ester C. Sabino and João E. Ferreira |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. We developed a database system for collaborative HIV analysis (DBCollHIV) in Brazil. The main purpose of our DBCollHIV project was to develop an HIV-integrated database system with analytical bioinformatics tools that would support the needs of Brazilian research groups for data storage and sequence analysis. Whenever authorized by the principal investigator, this system also allows the integration of data from different studies and/or the release of the data to the general public. The development of a database that combines sequences associated with clinical/epidemiological data is difficult without the active support of interdisciplinary investigators. A functional database that securely stores data and helps the investigator to manipulate their sequences before publication would be an attractive tool for investigators depositing their data and collaborating with other groups. DBCollHIV allows investigators to manipulate their own datasets, as well as integrating molecular and clinical HIV data, in an innovative fashion.
|
Establishment of a heterologous system for the expression of Canavalia brasiliensis lectin: a model for the study of protein splicing |
Genet. Mol. Res. 5 (1): 216-223 (2006)
|
|
|
Authors |
|
Walderly Melgaço Bezerra, Cristina Paiva da Silveira Carvalho, Renato de Azevedo Moreira and Thalles Barbosa Grangeiro |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. During its biosynthesis in developing Canavalia brasiliensis seeds, the lectin ConBr undergoes a form of protein splicing in which the order of the N- and C-domains of the protein is reversed. To investigate whether these events can occur in other eukaryotic organisms, an expression system based on Pichia pastoris cells was established. A DNA fragment encoding prepro-ConBr was cloned into the vector pPICZB, and the recombinant plasmid was transformed in P. pastoris strain GS115. Ten clones were screened for effective recombinant protein production. Based on Western blot analysis of the two clones with the highest level of protein expression: 1) diffuse high-molecular mass immunoreactive bands were produced as early as 24 h after induction; 2) a single-, high-molecular mass protein was secreted into the medium, and 3) a significant fraction of the recombinant polypeptides that cross-reacted with anti-ConBr antibodies comprised a band of approximately 34.5 kDa. Diffuse protein bands with high molecular masses are attributed to hyperglycosylation at the single potential N-glycosylation site located in the linker peptide of prepro-ConBr. In contrast, native ConBr is made up of three polypeptides, the intact a chain (aa 1-237) and the fragments b (aa 1-118) and g (aa 119-237), which have apparent molecular masses of 30, 16 and 12 kDa, respectively. Apparently, the yeast P. pastoris is not able to carry out all the complex post-translational proteolytic processing necessary for the biosynthesis of ConBr.
|
In silico prediction of yeast deletion phenotypes |
Genet. Mol. Res. 5 (1): 224-232 (2006)
|
|
|
Authors |
|
Soma Saha and Steffen Heber |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. Analysis of gene deletions is a fundamental approach for investigating gene function. We evaluated an algorithm that uses classification techniques to predict the phenotypic effects of gene deletions in yeast. We used a modified simulated annealing algorithm for feature selection and weighting. The selected features with high weights were phylogenetic conservation scores for bacteria, fungi (excluding Ascomycota), Ascomycota (excluding Saccharomyces cerevisiae), plants, and mammals, degree of paralogy, and number of protein-protein interactions. Classification was performed by weighted k-nearest neighbor and with support vector machine algorithms. To demonstrate how this approach might complement existing experimental procedures, we applied our algorithm to predict essential genes and genes causing morphological alterations in yeast.
|
An empirical examination of the standard errors of maximum likelihood phylogenetic parameters under the molecular clock via bootstrapping |
Genet. Mol. Res. 5 (1): 233-241 (2006)
|
|
|
Authors |
|
Carlos G. Schrago |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. The molecular clock theory has greatly enlightened our understanding of macroevolutionary events. Maximum likelihood (ML) estimation of divergence times involves the adoption of fixed calibration points, and the confidence intervals associated with the estimates are generally very narrow. The credibility intervals are inferred assuming that the estimates are normally distributed, which may not be the case. Moreover, calculation of standard errors is usually carried out by the curvature method and is complicated by the difficulty in approximating second derivatives of the likelihood function. In this study, a standard primate phylogeny was used to examine the standard errors of ML estimates via the bootstrap method. Confidence intervals were also assessed from the posterior distribution of divergence times inferred via Bayesian Markov Chain Monte Carlo. For the primate topology under evaluation, no significant differences were found between the bootstrap and the curvature methods. Also, Bayesian confidence intervals were always wider than those obtained by ML.
|
A picture of gene sampling/expression in model organisms using ESTs and KOG proteins |
Genet. Mol. Res. 5 (1): 242-253 (2006)
|
|
|
Authors |
|
Maurício de Alvarenga Mudado and José Miguel Ortega |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. The expressed sequence tag (EST) is an instrument of gene discovery. When available in large numbers, ESTs may be used to estimate gene expression. We analyzed gene expression by EST sampling, using the KOG database, which includes 24,154 proteins from Arabidopsis thaliana (Ath), 17,101 from Caenorhabditis elegans (Cel), 10,517 from Drosophila melanogaster (Dme), and 26,324 from Homo sapiens (Hsa), and 178,538 ESTs for Ath, 215,200 for Cel, 261,404 for Dme, and 1,941,556 for Hsa. BLAST similarity searches were performed to assign KOG annotation to all ESTs. We determined the amount of gene sampling or expression dedicated to each KOG functional category by each model organism. We found that the 25% most-expressed genes are frequently shared among these organisms. The KOG protein classification allowed the EST sampling calculation throughout the glycolysis pathway. We calculated the KOG cluster coverage and inferred that 50 to 80 K ESTs would efficiently cover 80-85% of the KOG database clusters in a transcriptome project. Since KOG is a database biased towards housekeeping genes, this is probably the number of ESTs needed to include the more commonly expressed genes in these organisms. We also examined a still unaddressed question: what is the minimum number of ESTs that should be produced in a transcriptome project?
|
Gene networks as a tool to understand transcriptional regulation |
Genet. Mol. Res. 5 (1): 254-268 (2006)
|
|
|
Authors |
|
Diogo Fernando Veiga, Fábio Fernandes da Rocha Vicente and Gustavo Bastos |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. Gene regulatory networks, or simply gene networks (GNs), have shown to be a promising approach that the bioinformatics community has been developing for studying regulatory mechanisms in biological systems. GNs are built from the genome-wide high-throughput gene expression data that are often available from DNA microarray experiments. Conceptually, GNs are (un)directed graphs, where the nodes correspond to the genes and a link between a pair of genes denotes a regulatory interaction that occurs at transcriptional level. In the present study, we had two objectives: 1) to develop a framework for GN reconstruction based on a Bayesian network model that captures direct interactions between genes through nonparametric regression with B-splines, and 2) to demonstrate the potential of GNs in the analysis of expression data of a real biological system, the yeast pheromone response pathway. Our framework also included a number of search schemes to learn the network. We present an intuitive notion of GN theory as well as the detailed mathematical foundations of the model. A comprehensive analysis of the consistency of the model when tested with biological data was done through the analysis of the GNs inferred for the yeast pheromone pathway. Our results agree fairly well with what was expected based on the literature, and we developed some hypotheses about this system. Using this analysis, we intended to provide a guide on how GNs can be effectively used to study transcriptional regulation. We also discussed the limitations of GNs and the future direction of network analysis for genomic data. The software is available upon request.
|
A fully resolved consensus between fully resolved phylogenetic trees |
Genet. Mol. Res. 5 (1): 269-283 (2006)
|
|
|
Authors |
|
José Augusto Amgarten Quitzau and João Meidanis |
|
|
Abstract | Abstract HTML | Full Text HTML | Full Text PDF |
ABSTRACT. Nowadays, there are many phylogeny reconstruction methods, each with advantages and disadvantages. We explored the advantages of each method, putting together the common parts of trees constructed by several methods, by means of a consensus computation. A number of phylogenetic consensus methods are already known. Unfortunately, there is also a taboo concerning consensus methods, because most biologists see them mainly as comparators and not as phylogenetic tree constructors. We challenged this taboo by defining a consensus method that builds a fully resolved phylogenetic tree based on the most common parts of fully resolved trees in a given collection. We also generated results showing that this consensus is in a way a kind of “median” of the input trees; as such it can be closer to the correct tree in many situations.
|