Funpec-RpAbout The JournalEditorial BoardCurrent IssueAll IssuesSearchIndexersInstructions For AuthorsContactSponsorsLinks

BayBoots: a model-free Bayesian tool to identify class markers from gene expression data
Ricardo Z.N. Vêncio1,2, Diogo F.C. Patrão3, Cassio S. Baptista4, Carlos A.B. Pereira1 and
Bianca Zingales4
1BIOINFO-USP Núcleo de Pesquisas em Bioinformática and Departamento de Estatística,
Instituto de Matemática e Estatística, Universidade de São Paulo, Rua do Matão, 1010,
05508-090 São Paulo, SP, Brasil
2Instituto Israelita de Ensino e Pesquisa Albert Einstein, Hospital Israelita Albert Einstein,
Av. Albert Einstein, 627, 05651-901 São Paulo, SP, Brasil
3Hospital do Câncer A.C. Camargo, R. Prof. Antonio Prudente, 109, 01509-010 São Paulo, SP, Brasil
4Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo,
Av. Prof. Lineu Prestes, 748, 05508-000 São Paulo, SP, Brasil
Corresponding author: R.Z.N. Vêncio
E-mail: rvencio@vision.ime.usp.br
Genet. Mol. Res. 5 (1): 138-142 (2006)
Received January 10, 2006
Accepted February 17, 2006
Published March 31, 2006

ABSTRACT. One of the goals of gene expression experiments is the identification of differentially expressed genes among populations that could be used as markers. For this purpose, we implemented a model-free Bayesian approach in a user-friendly and freely available web-based tool called BayBoots. In spite of a common misunderstanding that Bayesian and model-free approaches are incompatible, we merged them in the BayBoots implementation using the Kernel density estimator and Rubin’s Bayesian Bootstrap. We used the Bayes error rate (BER) instead of the usual P values as an alternative statistical index to rank a class marker’s discriminative potential, since it can be visualized by a simple graphical representation and has an intuitive interpretation. Subsequently, Bayesian Bootstrap was used to assess BER’s credibility. We tested BayBoots on microarray data to look for markers for Trypanosoma cruzi strains isolated from cardiac and asymptomatic patients. We found that the three most frequently used methods in microarray analysis: t-test, non-parametric Wilcoxon test and correlation methods, yielded several markers that were discarded by a time-consuming visual check. On the other hand, the BayBoots graphical output and ranking was able to automatically identify markers for which classification performance was consistent. BayBoots is available at: http://www.vision.ime.usp.br/~rvencio/BayBoots.

Key words: Microarray, Bioinformatics, Statistics, Web tool, Gene expression, Bayesian bootstrap

 

Copyright © 2006 by FUNPEC