|
- BayBoots: a model-free Bayesian tool to identify class markers from gene expression data
- Ricardo Z.N. Vêncio1,2, Diogo F.C. Patrão3, Cassio S. Baptista4, Carlos A.B. Pereira1 and
- Bianca Zingales4
- 1BIOINFO-USP Núcleo de Pesquisas em Bioinformática and Departamento de Estatística,
- Instituto de Matemática e Estatística, Universidade de São Paulo, Rua do Matão, 1010,
- 05508-090 São Paulo, SP, Brasil
- 2Instituto Israelita de Ensino e Pesquisa Albert Einstein, Hospital Israelita Albert Einstein,
- Av. Albert Einstein, 627, 05651-901 São Paulo, SP, Brasil
- 3Hospital do Câncer A.C. Camargo, R. Prof. Antonio Prudente, 109, 01509-010 São Paulo, SP, Brasil
- 4Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo,
- Av. Prof. Lineu Prestes, 748, 05508-000 São Paulo, SP, Brasil
- Corresponding author: R.Z.N. Vêncio
- E-mail: rvencio@vision.ime.usp.br
- Genet. Mol. Res. 5 (1): 138-142 (2006)
- Received January 10, 2006
- Accepted February 17, 2006
- Published March 31, 2006
ABSTRACT. One of the goals of gene expression experiments is the identification of differentially expressed genes among populations that could be used as markers. For this purpose, we implemented a model-free Bayesian approach in a user-friendly and freely available web-based tool called BayBoots. In spite of a common misunderstanding that Bayesian and model-free approaches are incompatible, we merged them in the BayBoots implementation using the Kernel density estimator and Rubin’s Bayesian Bootstrap. We used the Bayes error rate (BER) instead of the usual P values as an alternative statistical index to rank a class marker’s discriminative potential, since it can be visualized by a simple graphical representation and has an intuitive interpretation. Subsequently, Bayesian Bootstrap was used to assess BER’s credibility. We tested BayBoots on microarray data to look for markers for Trypanosoma cruzi strains isolated from cardiac and asymptomatic patients. We found that the three most frequently used methods in microarray analysis: t-test, non-parametric Wilcoxon test and correlation methods, yielded several markers that were discarded by a time-consuming visual check. On the other hand, the BayBoots graphical output and ranking was able to automatically identify markers for which classification performance was consistent. BayBoots is available at:
http://www.vision.ime.usp.br/~rvencio/BayBoots.
Key words: Microarray, Bioinformatics, Statistics, Web tool, Gene expression, Bayesian bootstrap
|