Funpec-RpAbout The JournalEditorial BoardCurrent IssueAll IssuesSearchIndexersInstructions For AuthorsContactSponsorsLinks

Optimization of a multiplex minisequencing protocol for population studies and medical genetics
Cláudia M.B. Carvalho1 and Sérgio D.J. Pena1,2
1Departamento de Bioquímica e Imunologia, Universidade Federal de Minas Gerais,
Belo Horizonte, MG, Brasil
2GENE-Núcleo de Genética Médica de Minas Gerais, Belo Horizonte, MG, Brasil
Corresponding author: S.D.J. Pena
E-mail: spena@dcc.ufmg.br
Genet. Mol. Res. 4 (2): 115-125 (2005)
Received March 24, 2005
Accepted April 1, 2005
Published April 13, 2005

ABSTRACT. Several technologically sophisticated high-throughput techniques have been recently developed for the study of human single nucleotide polymorphisms and the diagnosis of point mutations in human diseases. However, there is also a need for simple and inexpensive techniques suitable for clinical services and small research laboratories. Minisequencing meets the latter requirements. It is simple, non-radioactive and can be easily multiplexed by adding oligonucleotide tails of increasing size to the sequencing oligonucleotide primers. To optimize the minisequencing protocol, we designed a test multiplex system capable of typing simultaneously 12 different human autosomal single nucleotide polymorphisms. We discovered that the quality of minisequencing primers and the careful selection of the tail sequences were especially critical for success. This optimized protocol permits rapid genotyping at low cost and can serve as a blueprint for the creation of multiplex minisequencing systems suitable to virtually any typing application in population studies and medical genetics.

Key words: DNA, Polymorphisms, Single nucleotide polymorphisms, Minisequencing, Multiplex

INTRODUCTION

The Human Genome Project has produced a large amount of genetic data, including the identification of more than ten million single nucleotide polymorphisms (SNPs) that are available in public databases (e.g., dbSNP: http://www.ncbi.nlm.nih.gov/SNP and TSC: http://snp.cshl.org). Because of their abundance and widespread genomic distribution, SNPs have become the most useful genome markers in human molecular genetics and are used in gene mapping (Rioux et al., 2001; Ozaki et al., 2002; Kammerer et al., 2005), in population genetics and in evolutionary studies (Underhill et al., 2001; Sanchez et al., 2003; Liljedahl et al., 2003; Shriver et al., 2004) as well as individual identification markers in forensic applications (Syvanen et al., 1993; Just et al., 2004; Inagaki et al., 2004). Moreover, point mutations, which are studied using the same techniques as SNPs, have a fundamental importance in molecular medicine, as causative agents of human diseases (Amir et al., 1999; Lai et al., 2001; Prokunina et al., 2002; see Human Gene Mutation Database at http://archive.uwcm.ac.uk/uwcm/mg/hgmd0.html).

Several molecular methodologies have been developed for discovery and genotyping of SNPs, as reviewed by Syvanen (2001) and Kwok and Chen (2003). SNP discovery relies mostly on direct DNA sequencing or on denaturing high-performance liquid chromatography (dHPLC), while SNP genotyping methods rely on methodology for allelic discrimination (Kwok and Chen, 2003). Very advanced techniques have been developed with amazing throughput capabilities. For instance, recently Hinds et al. (2005) used a microarray platform to type simultaneously 1,586,383 SNPs in 71 Americans of European, African, and Asian ancestry!

In spite of these fantastic advances, most clinical services and small research laboratories still make extensive use of the rather inefficient restriction fragment length polymorphism approach to genotype SNPs (Gibbs et al., 1986; Lander and Botstein, 1986; Dietz-Band et al., 1990; Santos et al., 1996; Bazrafshani et al., 2000; Haak et al., 2004) and diagnose diseases. Although restriction enzyme digestion of PCR products is a simple technique, it is not amenable to automation and cannot be used in a multiplex format. Thus, there is a need to develop multiplex techniques that are efficient and yet simple enough to be used in unsophisticated laboratories.

In 1990, Syvanen et al. first proposed the technique of primer extension (minisequencing) for SNP genotyping. The same group later made several alterations on the method, substituting fluorescent labels for radioactivity and introducing several improvements (Syvanen, 1999). In order to set up an efficient and yet simple and low-cost technique for SNP typing, we decided to optimize Syvanen’s extension primer technique in a multiplex format. As a model test, we constructed a multiplex system with 12 polymorphic SNPs. We here demonstrate the applicability of this method, which indeed has proven to be easy, uncomplicated and yet highly effective.

MATERIAL AND METHODS

DNA samples

In the development of the minisequencing protocol we used DNA samples from white Brazilian individuals as described previously (Alves-Silva et al., 2000; Carvalho-Silva et al., 2001). DNA samples from 200 unrelated individuals were mixed in a pool to verify allele heterozygosity.

SNP selection and primer design

We chose SNPs from the public database dbSNP (http://www.ncbi.nlm.nih.gov/SNP/). We picked 15 polymorphisms, each one placed at a different chromosome, all with heterozygosity in the range between 0.4 to 0.6 and all located in introns or the 3’- or 5’-untranslated region of mRNAs. All selected primers were checked for spurious matches with other human sequences using Genome BLAST (http://www.ncbi.nlm.nih.gov/genome/seq/HsBlast.html).

Primers were designed using the primer analysis software Oligo, version 4.0 and synthesized commercially. We added a T7 universal tail to one primer of each pair in order to improve multiplex optimization. Detailed information about the primers, the amplified product and the SNP detected by each pair, can be seen in Table 1.


Liquid-phase minisequencing

Syvanen et al. (1990) were the first to propose the principle of minisequencing, initially using a solid phase format. In contrast, we chose a liquid phase and developed a multiplex typing strategy that can, in principle, be used to study any SNP. Figure 1 shows the principle of the method.


PCR

PCR primers for the 15 SNP amplicons and the T7 primer were combined into a single multiplex reaction, generating different PCR product sizes. The best conditions obtained are provided in Table 1. The primer containing the T7 tail was used at a concentration 10 times smaller than that of the non-tailed primer and the T7 primer. This protocol seems to improve the multiplex results, possibly by making the reaction more uniform. All PCR reactions were done in a volume of 12.5 µl with 50 ng genomic DNA, 2.0 mM MgCl2, 1x AmpliTaq Gold buffer, 200 µM dNTPs, 1 unit Taq DNA Polymerase (AmpliTaq Gold; Applied Biosystems, Foster City, CA, USA). Amplification was achieved in an M.J. Research Cycler (M.J. Research, Watertown, MA, USA) and consisted of 95°C for 5 min, followed by 34 cycles for 30 s at 95°C, 30 s at 50°C, 1 min at 72°C, and a final extension for 30 min at 72°C. The PCR products were resolved in 6% polyacrylamide gels and silver stained according to Santos et al. (1993).

ExoSap treatment

In order to eliminate the excess of primers and dNTPs, the PCR products were digested by an “ExoSAP” mix consisting of 2 units/µl E. coli exonuclease I (Exo I) and 0.2 units/µl shrimp alkaline phosphatase (SAP) and 1x SAP buffer. The enzymes were purchased from the U.S.B. Corporation (Cleveland, OH, USA). Five microliters of PCR product was added to five microliters of ExoSAP mix and incubated at 37°C for 30 min. The enzymes were afterwards inactivated by heating at 80°C for 15 min.

Design of minisequencing primers

Table 2 shows the genotyping primers designed for each SNP. Primers were designed with the 3’-extremity complementary to the last base immediately before the polymorphic residue. In order to distinguish between the sizes of the detection products, the primers were synthesized ranging from 20 to 107 nucleotides. The desired sized of the primers were adjusted by addition of a piece of a “neutral” sequence at the 5’ extremity. The polynucleotide sequence of the “neutral” sequences was based on the sequence of pUC18, since this plasmid does not match any human sequences in the NCBI non-redundant database. In the minisequencing multiplex, the distance between the two SNP alleles varied from one to 10 nucleotides, depending on the allele present at the SNP target and the subsequent sequences (Table 2). For each 3-bp DNA fragment size interval, one SNP locus could be detected. All the primers used in minisequencing were purified by HPLC.


Minisequencing reaction and electrophoresis

Multiplex PCR minisequencing was performed in a 12.5-µl volume with 1 µl purified PCR product and 0.01-0.5 µM of the primers (Table 2). The PCR reaction contained 0.5 µM ddATP labeled with fluorescein (NEL402-Perkin Elmer Life Sciences, Boston, MA, USA), 0.5 µM unlabeled dCTP, dTTP and dGTP, 3.5 mM MgCl2, 1x Thermo Sequenase buffer, and 1 unit Thermo Sequenase DNA Polymerase (Amersham Biosciences, Uppsala, Sweden). The thermal cycling was performed as “hot start”: the enzyme was added 2 min after an initial denaturation. The “cycle sequencing” reaction consisted of 80°C for 5 min, followed by 30 cycles of 95°C for 30 s, 55°C for 30 s and 72°C for 20 s.

The minisequencing products were applied in a 6% denaturing polyacrylamide gel and resolved in a fluorescent automatic DNA sequencer ALF (Amersham Biosciences). The analyses were done in the software package Allelinks (Amersham Biosciences).

RESULTS

PCR multiplex

It was necessary to experiment with the concentration of each primer to obtain a good multiplex result. The final concentrations are shown in Table 1. A silver-stained polyacrylamide gel of the multiplex products is shown in Figure 2. We were forced to eliminate one SNP (12q) from the multiplex format, because it did not produce visible peaks in it, despite working very well in the simplex format.


Minisequencing simplex

Before performing the multiplex minisequencing, we checked the quality of each SNP minisequencing in a simplex reaction. We used a single individual to do this as can be seen in Figure 3A. We had to exclude one SNP (10q) from the multiplex because its product was consistently weak and the background in its vicinity was high (see Figure 3A). Figure 3B shows the simplex minisequencing results using a pool of 200 unrelated Brazilian individuals and permits visualization of both alleles at all selected loci.


Minisequencing multiplex

To obtain good results in the multiplex minisequencing protocol it was necessary to experiment with primer concentrations, so that peak sizes became relatively even (Figure 4). We were forced to eliminate one SNP (5q) from the multiplex format, because it did not show products despite working very well in the simplex format. Thus, after having to remove three SNPs from the original set of 15, we obtained a reliable multiplex minisequencing protocol that allowed us to type 12 SNPs in a simple, direct and routine fashion.


DISCUSSION

We used SNPs chosen from a public database as a test model to develop a multiplex minisequencing protocol. The protocol has wide applicability in the genotyping of SNPs or in the diagnosis of point mutations in human diseases. During the development of this protocol we observed several important aspects that have to be observed to obtain satisfactory results.

In the first multiplex PCR reaction that will provide the templates for the minisequencing reaction, the most important aspect identified was the optimization of the yields of each PCR product. Weakly amplified products never worked well in the minisequencing protocol. However, strong amplification is no guarantee of success, especially with large amplicons. For instance, the SNPs 5q (413 bp) and 12q (576 bp) both presented a good yield in PCR and strong minisequencing peaks if they were done in simplex format, but not in multiplex. In general, shorter products are easier to adjust to the multiplex format and should be preferred. A uniform tail in the primers, such as the T7 sequence that we utilize, helps with the optimization of the multiplex system, because it will reduce the sequence differences among the primers during the PCR.

During the optimization of the purification step we also tested other protocols. One of them was based on magnetic beads coated with streptavidin (Syvanen et al., 1993). We synthesized biotin-linked primers, which have affinity to the streptavidin beads, allowing the elimination of primers and dNTPs not used in the PCR reaction. Although this is an elegant approach, the enzymatic purification method using ExoSAP was easier, faster and more efficient.

The quality of the primers was observed to be an important factor in minisequencing protocols. The multiplex reaction is done using a mix of different size primers with diverse sequences that could interact and be extended by DNA polymerase generating strong background signals. Besides, each primer has contaminants produced during its synthesis. Initially we used non-purified primers during minisequencing multiplex optimization and obtained high background signals and some weak peaks. When we switched to HPLC-purified primers we observed that the purification step was especially important for primers longer than 60-mers (data not shown).

Some minisequencing products presented stronger signal than others. Because of this we altered each primer concentration in the multiplex until we could obtain relatively even results. We could not establish why the primers presented such different peak heights. We used the Spearman coefficient and Kendall coefficient (SPSS Inc., Chicago, IL, USA) to evaluate the relationship between the peak signal quality in the multiplex and specific primer characteristics such as melting temperature (Tm), CG content or primer size through. However, no significant correlation coefficients were seen (data not shown).

The minisequencing primer tail must be selected with caution. Initially, we used tails formed by GATC repetition but we obtained product sizes different from the predicted ones. Most probably GATC tails can form internal annealing bonds that resist the denaturing conditions used in the gels. The selection of tails based on the sequence of pUC18 solved this problem.

In conclusion we successfully produced a reliable multiplex system for simultaneous typing of 12 SNPs. This can be used as a model for accurate, simple and inexpensive genotyping of single nucleotide polymorphisms or in the diagnosis of point mutations in human diseases.

ACKNOWLEDGMENTS

We would like to thank Neuza Antunes, Kátia Barroso and Míriam Rodrigues for technical assistance. Research supported by the Conselho Nacional de Desenvolvimento Científico e Tecnológico of Brazil (CNPq).

REFERENCES

Alves-Silva, J., da Silva Santos, M., Guimaraes, P.E., Ferreira, A.C., Bandelt, H.J., Pena, S.D. and Prado, V.F. (2000). The ancestry of Brazilian mtDNA lineages. Am. J. Hum. Genet. 67: 444-461.

Amir, R.E., Van den Veyver, I.B., Wan, M., Tran, C.Q., Francke, U. and Zoghbi, H.Y. (1999). Rett syndrome is caused by mutations in X-linked MECP2, encoding methyl-CpG-binding protein 2. Nat. Genet. 23: 185-188.

Bazrafshani, M.R., Ollier, W.E. and Hajeer, A.H. (2000). A novel PCR-RFLP assay for the detection of the single nucleotide polymorphism at position -1082 in the human IL-10 gene promoter. Eur. J. Immunogenet. 27: 119-120.

Carvalho-Silva, D.R., Santos, F.R., Rocha, J. and Pena, S.D. (2001). The phylogeography of Brazilian Y-chromosome lineages. Am. J. Hum. Genet. 68: 281-286

Dietz-Band, J.N., Turco, A.E., Willard, H.F., Vincent, A., Skolnick, M.H. and Barker, D.F. (1990). Isolation, characterization, and physical localization of 33 human X-chromosome RFLP markers. Cytogenet. Cell Genet. 54: 137-141.

Gibbs, D.A., Headhouse-Benson, C.M. and Watts, R.W. (1986). Family studies of the Lesch-Nyhan syndrome: the use of a restriction fragment length polymorphism (RFLP) closely linked to the disease gene for carrier state and prenatal diagnosis. J. Inherit. Metab. Dis. 9: 45-57.

Haak, W., Burger, J. and Alt, K.W. (2004). ABO genotyping by PCR-RFLP and cloning and sequencing. Anthropol. Anz. 62: 397-410.

Hinds, D.A., Stuve, L.L., Nilsen, G.B., Halperin, E., Eskin, E., Ballinger, D.G., Frazer, K.A. and Cox, D.R. (2005). Whole-genome patterns of common DNA variation in three human populations. Science 307: 1072-1079.

Inagaki, S., Yamamoto, Y., Doi, Y., Takata, T., Ishikawa, T., Imabayashi, K., Yoshitome, K., Miyaishi, S. and Ishizu, H. (2004). A new 39-plex analysis method for SNPs including 15 blood group loci. Forensic Sci. Int. 144: 45-57.

Just, R.S., Irwin, J.A., O’Callaghan, J.E., Saunier, J.L., Coble, M.D., Vallone, P.M., Butler, J.M., Barritt, S.M. and Parsons, T.J. (2004). Toward increased utility of mtDNA in forensic identifications. Forensic Sci. Int. 146 (Suppl): S147-S149.

Kammerer, S., Roth, R.B., Hoyal, C.R., Reneland, R., Marnellos, G., Kiechle, M., Schwarz-Boeger, U., Griffiths, L.R., Ebner, F., Rehbock, J., Cantor, C.R., Nelson, M.R. and Braun, A. (2005). Association of the NuMA region on chromosome 11q13 with breast cancer susceptibility. Proc. Natl. Acad. Sci. USA 102: 2004-2009.

Kwok, P.Y. and Chen, X. (2003). Detection of single nucleotide polymorphisms. Curr. Issues Mol. Biol. 5: 43-60.

Lai, C.S., Fisher, S.E., Hurst, J.A., Vargha-Khadem, F. and Monaco, A.P. (2001). A forkhead-domain gene is mutated in a severe speech and language disorder. Nature 413: 519-523.

Lander, E.S. and Botstein, D. (1986). Mapping complex genetic traits in humans: new methods using a complete RFLP linkage map. Cold Spring Harb. Symp. Quant. Biol. Pt 1: 49-62.

Liljedahl, U., Karlsson, J., Melhus, H., Kurland, L., Lindersson, M., Kahan, T., Nystrom, F., Lind, L. and Syvanen, A.C. (2003). A microarray minisequencing system for pharmacogenetic profiling of antihypertensive drug response. Pharmacogenetics 13: 7-17.

Ozaki, K., Ohnishi, Y., Iida, A., Sekine, A., Yamada, R., Tsunoda, T., Sato, H., Hori, M., Nakamura, Y. and Tanaka, T. (2002). Functional SNPs in the lymphotoxin-alpha gene that are associated with susceptibility to myocardial infarction. Nat. Genet. 32: 650-654.

Prokunina, L., Castillejo-Lopez, C., Oberg, F., Gunnarsson, I., Berg, L., Magnusson, V., Brookes, A.J., Tentler, D., Kristjansdottir, H., Grondal, G., Bolstad, A.I., Svenungsson, E., Lundberg, I., Sturfelt, G., Jonssen, A., Truedsson, L., Lima, G., Alcocer-Varela, J., Jonsson, R., Gyllensten, U.B., Harley, J.B., Alarcon-Segovia, D., Steinsson, K. and Alarcon-Riquelme, M.E. (2002). A regulatory polymorphism in PDCD1 is associated with susceptibility to systemic lupus erythematosus in humans. Nat. Genet. 32: 666-669.

Rioux, J.D., Daly, M.J., Silverberg, M.S., Lindblad, K., Steinhart, H., Cohen, Z., Delmonte, T., Kocher, K., Miller, K., Guschwan, S., Kulbokas, E.J., O’Leary, S., Winchester, E., Dewar, K., Green, T., Stone, V., Chow, C., Cohen, A., Langelier, D., Lapointe, G., Gaudet, D., Faith, J., Branco, N., Bull, S.B., McLeod, R.S., Griffiths, A.M., Bitton, A., Greenberg, G.R., Lander, E.S., Siminovitch, K.A. and Hudson, T.J. (2001). Genetic variation in the 5q31 cytokine gene cluster confers susceptibility to Crohn disease. Nat. Genet. 29: 223-228.

Sanchez, J.J., Borsting, C., Hallenberg, C., Buchard, A., Hernandez, A. and Morling, N. (2003). Multiplex PCR and minisequencing of SNPs - a model with 35 Y chromosome SNPs. Forensic Sci. Int. 137: 74-84.

Santos, F.R., Pena, S.D.J. and Epplen, J.T. (1993). Genetic and population study of an Y-linked tetranucleotide repeat DNA polymorphism with a simple non-isotopic technique. Hum. Genet. 90: 655-656.

Santos, S.E., Ribeiro-Dos-Santos, A.K., Meyer, D. and Zago, M.A. (1996). Multiple founder haplotypes of mitochondrial DNA in Amerindians revealed by RFLP and sequencing. Ann. Hum. Genet. 60 (Pt. 4): 305-319.

Shriver, M.D., Kennedy, G.C., Parra, E.J., Lawson, H.A., Sonpar, V., Huang, J., Akey, J.M. and Jones, K.W. (2004). The genomic distribution of population substructure in four populations using 8,525 autosomal SNPs. Hum. Genomics 1: 274-286.

Syvanen, A.C. (1999). From gels to chips: “minisequencing” primer extension for analysis of point mutations and single nucleotide polymorphisms. Hum. Mutat. 13: 1-10.

Syvanen, A.C. (2001). Accessing genetic variation: genotyping single nucleotide polymorphisms. Nat. Rev. Genet. 2: 930-942.

Syvanen, A.C., Aalto-Setala, K., Harju, L., Kontula, K. and Soderlund, H. (1990). A primer-guided nucleotide incorporation assay in the genotyping of apolipoprotein E. Genomics 8: 684-692.

Syvanen, A.C., Sajantila, A. and Lukka, M. (1993). Identification of individuals by analysis of biallelic DNA markers, using PCR and solid-phase minisequencing. Am. J. Hum. Genet. 52: 46-59.

Underhill, P.A., Passarino, G., Lin, A.A., Shen, P., Mirazon, Lahr, M., Foley, R.A., Oefner, P.J. and Cavalli-Sforza, L.L. (2001). The phylogeography of Y chromosome binary haplotypes and the origins of modern human populations. Ann. Hum. Genet. 65 (Pt. 1): 43-62.

   Copyright © 2005 by FUNPEC