Protocols for obtaining and analyzing DNA/NGS data for phylogenetics and phylogeography

molecular laboratory supplies

Here, I'll list some of the main supplies we use in the Lab. Students from our lab or our collaborators' labs can feel free to email me for login information for making online orders from the different vendors in the United States or Brazil. I can also provide email and phone number info that is sometimes necessary for making or checking on purchases from the Brazilian vendors.



dna extraction tips & protocols

As noted above, I tend to use Qiagen DNeasy Blood & Tissue extraction kits, or Invitrogen kits, to get whole genomic DNA from the fishes and lizards we study, and the starting material is typically white muscle or fin clip tissue.

I prefer to conduct my extractions using 96-well extraction plates with Qiagen DNeasy 96 kits, and the DNeasy 96 protocol for Animal Tissues. I do this wherever possible, because though more expensive this is much easier and faster! However, when working in the Johnson Lab at BYU, we apply some mild modifications to the Qiagen protocol. A copy of the handbook for these Qiagen DNeasy kits is provided free from (2006 version).  We modify this Animal Tissues protocol on pgs. 35-40, because the lab centrifuge only reaches up to a max. speed of 4150 xg.  Specifically, to accommodate this, we do our first lysate spin (handbook step #10) for 15 min at 4150 xg (instead of 10 min at 6000 xg), we do the Buffer AW1 spin (handbook step #12) for 7.5-8 min at 4150 xg (instead of 5 min at 6000 xg), and we do the "topless" (no cover) Buffer AW2 spin (handbook step #14) for a first spin for 20 min at 4150 xg (instead of 15 min at 6000 xg) and a second spin of 2 min.  I personally almost always do 1 elution step with 200 ul of water (with 1 min incubation at room temperature), and this has worked very well for me, with very low error rates and reasonable DNA yields.

In Brazil, we are using Invitrogen PureLink® DNA kits to conduct extractions, and these are essentially exactly the same as the Qiagen kits, except the kits employ individual stand-alone columns for each sample, and thus limiting factors include the number of spaces and speed of available centrifuges, the speed with which you (or students) are capable of working on each column, and speed of moving columns between racks and centrifuges. Another difference is that, when using the Invitrogen kits, we always take the extra time to add RNAase to the samples before moving them to the columns.



PCR & Sanger sequencing protocols

What follow are a series of sections containing tips and protocols for conducting PCR and Sanger sequencing. Similar thermocycling routines are useful for other procedures, such as DNA digestion and ligation used during genomic library prepration, e.g. for NGS sequencing experiments.

PCR Cleanup using Millipore 96 well plates (purple, multiscreen 96)

  1. Bring total PCR volume up to 100 μl (for 25 μl reactions that you used 5 μl of for your check gel, you would have ~20–21 μl reaction volume remaining on your PCR plate after completing a check gel analysis; thus, you would need to add ~79–80 μl of autoclaved/dH2O water to bring the reaction volume up to a total of 100 μl).

  2. Transfer all 100 μl of diluted PCR reaction to a Millipore 96 well clean up plate.

  3. Turn on the Millipore vacuum manifold pump.  Place the Millipore plate on vacuum manifold (with empty or partially empty 96 well collecting plate underneath—check to make sure it is not full) for 10–15 minutes until wells are empty, making sure that the pressure gauge stays at least -24 in Hg (in the Johnson Lab, our Millipore pump usually drops down to -19 Hg; this should be maintained for the entire duration of vacuum filtration).

  4. Optional: Cover wells that are not being used if you have problems getting sufficient suction pressure in step 3 above.

    1. Optional: Cover wells that are not being used if you have problems getting sufficient suction pressure in step 3 above.

    2. Note: the wells of the plate should appear shiny and slightly wet all the time.  The diluted PCR will probably mostly get sucked through the plate fairly quickly, but should be vacuumed for several more minutes after most of the water has disappeared below.

  5. After completing the vacuum filtration step above, turn off the pump and remove the Millipore plate from the vacuum manifold.  Blot the bottom of the plate with a paper towel to remove excess water.

  6. Re-suspend clean DNA in 30 μl of autoclaved water added into each well.

  7. Place the plate on a vortex / mixer for approximately 8–10 minutes.  Use tape to hold the plate in place and make sure to switch the vortex off and set the dial to 2.5–3 (on a typical vortex machine, e.g. Scientific Industries Vortex Genie 2).

  8. Pipette product out of wells and transfer to labeled tube or plate.

  9. Alternatively, pipette product out of wells and directly into cycle sequencing reactions.



Faster PCR Cleanup using ExoSAP-IT (Exonuclease I and Shrimp Alkaline Phosphatase in buffer)

  1. Place 4 μl of ExoSAP-IT stock into each well containing ~12-15 μl PCR reactions that you desire to clean.

  2. Incubate for one cycle at 37 degrees C for 30 minutes to degrade remaining primers and nucleotides.

  3. Incubate for one cycle at 80 degrees C for 15 minutes to inactivate ExoSAP-IT.

  4. PCR products are now cleaned of main contaminants, and are now ready for DNA sequencing reactions or other down-stream lab work/analyses.



Cycle sequencing reactions using Big Dye v3.0 dye-terminator chemistry

Most labs reduce the amount of Big Dye Terminator Reaction mix they use in each cycle sequencing reaction, in order to make the available mix last longer, and hence save money (Big Dye is expensive!). Dilutions of 1/8 or 1/16 relative to the standard full reaction concentration of 4 μl  of 2.5x Big Dye mix per 10 μl total cycle sequencing reaction volume (or 8 μl per 20 μl total volume) are commonly used. The following table provides cycle sequencing reaction mix setups using each of the common dilutions.

1/8th and 1/16th Cycle Sequencing Reaction mix setups

Reaction sizes: 1/8th 1/16th Modified 1/16th
H2O 2.5 μl 2.75 μl 2.25 μl
5x Big Dye buffer 1.5 μl 1.75 μl 1.75 μl
Big Dye 1.0 μl 0.5 μl 0.5 μl
Primer (10 μM) 0.4 μl 0.4 μl 0.5 μl
DNA (10-75 ng/ μl) 5.0 μl 5.0 μl 5.0 μl
Total Volume: 10.4 μl 10.4 μl 10 μl

We run our cycle sequencing reactions on ABI 9700 PCR thermocyclers using the following protocol: 95° C for 2 min, followed by 35 cycles of 95° C for 10 s, 52° C for 5 s, and 60° C for 4 min.  Afterward, we use a standard 4° C hold (forever).  In my research, I exclusively use Modified 1/16th mixes for maximum efficiency.



Sephadex sequencing reaction cleanup

The following DNA cycle sequencing reaction cleanup protocol relies on Sephadex beads from GE Healthcare and is specific to the Johnson Lab at BYU. For example, the protocol contains directions for prepping plates of sequences for submission to the BYU DNA Sequencing Center core facility.  Still, the procedure is written in practical terms and can easily be modified to work in almost any molecular evolution or molecular biology lab.

  1. Obtain Millipore multiscreen filter plates (96 well, item doesn’t say “PCR”).  Add a used 96-well, v-bottom plate to the bottom of the filter plate.

  2. Obtain SephadexTM G-50 fine beads and plate loader.

  3. Put plate loader on aluminum foil to catch excess SephadexTM.  Then pour SephadexTM onto plate loader and fill all necessary holes (one for each sample being cleaned).

  4. Turn empty filter plate upside down and slide onto (full) plate loader.  Hold firmly, flip the two plates over, and tap plate loader gently (e.g. I use scissors for this) to get the SephadexTM to fall down into each corresponding well.  Place leftover SephadexTM on foil back into container.

  5. Add 300 μl dH2O to each well, making sure that the wells are full.  Then get balance plate and fill wells with water, as needed to balance the samples.

  6. Let plate stand for 10–15 minutes.  During this time, the water is causing the SephadexTM resin to expand.

  7. Spin plates in centrifuge (making sure to use balance plate) for 2 min at 2500 rpm x_g_.

  8. Add 20 μl dH2O to sequencing reactions.

  9. Dump water from each 96 well plate and place clean, non-balance v-bottomed plates (used in steps 5–7) back into cupboard.

  10. Add a new clean 96-well, v-bottom plate underneath each filter plate except the balance plate, making sure rows A–H are aligned on both the filter and collection plates.  Make sure each v-bottom collection plate is labeled with user/lab name and submission number/codes.  Alternatively, if you do not have final submission data, you may simply label the plates according to their order as plates #1, #2, etc. so that you know which plate is which.

  11. Transfer all sequencing samples to corresponding filter plates.

  12. Spin plates in centrifuge (again, using the same balance plate) for 2 min at 2500 rpm x_g_.

  13. Dry samples in vacufuge (vacuum centrifuge) at 60° C for 30–45 min, making sure to use a matching balance plate as needed.  Check your plate(s) at the end of vacufuging, and ensure that all water has been dried down to the bottom of the plate.

    1. Note: you may from time to time forget your plate of samples on the vacufuge… but don’t worry, vacufuging as per above for 1 hour has no adverse effects.
  14. Cover plate with thermal seal and mark off all unused wells.  Write DNA sequence submission numbers on each plate, then cover each plate and place in the DNASC (DNA Sequencing Center) fridge for processing (note the correct shelf in the fridge, which will be labeled “Electrophoresis ready”).



Cycle sequencing cleanup using EtOH/EDTA DNA precipitation procedure

For starting CS reaction volumes of 10 μl...

  1. Add 2.5 μl of 125 mM EDTA to each sample.

  2. Add 27.5 μl of 100% EtOH (cold, from 4 degrees Celsius refrigerator) to each sample.

  3. Shake slowly on vortex for a few seconds (say 5-10 seconds for plates; for tubes, invert the tube 4 or 5 times, or buzz once quickly on vortex).

  4. Wait 15 minutes.

  5. Centrifuge plate at 2,000 xg (rcf) at 4 degrees C for 30 minutes.

  6. Remove samples from centrifuge, then cover with a paper towel and invert. Place back in centrifuge and spin for 1 min at 140 x_g_ (rcf).

  7. Add 30 μl 70% EtOH (cold, from 4 degrees C refrigerator) to each sample.

  8. Centrifuge for 1,650 xg (rcf) at 4 degrees C for 15 minutes.

  9. Remove samples from centrifuge, cover with paper towel and invert. Place in centrifuge and spin for 1 min at 140 x_g_ (rcf).

  10. Samples are uncovered and mostly dry, but they need to be heated to ensure that they are completely dry. Place samples upright and uncovered into 37-40 degrees C incubator, and dry them there for 15 min. (Alternative: leave tubes/wells upright and uncovered, but optionally with the entire plate loosely covered in aluminum foil, for 1 h.)

  11. Resuspend dye-terminator-tagged DNA samples with 10 μl HI-Di Formamide, vortex VERY slowly (or by hand on surface of lab bench) for 10-60 s to mix.

  12. Incubate sample at 94 degrees C for 2 min before placing on ABI automatic sequencer.



PCR tips  

Peter Unmack's Molecular Systematics Project Tips - This page containing tips for organizing samples and conducting PCR, clean up, and sequence submissionIn the Johnson Lab, we use a series of PCR data sheets originally developed by Peter Unmack.  Here are links to Peter's example Excel spreadsheets for each step, including files for planning each of your extraction, PCR, PCR clean up, and sequencing procedure, from  See Peter's website for an example Master list for keeping samples in order. Everyone in the BYU Department of Biology uses our in-house sequencing facility, the BYU DNA Sequencing Center (DNASC; run by PhD biochemist Ed Wilcox), for Sanger sequencing on ABI automated sequencers and some NGS sequencing (e.g. Illumina runs).  Thus, for members of the Johnson Lab or Department that have come to this site (or others using DNASC services), I here also provide a sequence submission protocol for submitting plate information to the DNASC.  



PCR primers

What follows is a table of oligonucleotide DNA primers commonly used in my research (all sequences are listed from 5' to 3'), many of which were developed by Peter J. Unmack or provided by him to the Johnson Lab, at BYU. In the primer names, capital "L's" or "F's" at the beginning of the name indicate that the primer corresponds to the "light" DNA chain and is a forward primer; capital "H's" or "R's" indicate the primer sits on the "heavy" DNA chain and is a reverse primer. Annealing temperatures of 48 degrees C work well for most of the mitochondrial DNA gene primers listed here, although it is at times advantageous to increase specificity by slightly upping this to 49 degrees C (ad lib.). Typically, the nuclear genes amplify well using annealing temperatures of 52 degrees C, but some are run in nested PCRs. You will want to check the original papers, where referenced ("Source" column), to see what their exact thermoprofiles were. Also see my publications, and refs. therein, for more information, including precise amplification conditions that my colleagues and I have used for each primer; particularly good examples of this include Unmack et al. (2012) and Bagley et al. (2013, 2015, 2016).

Genome Gene Primer Sequence Source Taxon notes
mitochondrial Cytb L14725 GAYTTGAARAACCAYCGTTG Hrbek et al. (2007) Poeciliidae
mitochondrial Cytb H15982 CCTAGCTTTGGGAGYTAGG Hrbek et al. (2007) Poeciliidae
mitochondrial Cytb HA / OUT(H) CAACGATCTCCGGTTTACAAGAC Schmidt et al. (1998) cyprinids (but work in most taxa)
mitochondrial Cytb HD / INH GGGTTGTTTGATCCTGTTTCGT Schmidt et al. (1998) cyprinids (but work in most taxa)
mitochondrial Cytb LA-NEW / OUT(L) GTGACTTGAAAAACCACCGTT Schmidt et al. (1998), but modified by Peter J. Unmack cyprinids (but work in most taxa)
mitochondrial Cytb LD / INL CCATTCGTCATCGCCGGTGC Schmidt et al. (1998) cyprinids
mitochondrial Cytb H15927 TTACAAGACCGACGCTCTGA Peter J. Unmack Poeciliidae
mitochondrial Cytb Glu31 TGRCTTGAAAAACCACCGTTGT Peter J. Unmack fishes
mitochondrial Cytb Glu18 TAACCAGGACTAATGRCTTGAA Peter J. Unmack fishes
mitochondrial Cytb Thr29 ACCTTCGATCTCCTGATTACAAGAC Peter J. Unmack catfishes
mitochondrial Cytb GPThr27 TCTTCGGATTACAAAACCG Peter J. Unmack Galaxiella
mitochondrial Cytb POE.15940 GACTTTAACCTCCGACTTTCG Peter J. Unmack Poeciliidae
mitochondrial Cytb POE.15931 CCCTCGACTTTCGGTTTACAAG Peter J. Unmack Poeciliidae
mitochondrial Cytb RF.Thr.48 GCAGTAGGAGGGAATTTAACCTTCG Peter J. Unmack Rainbowfishes
mitochondrial ATPase 6/8 lys.22F AAAGCGTTAGCCTTTTAAGC Peter J. Unmack Atheriniformes
mitochondrial ATPase 6/8 lys.31F GCCTTTTAAGCTAAAGATTGG Peter J. Unmack Atheriniformes
mitochondrial ATPase 6/8 co3.62R TTATTAGAAGGGCGGCAACTG Peter J. Unmack Atheriniformes
mitochondrial ATPase 6/8 co3.23R GGCTTGGGTCAACTATGTGGT Peter J. Unmack Atheriniformes
nuclear RPS7 1F TGGCCTCTTCCTTGGCCGTC Chow & Hazama (1998) most fishes
nuclear RPS7 2R AACTCGTCTGGCTTTTCGCC Chow & Hazama (1998) most fishes
nuclear RPS7 3R GCCTTCAGGTCAGAGTTCAT Chow & Hazama (1998) most fishes
nuclear RPS7 2F AGCGCCAAAATACTGAAGCC Chow & Hazama (1998) most fishes
nuclear RPS7 2F.2.CAT GCCATGTTCAGTACCAGTGC Peter J. Unmack catfishes
nuclear RPS7 3R.2 CATCTCCAGCTCAAGMAGAG Peter J. Unmack fishes
nuclear RPS7 3R.10 TCAGAGTTCATCTCCAGCTC Peter J. Unmack fishes
nuclear RPS7 1F.2 CTCTTCCTTGGCCGTCGTTG Peter J. Unmack fishes
nuclear RPS7 2R.67 TACCTGGGARATTCCAGACTC Peter J. Unmack fishes
nuclear RPS7 3R.24 AGCTGAGCCTTCAGGTCAGAG Peter J. Unmack fishes
nuclear X-src (TK) src.e7.1F TGACAGACGTTTGTCCCGTACTGAAGC Peter J. Unmack fishes
nuclear X-src (TK) src.e7.2F GTACTGAAGCCTCAGACTCAGG Peter J. Unmack fishes
nuclear X-src (TK) src.e8.1F CTGAAGCCTGGCACCATGTC Peter J. Unmack fishes
nuclear X-src (TK) src.e8.2F CACCATGTCCCCTGAGGCKTTCCTGC Peter J. Unmack fishes
nuclear X-src (TK) src.e8.R ACGGCATAAAGCTGAACCAG Peter J. Unmack fishes
nuclear X-src (TK) src.e10.endR ATGAGKCGAGCCAGACCGAAATCAGC Peter J. Unmack fishes
nuclear X-src (TK) src.e10.end2R CCGAAATCAGCCACTTTACAMACCAG Peter J. Unmack fishes
nuclear X-yes (TK) yesA GATCGCAGACGGCATGGCCTCC Schartl et al. (1995) fishes
nuclear X-yes (TK) yesB CTGCCCTTTGTTACCAGCTCTGTC Schartl et al. (1995) fishes
nuclear X-yes (TK) yes.F1 GAGAGAATGAACTACATCCATAG Peter J. Unmack fishes
nuclear X-yes (TK) yes.F2 GACAACCTGGTCTGTAAGATCGC Peter J. Unmack fishes
nuclear X-yes (TK) yes.R1 GACCACACGTCTGATTTGATTGTGAA Peter J. Unmack fishes
nuclear X-yes (TK) yes.R2 GATTTGATTGTGAAGCGACCGTACA Peter J. Unmack fishes
nuclear Rag1 RAG1.f409.mod AGGTGYTTGTGCCGTCTCTGTGG Peter J. Unmack fishes
nuclear Rag1 Rag1.F74 TTTCGGAATGGAAGTTTAAGCTSTTTCG Sullivan et al. (2006) catfishes
nuclear Rag1 Rag1.F89 TTTAAGCTGTTTCGAGTTCGTTCATTGG Peter J. Unmack catfishes
nuclear Rag1 R1.4090R CTGAGTCCTTGTGAGCTTCCATRAAYTT Guillermo Orti fishes

Primer References ("Source" column in primer table above)

Chow S, Hazama K (1998) Universal PCR primers for S7 ribosomal protein gene introns in fish. Molecular Ecology 7:1255–1256.

Hrbek T, Seekinger J, Meyer A (2007) A phylogenetic and biogeographic perspective on the evolution of poeciliid fishes. Mol. Phylogenet. Evol. 43:986-998.

Schartl M, Wilde B, Schlupp I, Parzefall J (1995) Evolutionary origin of a parthenoform, the Amazon molly Poecilia formosa, on the basis of molecular genealogy. Evolution 49:827-835.

Schmidt TR, Bielawski JP, Gold JR (1998) Molecular phylogenetics and evolution of the cytochrome b gene in the cyprinid genus Lythrurus (Actinopterygii: Cypriniformes). Copeia 1998:14-22.

Sullivan JP, Lundberg JG, Hardman M (2006) A phylogenetic analysis of the major groups of catWshes (Teleostei: Siluriformes) using rag1 and rag2 nuclear gene sequences. Molecular Phylogenetics and Evolution 41:636-662.

[More coming soon, including primers for ldh-A (muscle-type lactose dehydrogenase; Quattro and Jones 1999) and Glyt (glycosyltransferase; Li et al. 2007) that Justin used during his dissertation work.]



NGS library prep

Protocols for double digest RAD-seq (ddRad-seq) library preparation

If you don't want to bother with library prep, see my blog entry here for links to companies/labs for outsourcing single and dd RAD-seq runs for de novo SNP discovery.

During my postdoctoral training, I have learned to prepare genomic libraries for generating Rad-tags (Rad SNP loci) using double-digest RAD-seq (ddRAD-seq) experiments with Next-Generation Sequencing (NGS) platforms. With our Brazilian collaborators, we have been using an altered version of the Peterson et al. (2012) ddRAD protocol, modified for the Ion Torrent platform, because this is faster and cheaper for us that using other platforms or outsourcing. We use similar methods to prep libraries and conduct ddRad-seq runs on the Illumina platform in the Eckert Lab. The main difference between prepping ddRAD-seq libraries for Ion Torrent runs versus Illumina Mi-Seq or Hi-Seq runs is that these two different platforms use different sets of primers! So, no big deal really; but you can get as fancy with the primers and adapters as you would like. 

Here, I am posting links to RAD-family methods, but focusing on ddRAD protocols, and I plan to expand this list in the future as I come across new protocols. 

For in-house RAD-seq library prep and sequencing, perhaps the most currently used protocol is the "original" ddRad-seq protocol of Peterson et al. (2012). A variety of different evolutionary genetics/population genomics labs have taken up this protocol and modified it for their particular study systems/organisms or NGS machines. I have found it very helpful to look over the protocols that these labs have posted freely online. Below, I give links to a few such examples from various institutions, as follows:




Parchman TL, Gompert Z, Mudge J, Schilkey FD, Benkman CW, Buerkle CA (2012) Genome‐wide association genetics of an adaptive trait in lodgepole pine. Molecular Ecology, 21(12), 2991-3005. HTML

Peterson BK, Weber JN, Kay EH, Fisher HS, Hoekstra HE (2012) Double digest RADseq: an inexpensive method for de novo SNP discovery and genotyping in model and non-model species. PLoS ONE, 7, e37135. HTML