Accessibility navigation


The automation and evaluation of nested clade phylogeographic analysis

Panchal, M. and Beaumont, M. A. (2007) The automation and evaluation of nested clade phylogeographic analysis. Evolution, 61 (6). pp. 1466-1480. ISSN 0014-3820

Full text not archived in this repository.

It is advisable to refer to the publisher's version if you intend to cite from this work. See Guidance on citing.

To link to this item DOI: 10.1111/j.1558-5646.2007.00124.x

Abstract/Summary

Nested clade phylogeographic analysis (NCPA) is a popular method for reconstructing the demographic history of spatially distributed populations from genetic data. Although some parts of the analysis are automated, there is no unique and widely followed algorithm for doing this in its entirety, beginning with the data, and ending with the inferences drawn from the data. This article describes a method that automates NCPA, thereby providing a framework for replicating analyses in an objective way. To do so, a number of decisions need to be made so that the automated implementation is representative of previous analyses. We review how the NCPA procedure has evolved since its inception and conclude that there is scope for some variability in the manual application of NCPA. We apply the automated software to three published datasets previously analyzed manually and replicate many details of the manual analyses, suggesting that the current algorithm is representative of how a typical user will perform NCPA. We simulate a large number of replicate datasets for geographically distributed, but entirely random-mating, populations. These are then analyzed using the automated NCPA algorithm. Results indicate that NCPA tends to give a high frequency of false positives. In our simulations we observe that 14% of the clades give a conclusive inference that a demographic event has occurred, and that 75% of the datasets have at least one clade that gives such an inference. This is mainly due to the generation of multiple statistics per clade, of which only one is required to be significant to apply the inference key. We survey the inferences that have been made in recent publications and show that the most commonly inferred processes (restricted gene flow with isolation by distance and contiguous range expansion) are those that are commonly inferred in our simulations. However, published datasets typically yield a richer set of inferences with NCPA than obtained in our random-mating simulations, and further testing of NCPA with models of structured populations is necessary to examine its accuracy.

Item Type:Article
Refereed:Yes
Divisions:Life Sciences > School of Biological Sciences
ID Code:10066
Uncontrolled Keywords:Coalescent, nested clade phylogeographic analysis, phylogeography, sequence analysis, simulation, MITOCHONDRIAL-DNA HAPLOTYPES, CLADISTIC-ANALYSIS, PHENOTYPIC, ASSOCIATIONS, STATISTICAL PHYLOGEOGRAPHY, GEOGRAPHICAL-DISTRIBUTION, POPULATION HISTORY, COALESCENT THEORY, GENE GENEALOGIES, COMPUTER-PROGRAM, SEQUENCE DATA

University Staff: Request a correction | Centaur Editors: Update this record

Page navigation