Journal of Heredity Advance Access originally published online on January 11, 2006
Journal of Heredity 2006 97(2):119-132; doi:10.1093/jhered/esj012
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Regional Genetic Structuring and Evolutionary History of the Impala Aepyceros melampus
From the Department of Evolutionary Biology, Institute of Biology, University of Copenhagen, Universitetsparken 15, DK-2100 Copenhagen Ø, Denmark
Address correspondence to E. D. Lorenzen at the address above, or e-mail: edlorenzen{at}bi.ku.dk.
| Abstract |
|---|
|
|
|---|
Samples of 162 impala antelope (Aepyceros melampus) from throughout its distribution range in sub-Saharan Africa were surveyed using eight polymorphic microsatellite loci. Furthermore, 155 previously published mitochondrial DNA (mtDNA) sequences from the same localities were reanalyzed. Two subspecies of impala are presently recognizedthe isolated black-faced impala (Aepyceros melampus petersi) in southwest Africa and the common impala (Aepyceros melampus melampus) abundant in southern and east Africa. All tests performed indicated significant genetic differentiation at the subspecific level. Furthermore, individual-based analyses split the common impala subspecies into two distinct genetic groups, conforming with regional geographic affiliation to southern or east Africa. This was supported by assignment tests, genetic distance measures, pairwise
values, and analysis of molecular variance. We suggest that the presence of such previously unknown regional structuring within the subspecies reflects a pattern of colonization from a formerly large panmictic population in southern Africa toward east Africa. This scenario was supported by a progressive decline in population diversity indices toward east Africa and a significant increase in the quantity
/(1
). Both microsatellite and mtDNA data indicated a genetic distinctiveness of the Samburu population in Kenya.
| Introduction |
|---|
|
|
|---|
Defining the distribution of wild animal populations is essential for determining the appropriate scale of their conservation and management (Beaumont and Bruford 1999; Moritz 1994; Paetkau 1999). A population's distribution is a product of ecological and evolutionary processes and has important long-term consequences (Moritz 1999). However, it is difficult to determine whether topographic features such as rivers or mountains, political jurisdictions such as country boundaries, or differences in morphology actually represent natural boundaries in terms of population genetics. It can therefore be useful to confirm whether the subjective classifications are consistent with genetic information (Beaumont and Bruford 1999; Pritchard et al. 2000; Rosenberg et al. 2001).
Distributed throughout southern and east Africa, the impala is one of the most abundant antelope species on the continent, numbering more than 1.5 million individuals (East 1998). It is a medium-sized antelope (Jarman and Jarman 1973), and its distinctiveness presently warrants it the sole representative of the Subfamily Aepycerotinae (Meester and Setzer 1977; Skinner and Smithers 1990), although disagreements as to the taxonomic affinity persist (Skinner and Smithers 1990). It has previously been placed with the Antilopinae and the Alcelaphinae and has been considered close to the Reduncinae (reviewed in Kingdon 1982; Meester and Setzer 1977). Presently, two subspecies of impala are recognized (Haltenorth and Diller 1977; Kingdon 1997; Nowak 1999). The geographically isolated and morphologically distinct black-faced impala (Aepyceros melampus petersi, Bocage 1879) is found only in the extreme semiarid zone in northern Namibia and southwest Angola (East 1998; Meester and Setzer 1977; Skinner and Smithers 1990; Swart 1967; Figure 1). It is listed as vulnerable by the International Union for the Conservation of Nature and Natural Resources (East 1998), numbering only some 2,000 individuals in the wild (Green and Rothstein 1998). The common impala (Aepyceros melampus melampus, Lichtenstein 1812) is abundant throughout its range and is widely distributed from Kenya and southern Uganda to Northern KwaZulu-Natal in South Africa, extending westward in the more southerly parts of its range (Figure 1). Favored for farming and hunting, the subspecies has been widely introduced to privately owned land and game reserves in Zimbabwe, South Africa, and Namibia (East 1998). The intraspecific taxonomy of the common impala has been subject to disagreements. Up to five subspecies have previously been listed (Meester and Setzer 1977), but with overlapping and ill-defined boundaries. Presently one subspecies is recognized (Kingdon 1997).
|
Previous analyses of the genetic structure of African bovids have revealed a significant regional distinction of populations into southern and eastern Africa. For example, hartebeest (Alcelaphus buselaphus), topi (Damaliscus lunatus), kudu (Tragelaphus strepsiceros), sable antelope (Hippotragus niger), and roan antelope (Hippotragus equinus) exhibited varying degrees of regional differentiation (Arctander et al. 1999; Mathee and Robinson 1999; Nersting and Arctander 2001; Van Hooft et al. 2002). It has been suggested that these previously pan-African species became extinct during the Pleistocene except in refuges in the south and east. The violent climatic fluctuations characteristic of this period have therefore influenced the appearance, geographical expansion, and subsequent evolution of these species (Arctander et al. 1999). Interestingly, the wildebeest (Connochaetes taurinus) survived only in the south, with subsequent recolonization toward the east leading to its current distribution range (Arctander et al. 1999). This scenario of refugia is supported by the fact that a similar pattern of spatial variation has been observed in carnivores (Girman et al. 1993) and rodents (Mathee and Robinson 1997).
The objectives of this study were to analyze DNA polymorphism in the impala (A. melampus) in order to (1) determine the spatial genetic structuring of the species across its distribution based on nuclear microsatellite loci, particularly within the common impala subspecies (A. m. melampus) in southern and east Africa, and (2) investigate the evolutionary history of the impala using both microsatellites and mitochondrial DNA (mtDNA) loci.
| Materials and Methods |
|---|
|
|
|---|
Samples
We used eight microsatellite markers to genotype a total of 162 impala, collected from 11 localities in seven different countries across the species' distribution range in southwest, south, and east Africa (Figure 1). Eighty-three samples of the black-faced impala subspecies (A. m. petersi) were collected from three sites in and around Etosha National Park, Namibia: Ombika (OM, n = 16), Olifantsbad (OL, n = 33), and Halali (HA, n = 34). The 79 common impala (A. m. melampus) samples were sampled between 1992 and 1997 from Chobe National Park, Botswana (CH, n = 10); Shangani in Zimbabwe (SH, n = 11); Kafue National Park, Zambia (KAF, n = 5); South Luangwa National Park, Zambia (LU, n = 6); Selous Game Reserve, Tanzania (SE, n = 12); Burko Forest Reserve, Tanzania (BK, n = 13); Lake Mburu National Park, Uganda (LM, n = 12); and Samburu National Reserve, Kenya (SA, n = 10) (Figure 1).
All samples were collected by remote skin biopsy sampling (Karesh et al. 1987) from dry skin or ear nicks. In the field, samples were immediately stored in a solution of 25% dimethyl sulfoxide saturated with NaCl (Amos and Hoelzel 1991). In the laboratory, samples were stored at 80°C.
Laboratory Methods
DNA Extraction and Amplification
Total genomic DNA was extracted from tissue using the Qiagen DNeasy Tissue Kit. The double-stranded DNA was purified, desalted, and concentrated through QiaQuick PCR purification kit columns and amplified using the polymerase chain reaction (PCR). We analyzed eight microsatellite loci originally isolated in domestic cattle, goat, and sheep genomes (one bovine, two caprine, and five ovine), which amplified robustly and repeatedly and were found highly polymorphic (Table 1 with associated reaction conditions).
|
PCR amplifications were modified from the original protocols as follows: 10 min at 95°C, 10 cycles of touchdown with each cycle stepped down by 1°C followed by 2540 cycles consisting of 95°C for 30 s, annealing fixed at the lowest temperature of the touchdown cycles for 30 s and extension at 72°C for 30 s, and final extension for 1 h. Cycles were carried out in a 10-µl reaction volume containing 100200 ng of genomic DNA, 1x PCR buffer (15 mM Tris-HCl, pH 8.3, 50 mM KCl), 5 mM MgCl2, 0.2 mM of each deoxynucleoside triphosphate, 0.5 µM of each primer (forward primer labeled with FAM or JOE; Applied Biosystems, Foster City, CA), 0.5 units AmpliTaq Gold DNA polymerase, 0.1 mg/ml of purified bovine serum albumin and deionized water to 10 µl.
Microsatellite Genotyping
The PCR products were separated on a 5% denaturing, polyacrylamide gel with 1x TBE (0.089 M Tris, 0.089 M borate and 0.0020 M EDTA) buffer in an automated DNA sequencer (ABI 377, Perkin Elmer Inc., Norwalk, CT). Microsatellite allele sizes were scanned using the ABI software GENESCAN 3.1 (Applied Biosystems), and individual genotypes were determined using GENOTYPER 2.1. When ambiguous banding patterns occurred, samples were genotyped again to confirm allele sizes. If the PCR failed or no bands could be scored after multiple trials, the individual sample was re-extracted, and where it failed again, it was abandoned.
Statistical Analyses
Impala population structure was inferred from the spatial distribution of microsatellite genotypes across all the eight loci. In order to cope with the fairly continuous impala distributions we applied two different statistical approaches. First, we ignored any a priori notation of groupings and rather based the analyses on a spatial scale using a model-based Bayesian procedure (Pritchard et al. 2000). Next, we characterized the amount and distribution of genetic variation within and among groups or localities and performed the analyses on sample allele or genotype frequencies. These localities were predefined according to the geographical proximity of individuals and borders between countries (Figure 1).
Individual-Based Analyses
Spatial Structure.
Population structure based on the genotypes of individuals was assessed using a model-based Bayesian procedure, implemented in the program STRUCTURE version 2.1 (Pritchard et al. 2000). This model involves placing individuals into K (unknown) populations of origin and simultaneously assigns the individuals to the populations with explicit estimates of their 90% confidence intervals. These populations of origin are genetic clusters with distinctive allele frequencies, and individuals are probabilistically assigned to one cluster (the population of origin) or more than one cluster (the parental populations) if they are genetically admixed. K is chosen in advance but can be varied across independent runs of the algorithm. Individuals can have membership in multiple clusters, with the membership coefficients summing to 1 across clusters.
We analyzed spatial structure based on the "admixture model" and correlated allele frequencies; runs were based on 106 iterations and a burn-in period of 50,000 iterations. Ten individual repetitions of each estimate of K (211) were run in order to check for consistency in the results.
Frequency-Based Analyses
Genetic Diversity and Genotypic Linkage Disequilibrium.
Allele frequencies, observed heterozygosities (HO), and expected heterozygosities (HE) were estimated using the software GENEPOP version 3.1c updated from Raymond and Rousset (1995), as implemented for online access by E. Morgan (http://wbiomed.curtin.edu.au/genepop/). We used the Microsatellite Toolkit ver 3.1 (Park 2001) add-in utility for Microsoft Excel to compute the mean number of alleles (MNA) per locus for the 11 populations.
Deviations of genotypic frequencies from Hardy-Weinberg (HW) proportions were expressed as f, the Weir and Cockerham (1984) estimator of FIS, using randomization tests. Probabilities of significance were estimated using the Markov chain method (dememorization 1,000, batches 1,000, iterations per batch 1,000) (Guo and Thomson 1992) as implemented in GENEPOP. The null hypothesis of no genotypic linkage disequilibrium was tested in all samples, and significance values were computed for each locus pair by unbiased estimates of Fisher's exact tests using the Markov chain method available in GENEPOP. To determine the statistical significance of these tests, the probability values were adjusted for multiple simultaneous table-wide tests using the sequential Bonferroni adjustments (Rice 1989).
Severe reductions in effective population sizes of the samples (i.e., bottlenecks) were tested using the program BOTTLENECK version 1.2.02 (Piry et al. 1999). The tests are based on the fact that populations that have experienced a recent reduction in effective population size exhibit a more rapid reduction in allelic diversity than heterozygosity (i.e., gene diversity HE) at polymorphic loci. Hence, in a recently bottlenecked population, HE (as expected under HW) exceeds the heterozygosity expected in a population at mutation-drift equilibrium (HEQ) (Cornuet and Luikart 1996; Luikart and Cornuet 1998). The two-phase microsatellite mutation model (TPM) is intermediate to the stepwise mutation model (SMM) and the infinite allele model (IAM), and most microsatellite data sets better fit the TPM than the SMM or IAM (Di Rienzo et al. 1994). The TPM recommended by the authors of BOTTLENECK for microsatellites consists of mostly one-step mutations, and a small percentage (5%10%) of multistep changes (Luikart et al. 1998). We ran the program using a 95% one-step mutations and 5% multistep mutations (with an exponential distribution of step numbers having an average of four steps) (Luikart et al. 1998; Piry et al. 1999). To see if results differed with the mutation model assumed, the program was run also assuming an SMM and IAM. Cornuet and Luikart (1996) and Luikart et al. (1998) proposed three tests, which are implemented in the BOTTLENECK software for the detection of HE excess. Of these, we applied the "sign test" and "Wilcoxon sign-rank test" (the latter recommended by Maudet et al. 2002 and Piry et al. 1999). The "standardized differences test" requires at least 20 polymorphic loci (Cornuet and Luikart 1996) and was therefore not applied to this data set.
Population Structure.
Population structure was quantified using analysis of molecular variance (AMOVA; Excoffier et al. 1992) as implemented by the program ARLEQUIN version 2.000 (Schneider et al. 2000). This uses F statistics introduced by Wright (1951) and modified by Weir and Cockerham (1984) to partition the total genetic variation into specified subdivisions and tests for overall levels of differentiation. The proportion of genetic variation was determined for the following components: between individuals within samples (populations grouped by sites of collection) (FIS), among samples pooled within broad regional scales (southwest, south, and east) (FSR), and among these regional scales (FRT). The significance of each variance component was tested with permutation tests (Excoffier et al. 1992).
The Weir and Cockerham (1984) FST analogue
, a measure of among-population variance in allele frequencies, was calculated as pairwise differences between all samples using the ARLEQUIN software and its significance tested by permuting individuals among samples. We did not use Slatkin's (1995) RST value, developed as an analogue to FST, as this takes explicit account of the SMM (Balloux and Lugon-Moulin 2002). Estoup et al. (1995) have shown that population data from compound microsatellites fit the IAM better than the SMM, and hence, the latter is not appropriate for our data, where six out of eight loci are compound or imperfect (Table 1). The linear regression of
/(1
) against the geographical distance between pairs of common impala sampling sites was calculated, as suggested by Rousset (1997), and tested for significance by a Mantel test (Mantel 1967).
Following Hansen et al. (2001), we used the program GENECLASS2 (Piry et al., 2004, available at http://www.montpellier.inra.fr/CBGP/softwares/index.html) for individual assignment tests as defined by Paetkau et al. (1995). These were used to determine the likelihood of each individual's genotype being found at the location from which it was sampled as described by Cornuet et al. (1999). The individual was excluded from a given candidate population if the individual probability of belonging to the particular population was lower than 1%. To avoid possible bias of self-assignment of individuals to their population of origin, the "leave one out" procedure was followed excluding the tested individual when calculating the allele frequency distribution of their own population. In order to avoid problems potentially caused by nonsampled microsatellite alleles (when a population has a zero frequency for a given allele, individuals having that allele will be assigned a zero likelihood of originating from that population), we used a frequency-based approach (Paetkau et al. 1995) assuming an allele frequency of 0.01 in the event of zero frequency. Probabilities of individuals belonging to populations were calculated using a resampling algorithm developed by Paetkau et al. (2004), with 10,000 simulated individuals, as recommended by the authors of the program.
The extent of divergence among the 11 populations was also quantified by Nei's standard genetic distance DS (Nei 1972). Pairwise distances were used to estimate a population phylogram using the neighbor-joining (NJ) algorithm (Saitou and Nei 1987) available in PHYLIP version 3.5c (Felsenstein 1993). Percent confidence values on tree topology were estimated by 1,000 bootstraps performed resampling loci, and the tree was visualized using TREEVIEW (Page 1996).
Sequence Data
One hundred and fifty-five sequences of the mitochondrial control region (
400 bp) (GenBank accession numbers AF 301740AF 301894; Nersting and Arctander 2001) from individuals throughout the impala range were reanalyzed. To verify that the sequences were not of nuclear origin, Nersting and Arctander (2001) sequenced the control region fragment, cyt b, from selected samples. Sixty-four of the control region sequences were from the populations analyzed in the present study, and were therefore genotyped using microsatellites. An AMOVA was performed to determine the genetic variations: individuals within samples, among samples within regions (southwest, south, and east) (
SR), and among regions (
RT). The tests and the significance of each variance component were determined with permutation tests (Excoffier et al. 1992) in the program ARLEQUIN (Schneider et al. 2000). An NJ tree was estimated from the matrix of Nei's net (DA) number of nucleotide differences between populations (Nei and Li 1979) using PHYLIP 3.5c (Felsenstein 1993) and visualized using TREEVIEW (Page 1996).
| Results |
|---|
|
|
|---|
HW and Genotypic Linkage Disequilibrium
After sequential Bonferroni correction for multiple tests, no site-by-locus comparison in the common impala was significantly different from HW proportions. There was one deviation in the black-faced sample for locus OarFCB 48 (P < .05), which showed an excess of homozygotes, which could be due to null alleles. Analyzing the seven remaining loci without OarFCB 48 did not result in any significant departures from HW proportions in the populations. However, pooling the black-faced samples into one large population resulted in heterozygote deficiencies at two loci (OarFCB 48 and OarFCB 304), leading to significant departures from HW (P < .05). The deficiency at OarFCB 304 was interpreted as the Wahlund effect, indicating the differentiation between the black-faced impala populations, which were therefore analyzed separately. Only one allele was observed in KAF at OarFCB 48, and no HW test was performed on this site-by-locus comparison.
None of the individual tests for genotypic linkage disequilibrium were significant after Bonferroni correction for multiple testing (data not shown). The variation at the eight loci was therefore seen as statistically independent.
Individual-Based Analyses
Spatial Structure.
Population structure was assessed using the entire sample set (n = 162). At K = 2, the clusters were anchored by subspecific origin (Figure 2a). Increasing the value of K to 3 resulted in the common impala group splitting into two groups, conforming to the geographical proximity of the samples (Figure 2b). CH, SH, KAF, and LU in the south clustered together, as did LM and SA in the east. SE was assigned with almost equal probability to both common impala clusters (49% to south, 51% to east). BK showed an intermediate clustering probability (26% to south, 73% to east) between values observed for SE and those observed for LM and SA, the latter clustering together with very high probability (94.2% and 98.5%, respectively). The two clusters representing the southern and eastern common impala samples remained the same for K = 4. The proportion of black-faced individuals assigned to each population was roughly symmetric for K = 4 and K = 5 (Figure 2c,d), although the latter split the common cluster obtained from the previous value. SA was the only population that retained a proportion of membership >0.80 to one of the clusters (q = 0.90). LM, which also had a highly significant
value in the pairwise comparisons (see subsequently), had a q value of 0.67 for belonging to another of the clusters.
|
Frequency-Based Analyses
Genetic Diversity.
In total, 82 alleles were observed across the eight loci surveyed. The number of alleles per locus ranged from 4 (OarFCB 48) to 15 (OarFCB 304) (Appendix, Table A1), with an average of 10 alleles per locus.
The common impala subspecies had a higher number of unique alleles (35% of the alleles genotyped) with a total cumulative frequency (Cfreq) of 20%. The black-faced impala subspecies showed only two unique alleles (Cfreq = 0.7%). Within the common impala subspecies, the southern samples had 15 alleles not found in the eastern samples (Cfreq = 8%). The eastern samples had 16 alleles not found in the southern samples (Cfreq = 7%).
The amount of genetic variability in the samples was estimated as the number of alleles per locus and as observed (HO) and expected (HE) heterozygosity (the latter is shown in the Appendix, Table A1). The measures varied substantially among loci and samples. Average values of observed and expected heterozygosity were HO = 0.61 (min = 0.10, max = 1) and HE = 0.67 (min = 0.19, max = 0.98). Within the common impala, both the average number of alleles and the observed heterozygosity tended to be slightly higher in the southern samples than in the east. The highest average heterozygosity was observed in the southern common samples CH and SH (78% and 70%, respectively). These also had relatively high MNA values (5.8 and 5.3, respectively) exceeded only by SE (MNA = 6.4). The lowest average heterozygosity across all samples was observed in SA (48%), which was also the least variable (MNA = 3.75).
Recent bottlenecks in the history of the impala were not statistically supported. None of the studied samples exhibited a statistically significant excess of HE over HEQ across the eight microsatellites under any of the microsatellite mutation models assumed, regardless of the method applied.
Quantitative estimates of hierarchical gene diversity indicated significant genetic population structure at every level where 19.3% of the genetic variation occurred between regions, 5.1% was due to differentiation among samples within each region, and 75.6% was due to variation in samples (Table 2). The largest percentage of within-region variation among samples was observed in east Africa (
SR = 0.14), attributed in part to the genetic distinctiveness of SA (with this sample removed,
SR = 0.09), when compared to the southwest and south (with
SR = 0.05 and 0.03, respectively).
|
Population Structure.
In the pairwise
comparisons, all but three comparisons within the common impala showed significant differentiation between samples at the 5% level (Table 3). This was within the southern common impala, where KAF was not significantly differentiated from CH, SH, or LU. Within the east African samples, all pairwise comparisons were highly significant (P < .001). At the 1% level, the pairwise comparisons between KAF and SE and between CH and SH no longer retained significance, and lowering the P value to the .01% level resulted in the comparison between KAF and BK no longer being significant.
|
Figure 3 shows the regression of
/(1
) against geographical distance. A Mantel test showed that the positive association between genetic differentiation and geographical distances separating samples was significant (P < .01).
|
Results of the assignment tests are depicted in Table 4. The black-faced impala individuals not assigned to sample of origin were all assigned to another black-faced impala sample. Regarding the common impala, all individuals within the southern group were assigned to region of origin; no individuals were assigned to samples in the east African region. Within east Africa, the geographically intermediate samples from SE and BK had individuals assigned to samples in southern Africa and individuals from LM and SA not assigned to their population of origin were assigned to other east African samples.
|
The genetic distances of the microsatellites between impala samples is illustrated in Figure 4 as an NJ tree. There is a marked topology in the tree, the genetically distinct and geographically isolated black-faced impala forming a sister group to the southern common impala samples with a bootstrap value of 99. Distances between black-faced and southern impala ranged between 0.55 and 1.53 (mean DS = 0.93) (Table 3). Furthermore, a clear grouping within the common samples along a south to east gradient was indicated. CH, SH, KAF, and LU in the south clustered separately from SE, BK, LM, and SA in the east with a bootstrap value of 86. SA was separated from the remaining east African samples by a strikingly long branch (DS = 0.400.57; mean = 0.50). Distances between SA and the southern common impala samples (CH, SH, KAF, and LU; DS = 0.951.34; mean = 1.06) exceeded those separating the black-faced impala from the southern commons.
|
Sequence Data
The AMOVA performed on the sequence data revealed a high degree of genetic substructure among regional groups (
RT = 0.34; Table 2). Within the east African region, the
SR value was appreciably higher than those observed in the southern region (0.29 and 0.06, respectively). In the NJ tree based on DA between populations, there was a clear segregation of the black-faced impala populations and the common impala populations into two separate clades. No regional structure within the common impala samples was revealed using control region data. Interestingly, the sample from SA was separated from the rest of the common impala by a marginally longer branch (data not shown).
| Discussion |
|---|
|
|
|---|
Spatial Population Structure
The use of highly polymorphic microsatellite loci and assignment tests has greatly increased the potential for understanding population structure across a landscape. The Bayesian approach as implemented in STRUCTURE has proved to be a powerful instrument in many instances where genetic differentiation is high (Maudet et al. 2002; Pritchard et al. 2000; Randi and Lucchini 2002; Randi et al. 2001; Rosenberg et al. 2001) and can also be used when population structure is more complex (Rosenberg et al. 2002). Correspondence between regional affiliation and genetic ancestry has previously been reported (Bowcock et al. 1994; Manel et al. 2002; Pritchard et al. 2000; Rosenberg et al. 2002), and some regional genetic structuring within the impala was detected without using prior geographical information.
With K = 2 (Figure 2a), clusters were anchored by the two presently recognized subspecies of impala: A. m. petersi and A. m. melampus (Kingdon 1997; Shortridge 1934; Skinner and Smithers 1990; Swart 1967). This was supported by the NJ tree, where the geographically isolated black-faced impala constituted a conspicuous and well-supported sister group to the common impala using both microsatellite and mtDNA data (Figure 4). With K = 3 (Figure 2b), the common impala was split into two clusters across its spatial scale, representing southern and eastern regions. Selous National Reservelocated in southeast Tanzania midway between south and eastclustered in both with equal probability (Figure 2b). The genetic affiliation of eastern samples to the southern cluster decreased with geographic distance; Burko Forest Reserve in northern Tanzania had a correspondingly lower probability of clustering in the south, revealing a level of genetic admixture between the two regions. A certain degree of regional affiliation within the common impala was also revealed using prior geographic information in the assignment tests (Table 4). Individuals from Selous and Burko were assigned (with decreasing fractions) to southern samples. Recent gene flow between previously allopatric regional groups harboring separate evolutionary histories or a degree of isolation by distance of one large population could explain the mixed clustering of the intermediate samples. However, some caution should be expressed in the interpretation of the results as sample sizes for the intermediate KAF and LU were small (n < 10). The regional split of the common impala into a south and an east African group was well supported in the microsatellite distance tree (Figure 4) but was not obvious in the haplotypic tree (Figure 3; Nersting and Arctander 2001). Reanalyzing the mtDNA data using populations as operational taxonomic units did not uncover any regional structuring within the subspecies.
Pairwise comparisons (Table 3) revealed a degree of regional affiliation between southern common impalas and gene flow between south and east. Within the eastern region, all pairwise comparisons were highly significant, reflecting a high degree of differentiation between samples. Within the southern region, however, only two pairwise comparisons were highly significant, suggesting a higher level of gene flow or less genetic differentiation among samples in the south. Furthermore, in concordance with the assignment tests, the genetic affinity between samples from the two regions increased with decreasing geographical proximity. Comparisons between geographically intermediate Kafue National Park in Zambia (southern region) and Selous and Burko in Tanzania (eastern region) revealed low genetic differentiation.
In the individual-based analyses, increasing the value of K to 4 (Figure 2c) resulted in the splitting of the black-faced impala into two groups. As significant genetic structuring is detected only if the underlying genetic differentiation is high, the pattern observed suggested a lack of genetic structuring within the black-faced subspecies, a higher degree of genetic differentiation being found within the common impala. This was supported by the assignment tests, where no correlation was found between the fraction of black-faced individuals assigned to a location other than their origin and the proximity of that location to the sampling site. This was expected; the black-faced impala populations in Etosha National Park are derived from 280 individuals captured close to the Angolan border and released in the park around 1970 (Green and Rothstein 1998; Lorenzen and Siegismund 2004).
Higher genetic diversity values (MNA and HE; Appendix, Table A1) were observed in the southern African common impala than in the southwest or east, Chobe National Park (Botswana) and Shangani (Zimbabwe) being the most diverse. Selous (southeast Tanzania) had the highest observed MNA, likely an effect of being an intermediate compilation of the disparate genetic clusters in south and east (Figure 2b). Samburu (Kenya) had the lowest HE and MNA observed across all samples. This was also seen in the mtDNA analyses (Nersting and Arctander 2001), where Samburu had a markedly lower within-population nucleotide diversity value (0.75%). Highest values were again observed in the south (3.66% in Chobe, 2.64% in Shangani), Burko and Lake Mburu National Park in Uganda showing intermediate values (1.11% and 1.14%, respectively).
Microsatellites attributed a greater component of the total variance to the variance within samples than did the AMOVA on control region sequences (Table 2). This was expected, as microsatellites mutate at a higher rate than the mitochondrial control region. Therefore, a higher degree of variation is found in the former, which is apparent at the individual level (Estoup and Angers 1998). The east African region showed a higher degree of among-sample variation and lower within-sample diversity (Appendix, Table A1) in both analyses. This was not only caused by the marked genetic divergence of the Samburu impala supported by microsatellite and mtDNA data, individual-based analyses, a markedly lower MNA and HE, and highly significant
values in all pairwise comparisons but also caused by greater fragmentation within this region. The control region analysis partitioned a greater percentage of the overall variance to differences among the three geographical regions, which was not surprising given the genetic distinctiveness of the black-faced haplotypes (see Nersting and Arctander 2001 for further details). Even though lower on an absolute scale, the differentiation among regions at the microsatellites approaches about half the maximum possible. As Hedrick (1999) showed, the upper limit for FST [we used the Weir and Cockerham (1984) analogue
] in highly variable markers is less than the expected average homozygosity in the samples, which is 0.38 in our case.
Population History
The south/east contrast seen in the spatial genetic structuring of the common impala could be a reflection of distinct regional evolutionary histories and is supported by the individual-based analyses and the distance tree. Subsequent gene flow after secondary contact could explain these results coupled with the diversity indices,
pairwise comparisons, assignment tests, and the regression analysis. Given that previous studies of several other species of African bovids have shown the same pattern of vicariance, sable antelope (H. niger), roan antelope (H. equinus), hartebeest (A. buselaphus), topi (D. lunatus), wildebeest (C. taurinus), kudu (T. strepsiceros), and buffalo (Syncerus caffer) (Arctander et al. 1999; Mathee and Robinson 1999; Nersting and Arctander 2001; Van Hooft et al. 2002), argues for such an interpretation.
Conversely, the considerable gene flow between south and eastthe degree of genetic affiliation increasing with geographic proximitycould be the result of isolation by distance, the distinctiveness of the east African localities being the result of discontinuous sampling of discrete populations. The higher genetic diversity values and lacking differentiation between localities in the south coupled with the genetic distance tree could suggest a colonization scenario toward the east from the large, long-standing, panmictic, and more genetically diverse southern population. A similar pattern has previously been proposed for the wildebeest (Connochaetes taurinus), whose present distribution range in east Africa is believed to be the result of a more recent expansion from southern Africa (Arctander et al. 1999). However, it is difficult to distinguish history from recurrent processes within the common impala subspecies from our results.
The large amount of alleles unique to the common impala subspecies, their high cumulative frequency, the larger haplotypic diversity, and the greater genetic variability suggest that the black-faced subspecies resulted from a migration westward. The microsatellite distance tree supports a migration from the southern region; however, this could reflect frequency similarity rather than population history. Subsequent prolonged isolation of this founding population in the southwest, perhaps during the climatic fluctuations of the Pleistocene (deMenocal 1995; Partridge et al. 1995), has resulted in the genetically and morphologically distinct subspecies. A Pleistocene refuge for arid-adapted species in southwest Africa has previously been proposed for the roan antelope (Mathee and Robinson 1999), the topi (Arctander et al. 1999), and the kudu (Nersting and Arctander 2001). However, if the impala was previously distributed throughout southern Africa, subsequent allopatry between the southwestern and southern populations could have resulted in the present subspecific distributions. With respect to the Etosha populations analyzed, as there has been no gene flow between locations since they were established 30 years ago, the level of genetic differentiation between populations can be seen as a measure of the degree of founder effect and genetic drift within the park since the populations were established.
Individuals from Samburu National Reserve in Kenya, the most northern locality within the impala distribution range, continuously grouped within one cluster with very high probabilities in the individual-based analyses (Figure 2e), emphasizing the genetic distinctiveness of this population. This coupled with microsatellite (Figure 4) and mtDNA distance data, a markedly lower MNA and HE (Appendix, Table A1) and highly significant
values in all pairwise comparisons (Table 3), supported the marked genetic divergence of the Samburu population. This could be explained by the reported significant decrease in impala numbers in the Samburu National Reserve in the 1970s and the 1980s (East 1997). During this period, the deterioration of Kenya's wildlife sector reached crisis proportions. An unprecedented wave of poaching reduced the country's elephant population by >85% and the black rhino population by >98%. Parks were affected by inadequate equipment for antipoaching patrols and road maintenance, illegal encroachment of cattle, and rampant corruption at entry points where vital revenue was being lost (Leakey 1988). This affected wildlife populations across species across the entire country. The impala population in Samburu was seriously affected but has since stabilized and now numbers some 800 individuals (East 1997). Although not statistically supported, this circumstantial evidence could indicate a bottleneck in the recent history of this population. Another reason for the genetic distinctiveness of the Samburu impala could perhaps be found in the subspecific distributions of the kudu (T. strepsiceros). The kudu is an antelope with the same wide, congruent distribution range as the impala, with equivalent habitat requirements. Three subspecies are recognized by Kingdon (1997), with the subspecies Tragelaphus strepsiceros chora being confined to north-east Africa. The southernmost range of T. s. chora is Samburu. If a subspecies with a similar distribution range has existed in the history of the impala, and since perished in the north, the impala in Samburu National Reserve could be the last remnant of a genetically distinct impala subspecies in north-east Africa.
| Conclusions |
|---|
|
|
|---|
In conclusion, individual-based analyses using microsatellites uncovered regional structuring between southern and east African common impala (A. m. melampus), not previously observed using mitochondrial control region data. We propose that the genetic differences within this subspecies reflect isolation by distance after a recent colonization from a long-standing population in southern Africa toward the east and north. The isolation has not been for a duration long enough to be evident with mtDNA. This was supported by high levels of genetic admixture only within and between southern impala and geographically intermediate samples from the east, a progressive decline in genetic diversity toward the east, and the lacking morphological diagnostics of impala from the two regions (Kingdon 1997; Nowak 1999). Both microsatellite and mtDNA data indicated a genetic distinctiveness of the Samburu population in Kenya.
| Appendix A |
|---|
|
|
|---|
Table A1.. Measures of genetic variation across eight microsatellite loci in black-faced and common impala by collection site
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Summary of sample sizes (n), inbreeding coefficient f of Weir and Cockerham (1984), expected heterozygosities (HE), and mean number of alleles (MNA) of the populations studied are listed. Numbers in bold depict significant deviations from expected HW proportions after Bonferroni correction.
| Supplementary Data |
|---|
|
|
|---|
One supplementary color figure is available at Journal of Heredity online (www.jhered.oxfordjournals.org).
| Acknowledgments |
|---|
This study was funded by the Danish International Development Agency Wildlife Genetics Program and the Danish Natural Science Research Council. Jonathan Pritchard provided assistance with STRUCTURE, Martin Bay Hebsgaard provided assistance with ILLUSTRATOR, and Silvain Piry helped with GENECLASS2. Thanks to Anders J. Hansen, Henrik Glenner, and John Okello for helpful comments on the manuscript.
| Footnotes |
|---|
Corresponding Editor: Oliver Ryder
Received November 9, 2004
Accepted November 18, 2005
| References |
|---|
|
|
|---|
-
Amos B and Hoelzel AR, 1991. Long-term preservation of whale skin for DNA analysis. In: Genetic ecology of whales and dolphins (Hoelzel AR and Donovan GP, eds). Cambridge: The International Whaling Commission; 99103.
Arctander P, Johansen C, and Coutellec-Vreto MA, 1999. Phylogeography of three closely related African bovids (tribe Alcelaphini). Mol Biol Evol 16:17241739.[Abstract]
Balloux F and Lugon-Moulin N, 2002. The estimation of population differentiation with microsatellite markers. Mol Ecol 11:155165.[CrossRef][Medline]
Beaumont MA and Bruford MW, 1999. Microsatellites in conservation genetics. In: Microsatellites. Evolution and applications (Goldstein DB and Schlötterer C, eds). Oxford: Oxford University Press; 165182.
Bhebhe E, Kogi J, Holder DA, Arevalo E, Derr JN, Linn RA, Ruvuna F, Davis SK, and Taylor JF, 1994. Caprine microsatellite dinucleotide repeat polymorphisms at the SR-CRSP-6, SR-CRSP-7, SR-CRSP-8, SR-CRSP-9 and SR-CRSP-10 loci. Anim Genet 25:203.[Web of Science][Medline]
Bowcock AM, Ruiz-Linares A, Tomfohrde J, Minch E, Kidd JR, and Cavalli-Sforza LL, 1994. High resolution of human evolutionary trees with polymorphic microsatellites. Nature 31:455457.
Buchanan FC and Crawford AM, 1992. Ovine dinucleotide repeat polymorphism at the MAF209 locus. Anim Genet 23:183.[Web of Science][Medline]
Buchanan FC and Crawford AM, 1993. Ovine microsatellites at the OarFCB11, OarFCB128, OarFCB193, OarFCB266 and OarFCB304 loci. Anim Genet 24:145.[Medline]
Buchanan FC, Galloway SM, and Crawford AM, 1994. Ovine microsatellites at the OarFCB5, OarFCB19, OarFCB20, OarFCB48, OarFCB129 and OarFCB226 loci. Anim Genet 25:60.[Web of Science][Medline]
Cornuet JM and Luikart G, 1996. Description and power analysis of two tests for detecting recent population bottlenecks from allele frequency data. Genetics 144:20012014.[Abstract]
Cornuet JM, Piry S, Luikart G, Estoup A, and Solignac M, 1999. New methods employing multilocus genotypes to select or exclude populations as origins of individuals. Genetics 153:19892000.
deMenocal PB, 1995. Plio-Pleistocene African climate. Science 270:5359.
Di Rienzo A, Peterson AC, Garza JC, Valdes AM, Slatkin M, and Freimer NB, 1994. Mutational processes of simple-sequence repeat loci in human populations. Proc Natl Acad Sci USA 91:31663170.
East R, 1997. Kenya. Antelope survey update No. 5 (East R, ed). IUCN/CCS Antelope Specialist Group Report; 340.
East R, 1998. Antelope Specialist Group Report. Gland, Switzerland: IUCN.
Estoup A and Angers B, 1998. Microsatellites and minisatellites for molecular ecology: theoretical and empirical considerations. In: Advances in molecular ecology (Carvalho GR, ed). Amsterdam: IOS Press; 340.
Estoup A, Taillez C, Cornuet JM, and Solignac M, 1995. Size homoplasy and mutational processes of interrupted microsatellites in apidae species, Apis mellifera and Bombus terrestris. Mol Biol Evol 12:10741084.[Abstract]
Excoffier L, Smouse PE, and Quattro JM, 1992. Analysis of molecular variance inferred from metric distances among DNA haplogroups: applications to human mitochondrial DNA restriction data. Genetics 131:479491.[Abstract]
Felsenstein J, 1993. PHYLIP (phylogeny inference package) version 3.5c. Distributed by the author. Department of Genetics, University of Washington, Seattle.
Georges M and Massey JM, 1992. Polymorphic DNA markers in Bovidae. Patent WO92/13102.
Girman DJ, Kat PW, Mills MG, Ginsberg JR, Borner M, Wilson V, Fanshawe JH, Fitzgibbon C, Lau LM, and Wayne RK, 1993. Molecular genetic and morphological analyses of the African wild dog (Lycaon pictus). J Hered 84:450459.
Green WCH and Rothstein A, 1998. Translocation, hybridization, and the endangered black-faced impala. Conserv Biol 12:475480.[CrossRef]
Guo SW and Thomson EA, 1992. Performing the exact test of Hardy-Weinberg proportions for multiple alleles. Biometrics 48:361372.[CrossRef][Web of Science][Medline]
Haltenorth T and Diller H, 1977. Bestimmungsbuch: Säugetiere Afrikas und Madagaskars. Munich: BLV Verlagsgesellschaft; 9091.
Hansen MM, Kenchington E, and Nielsen EE, 2001. Assigning individual fish to populations using microsatellite DNA markers. Fish Fish 2:93112.
Hedrick PW, 1999. Perspective: highly variable genetic loci and their interpretation in evolution and conservation. Evolution 53:313318.[CrossRef][Web of Science]
Jarman MV and Jarman PJ, 1973. Daily activity of impala. East Afr Wildl J 11:7592.
Karesh WB, Smith F, and Frazier-Taylor H, 1987. A remote method for obtaining skin biopsy samples. Conserv Biol 1:261262.
Kingdon J, 1982. East African mammals, volume 3D (bovids). New York: Academic Press.
Kingdon J, 1997. The Kingdon field guide to African mammals. London: Academic Press.
Leakey R, 1988. A plea for urgent action. Swara 11:67.
Lorenzen ED and Siegismund HR, 2004. No evidence of hybridization between the black-faced impala (Aepyceros melampus petersi) and the common impala (A. m. melampus) in Etosha NP, Namibia. Mol Ecol 13:30073019.[CrossRef][Medline]
Luikart G and Cornuet JM, 1998. Empirical evaluation of a test for identifying recently bottlenecked populations from allele frequency data. Conserv Biol 12:228237.[CrossRef]
Luikart G, Cornuet JM, Allendorf FW, and Sherwin WB, 1998. Distortion of allele frequency distributions provides a test for recent population bottlenecks. J Hered 89:238247.
Manel S, Berthier P, and Luikart G, 2002. Detecting wildlife poaching: identifying the origin of individuals with Bayesian assignment tests and multilocus genotypes. Conserv Biol 16:650659.[CrossRef]
Mantel N, 1967. The detection of disease clustering and a generalized regression approach. Cancer Res 27:209220.
Mathee CA and Robinson TJ, 1997. Mitochondrial DNA phylogeography and comparative cytogenetics of the springhare, Pedetes capensis (Mammalia: Rodentia). J Mamm Evol 4:5373.
Mathee CA and Robinson TJ, 1999. Mitochondrial DNA population structure of roan and sable antelope: implications for the translocation and conservation of the species. Mol Ecol 8:227238.[CrossRef][Medline]
Maudet C, Miller C, Bassano B, Breitenmoser-Würsten C, Gauthier D, Obexer-Ruff G, Michallet J, Taberlet P, and Luikart G, 2002. Microsatellite DNA and recent statistical methods in wildlife conservation management: applications in alpine ibex (Capra ibex [ibex]). Mol Ecol 11:421436.[CrossRef][Medline]
Meester J and Setzer HW, 1977. The mammals of Africa: an identification manual. Washington, DC: Smithsonian Institute Press; 5657.
Moritz C, 1994. Defining evolutionary significant units for conservation. Trends Ecol Evol 9:373375.[CrossRef]
Moritz C, 1999. Conservation units and translocations: strategies for conserving evolutionary processes. Hereditas 130:217228.[CrossRef][Web of Science]
Nei M, 1972. Genetic distances between populations. Am Nat 106:283292.[CrossRef][Web of Science]
Nei M and Li WH, 1979. Mathematical model for studying genetic variation in terms of restriction endonucleases. Proc Natl Acad Sci USA 76:52695273.
Nersting LG and Arctander P, 2001. Phylogeography and conservation of impala and greater kudu. Mol Ecol 10:711719.[CrossRef][Medline]
Nowak D, 1999. Walker's mammals of the world, 6th ed. Baltimore: Johns Hopkins University Press; 11941196.
Paetkau D, 1999. Using genetics to identify intraspecific conservation units: a critique of current methods. Conserv Biol 13:15071509.[CrossRef]
Paetkau D, Calvert W, Stirling I, and Strobeck C, 1995. Microsatellite analysis of population structure in Canadian polar bears. Mol Ecol 4:347354.[Medline]
Paetkau D, Slade R, Burden M, and Estoup A, 2004. Genetic assignment methods for the direct, real-time estimation of migration rate: a simulation-based exploration of accuracy and power. Mol Ecol 13:5565.[CrossRef][Medline]
Page RDM, 1996. TREEVIEW: an application to display phylogenetic trees on personal computers. Comput Appl Biosci 12:357358.
Park SDE, 2001. Trypanotolerance in West African cattle and the population genetic effects of selection (PhD dissertation). Ireland: University of Dublin.
Partridge TC, Wood B, and deMenocal PB, 1995. The influence of global climatic change and regional uplift on large mammalian evolution on east and southern Africa. In: Paleoclimate and evolution with emphasis of human origins (Vrba E, Denton G, Partridge TC, and Burckle L, eds). New Haven: Yale University Press; 330355.
Piry S, Alapetite A, Cornuet JM, Paetkau D, Baudouin L, and Estoup A, 2004. GeneClass2: a software for genetic assignment and first-generation migrant detection. J Hered 95:536539.
Piry S, Luikart G, and Cornuet JM, 1999. BOTTLENECK: a computer program for detecting recent reductions in the effective population size using allele frequency data. J Hered 90:502503.
Pritchard JK, Stephens M, and Donnelly PJ, 2000. Inference of population structure using multilocus genotype data. Genetics 155:945959.
Randi E and Lucchini V, 2002. Detecting rare introgression of domestic dog genes into wild wolf (Canis lupus) populations by Bayesian admixture analyses of microsatellite variation. Conserv Genet 3:2943.[CrossRef]
Randi E, Pierpaoli M, Beaumont MA, Ragni B, and Sforzi A, 2001. Genetic identification of wild and domestic cats (Felis silvestris), and their hybrids using Bayesian clustering methods. Mol Biol Evol 18:16791693.
Raymond M and Rousset F, 1995. GENEPOP (version 1.2), population genetics software for exact tests and ecumenicism. J Hered 7:248249.
Rice WR, 1989. Analyzing tables of statistical tests. Evolution 43:223225.[CrossRef][Web of Science]
Rosenberg NA, Burke T, Elo K, Feldman MW, Freidlin PJ, Groenen MAM, Hillel J, Maki-Tanila A, Tixier-Boichard M, Vignal A, Wimmers K, and Weigend S, 2001. Empirical evaluation of genetic clustering methods using multilocus genotypes from 20 chicken breeds. Genetics 159:699713.
Rosenberg NA, Pritchard JK, Weber JL, Cann HM, Kidd KK, Zhivotovsky LA, and Feldman MW, 2002. Genetic structure of human populations. Science 298:29812985.
Rousset F, 1997. Genetic differentiation and estimation of gene flow from F-statistics under isolation by distance. Genetics 145:12191228.[Abstract]
Saitou N and Nei M, 1987. The neighbour-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol 4:406425.[Abstract]
Schneider S, Roessli D, and Excoffier L, 2000. Arlequin: a software for population genetics data analysis. Version 2.000. Genetics and Biometry Laboratory, Department of Anthropology, University of Geneva.
Shortridge GC, 1934. The mammals of south west Africa, vol II. London: William Heinemann; 557559.
Skinner JD and Smithers RHN, 1990. The mammals of the southern African subregion. Pretoria: University of Pretoria Press.
Slatkin M, 1995. A measure of population subdivision based on microsatellite allele frequencies. Genetics 139:457462.[Web of Science][Medline]
Swart PS, 1967. New data on the black-faced impala (Aepyceros melampus petersi Bocage). Cimbebasia 20:318.
Van Hooft WF, Groen AF, and Prins HH, 2002. Phylogeography of the African buffalo based on mitochondrial and Y-chromosomal loci: Pleistocene origin and population expansion of the Cape buffalo subspecies. Mol Ecol 11:267279.[CrossRef][Medline]
Weber JL, 1990. Informativeness of human (dC-dA)n(dG-dT)n polymorphisms. Genomics 7:524530.[CrossRef][Web of Science][Medline]
Weir BS and Cockerham CC, 1984. Estimating F-statistics for the analysis of population structure. Evolution 38:13581370.[CrossRef][Web of Science]
Wright S, 1951. The genetical structure of populations. Ann Eugen 15:323354.
This article has been cited by other articles:
![]() |
J. P. Huelsenbeck and P. Andolfatto Inference of Population Structure Under a Dirichlet Process Model Genetics, April 1, 2007; 175(4): 1787 - 1802. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||




