Journal of Heredity 2003:94(2)
© 2003 The American Genetic Association 94:133-147
Population Structure of Anopheles gambiae in Africa
From the Entomology Branch, Division of Parasitic Diseases, Centers for Disease Control and Prevention, 4770 Buford Highway, Chamblee, GA 30041 (Lehmann, Licht, and Hawley); Department of Biology, Emory University, Atlanta, GA 30322 (Lehmann and Licht); Unité d'Entomologie Médicale, CIRMF, BP 769, Franceville, Gabon (Elissa); National Institute for Medical Research (NIMR), Box 9653 Dar Es Salaam, Tanzania (Maega); National Malaria Control Center, P.O. Box 32509, Lusaka, Zambia (Chimumbwa); Laboratoire D'Entomologie, Avenue des Huileries, B.P. 1197, Ministry of Health, Kinshasa, Democratic Republic of Congo (Watsenga); and Laboratoire de l'Institut de Recherche pour le Developpement, Organisation de lutte Contre les Grandes Endemies en Afrique Centrale, Yaoundé, Cameroon (Wondji and Simard).
Address correspondence to Tovi Lehmann at the address above, or e-mail: lbt2{at}cdc.gov.
| Abstract |
|---|
|
|
|---|
The population structure of Anopheles gambiae in Africa was studied using 11 microsatellite loci in 16 samples from 10 countries. All loci are located outside polymorphic inversions. Heterogeneity among loci was detected and two putative outlier loci were removed from analyses aimed at capturing genome-wide patterns. Two main divisions of the gene pool were separated by high differentiation (FST > 0.1). The northwestern (NW) division included populations from Senegal, Ghana, Nigeria, Cameroon, Gabon, Democratic Republic of Congo (DRC), and western Kenya. The southeastern (SE) division included populations from eastern Kenya, Tanzania, Malawi, and Zambia. Inhospitable environments for A. gambiae along the Rift Valley partly separate these divisions. Reduced genetic diversity in the SE division and results of an analysis based on private alleles support the hypothesis that a recent bottleneck, followed by colonization from the NW populations shaped this structure. In the NW division, populations possessing the M rDNA genotype appeared to form a monophyletic clade. Although genetic distance increased with geographic distance, discontinuities were suggested between certain sets of populations. The absence of heterozygotes between sympatric M and S populations in the DRC and the high differentiation in locus 678 (
) contrasted with low differentiation in all other loci (
) and with the persistence of departures from HardyWeinberg expectations within each form in the DRC. Neither recent reproductive isolation alone nor selection alone can explain these results, a situation that is compatible with incipient speciation. Because it is possible that the molecular forms play different roles in malaria transmission, future studies should treat them separately. Malaria claims the lives of more than one million people every year, mostly in Africa (Marshall 2000). As the principal vector of malaria in Africa, Anopheles gambiae is arguably that continent's most dangerous animal. More effective methods need to be developed to alleviate malaria's burden. Knowledge of A. gambiae's population structure may be useful for identifying heterogeneity in disease transmission due to distinct vector populations and for tracking and predicting the spread of genes of interest, such as those conferring insecticide resistance.
Many studies of population structure assume migration-drift equilibrium and use F statistics (Wright 1978) to estimate gene flow between populations. The limitations of this approach have been stressed in several recent studies (Whitlock and McCauley 1999), but the lack of alternative analytical tools to interpret genetic variation between populations remains a problem, especially for pest species whose demographics are thought to be historically unstable. Understanding the population structure of A. gambiae, the African malaria mosquito, requires consideration of the historical instability in population size (Donnelly et al. 2001), selection effects on large segments of chromosomal regions contained within inversions, and speciation events (e.g., Coluzzi et al. 1979, 1985; Lanzaro et al. 1998; Powell et al. 1999; Taylor et al. 2001; Toure et al. 1998). It is therefore not surprising that a continental analysis of the population structure of A. gambiae has never been published. In this study we describe the population structure of A. gambiae on a continental scale and attempt to infer the processes that have shaped it using conventional, adapted, and ad hoc statistics.
Early studies of A. gambiae in West Africa revealed heterozygote deficits for certain inversions and suggested that the species is subdivided into partly or fully isolated chromosomal forms termed Mopti, Bamako, Bissau, Forest, and Savanna (Bryan et al. 1982; Coluzzi et al. 1979, 1985). These studies also showed that the frequency of certain inversion arrangements increased every dry season and decreased every wet season, demonstrating selective effects. Most subsequent studies encompassed small geographic scales. Surprisingly high genetic similarity was found between chromosomal forms outside polymorphic inversions, except in ribosomal DNA (rDNA), where two distinct genotypes (molecular forms) have been found (Favia et al. 1997; Gentile et al. 2001; Mukabayire et al. 2001). In Mali and Burkina Faso, one rDNA genotype (M) was solely associated with the Mopti chromosomal form and the other (S) was associated with the Savanna and Bamako forms. In neighboring countries, however, this association broke down, as both genotypes were found in the Savanna and Forest chromosomal forms (della Torre et al. 2001). Consistent with near-complete isolation between these populations, M/S heterozygotes were very rare (
0.3%), even in sympatric populations (della Torre et al. 2001; Wondji et al. 2001), and a large difference was found in the frequency of the kdr mutation (on the second chromosome) between them (Weill et al. 2000). Study of reproductive isolation between A. gambiae populations has shifted from chromosomal to molecular forms, but the debate on the taxonomic status of all these populations has yet to be resolved (e.g., Black and Lanzaro 2001; Tripet et al. 2001). In contrast to the situation in west Africa, no indication of reproductive isolation among populations has been found in east Africa (Kamau et al. 1998; Lehmann et al. 1997; Petrarca and Beier 1992; Smits et al. 1996). High differentiation was found between populations separated by the Rift Valley complex (Kamau et al. 1999; Lehmann et al. 1999, 2000), but differentiation was surprisingly low between populations from east and west Africa (Besansky et al. 1997; Kamau et al. 1999; Lehmann et al. 1996, 1999). An analysis of the structure of A. gambiae populations from a continental perspective may help resolve some of these enigmas.
In this study we analyze the population structure of A. gambiae based on variation at 11 microsatellite loci in populations from Kenya, Tanzania, Malawi, Zambia, Democratic Republic of Congo (DRC), Gabon, Cameroon, Nigeria, Ghana, and Senegal. To minimize confounding genome-wide patterns with locus-specific effects, we selected loci distributed throughout the genome, but located outside known polymorphic inversions. The following questions were addressed: What are the main divisions of the gene pool? How are they related to the rDNA genotypes and to geography? What can be inferred about the processes that shaped this structure?
| Materials and Methods |
|---|
|
|
|---|
Samples and Collection Methods
Sixteen collections were made between 1992 and 2000 from the locations shown in Figure 1. Sample sizes ranged from 29 to 85 A. gambiae specimens per collection (Table 1). The locations included Asembo Bay (1994; hereafter referred to as Asembo), Kisian, Kimilili, and Awendo from western Kenya (1996); Malindi and Jego from eastern Kenya (1996); Mbeya from Tanzania (1997); Mangochi and Nsanje from Malawi (1992); Zambezi from Zambia (1999); Kinshasa from the DRC (2000); Benguia from Gabon (1999); Simbock from Cameroon (1998); Gwamlar from Nigeria (1999); Navrongo from Ghana (1997); and Barkedji from Senegal (1995). Indoor-resting adult mosquitoes (mostly females) were collected by pyrethrum spray or aspiration in Kisian, Kimilili, Awendo, Malindi, Jego, Malawi, Zambia, Nigeria, the DRC, and Ghana. In Tanzania, Cameroon, Gabon, and Senegal, blood-seeking mosquitoes were collected by human-baited night catches. In Asembo, blood-fed and blood-seeking females were collected at dawn by aspiration from net traps hung over the beds of sleeping volunteers. At each site, mosquitoes were collected within 1 week from houses less than 10 km apart.
|
|
Karyotypically, populations from Cameroon and Gabon consisted of the Forest form (Fontenille D and Elissa N, unpublished data). Since Kinshasa (DRC) lies within the zone where this type is prevalent, presumably these species are also Forest form. All the other specimens were collected from populations where only the Savanna form was found (Coluzzi et al. 1985; della Torre et al. 2001; Fontenille D, unpublished data; Petrarca et al. 2000).
DNA Extraction and Microsatellite Genotype Scoring
Procedures and loci were previously described in detail (Donnelly et al. 2001; Lehmann et al. 1996, 1997, 1998). Only A. gambiae were included in the analysis after species identification was carried out (Scott et al. 1993). Microsatellite loci mapped to regions outside polymorphic chromosomal inversions in A. gambiae were selected (Figure 2 and Table 1). Microsatellite alleles were polymerase chain reaction (PCR) amplified with one fluorescent-labeled primer and their size was determined using the ABI 377 sequencing system. Polymerase chain reaction restriction fragment length polymorphism (PCR-RFLP) assays were used to determine the molecular form (Favia et al. 1997). Available specimens from Mangochi (Malawi), Cameroon, and Senegal consisted of one to two legs. DNA extracts of the specimens from Cameroon and Mangochi were exhausted after genotyping 7 loci, while that of Senegal was exhausted after genotyping 10 loci.
|
Data Analysis
Goodness-of-fit tests for HardyWeinberg (HW) expectations and genotypic linkage disequilibrium were performed using exact tests available in GENEPOP 3.2 (Raymond and Rousset 1995a). Differentiation among populations was assessed by F statistics (Wright 1978) calculated using GENEPOP according to Weir and Cockerham (1984). Deviation of a single locus FST from zero was tested by exact tests of heterogeneity of gene frequencies (Raymond and Rousset 1995b) using GENEPOP. FST is a better measure of differentiation than RST where differentiation is shaped primarily by drift (Slatkin 1995), as was the case for A. gambiae (Lehmann et al. 1998, 1999). Global tests were employed to evaluate the significance of multiple tests. The sequential Bonferroni procedure (Holm 1979) can detect a single test-specific departure, such as a locus-specific departure, whereas the binomial test (which estimates the probability of obtaining the observed number of significant tests at the 0.05 level given the total number of tests) can detect weaker departures across multiple tests, such as genome-wide departures. Bootstrapping was performed to calculate 95% confidence intervals (CIs) of parameter estimates using 1000 bootstrap replications, unless specified otherwise. The 95% CI of regression slopes was obtained by bootstrapping residuals (Mooney and Duval 1993). Neighbor-joining population trees were computed using MEGA 2.1 (Kumar et al. 1993) based on matrices of FST values. The robustness of population trees was evaluated by bootstrapping over loci. The bootstrapped matrices were analyzed using PHYLIP 3.5 (Felsenstein 1989). Bootstrap support values greater than 75% were considered biologically significant (Hillis and Bull 1993). Calculations not available in GENEPOP, MEGA, and PHYLIP were carried out using programs written by T.L. in SAS language (SAS 1990).
| Results |
|---|
|
|
|---|
Specimens from Senegal, Ghana, and Cameroon consisted of the M genotype. The DNA of eight specimens from Cameroon was exhausted before M/S assays were carried out and two repeatedly failed to produce PCR product. All other populations consisted of the S genotype, except the DRC, where both genotypes coexisted (M:S ratio of 32:15, and 2 which did not yield PCR product). Of importance is that no M/S heterozygotes were observed.
Population Homogeneity and Independence of Loci
Significant departures from HW expectations within populations were found in 23 of 167 tests (P <.001, binomial test). As in previous microsatellite studies (e.g., Donnelly and Townson 2000; Kamau et al. 1998; Lehmann et al. 1998, 1999; Simard et al. 2000), most deviations (19) were associated with positive FIS values, reflecting heterozygote deficits (Table 1). Locus-specific departures based on the sequential Bonferroni test were detected in locus 678 in the DRC and in locus 2A1 in Zambia and Jego. Heterozygote deficits in multiple loci indicate that samples may consist of several pooled subpopulations (Wahlund effect), the effects of inbreeding, or the presence of null alleles (Callen et al. 1993). Unlike the Wahlund effect and inbreeding, which affect the entire genome, observed deficits were clustered on locus 2A1 (seven populations). A few individuals repeatedly failed to produce a PCR product at 2A1, suggesting that they were homozygotes for null alleles. Excluding locus 2A1 left 12 significantly positive FIS values (P >.12, binomial test) and no population had a significant heterozygote deficit at the multitest level.
Assuming that the molecular forms represent two panmictic units, we expected that the deficit of heterozygotes observed in the total DRC population would disappear if tests were performed separately in each form. Contrary to our expectation, the deficit of heterozygotes across loci observed for the pooled DRC population did not disappear when M and S populations were analyzed separately. In the total DRC sample (
), significant FIS values were observed at three of the four X-linked loci (2A1, 99, and 678; Table 1). In the M subpopulation (
), significantly positive FIS values were still seen at these loci; positive values were also detected at loci 29C1 and 79, on the third and the second chromosomes, respectively. Even in the smaller S subsample (
), a significantly positive FIS was observed at locus 678 and its value remained high (0.31). Locus 678 is mapped to division 6 of the X chromosome (Zheng et al. 1996), near the rDNA (Figure 2). The persistence of heterozygote deficits in the separate M and S subpopulations is consistent with locus-specific selection rather than with two reproductively isolated populations.
Linkage disequilibrium (LD) tests were performed between all pairs of loci for all populations. The Wahlund effect or inbreeding should cause within-population LD because members of the different subpopulations will have different probabilities of carrying certain combinations of alleles. However, if null alleles cause the heterozygote deficits, LD is not expected, because all individuals are equally likely to carry a null allele, and the association between alleles from different loci is not disturbed. The fraction of significant (
, single test) LD tests ranged from zero (Kimilili, Cameroon, and Senegal) to 11% (Malindi), but none were significant at the multitest level (
, binomial tests), or by the sequential Bonferroni method. No evidence for LD was detected in the DRC (6 of 66 tests;
, binomial test). These results suggest that loci are independent and that null alleles were the likely cause of the heterozygote deficits. Nevertheless, the insignificant LD tests may reflect the limited power of this test.
Within-Population Diversity
Genetic diversity within populations was moderate to high for all loci (Table 1). Variation in sample size between populations (58164 chromosomes per locus per population) can have a substantial effect on the observed number of alleles (A). To compare populations in an unbiased way we randomly sampled 20 individuals (40 chromosomes) from each population, with replacement (100 times), and calculated the number of alleles and unbiased expected heterozygosity (He; Nei 1987) for each locus. The bootstrapped He mean remained within 2% of the original value in all cases and is thus not shown in Table 1. To summarize between-population differences in genetic diversity, we applied principal component analysis (PCA) based on the correlation matrix of the bootstrapped mean A and mean He of each locus. This analysis was based on 10 loci (excluding Mangochi and Cameroon, and locus 678, which was not genotyped in Senegal) and captured 65% of the total variation (across 20 original variables) in two principal components (Figure 3). The first principal component accounted for 53% of the total variation and separated two groups of populations. One group included Malindi (Kenya), Jego (Kenya), Nsanje (Malawi), Tanzania, and Zambia, while the other group included the remaining nine populations. The first group had lower scores, indicating lower diversity. Geographically the groups corresponded to the southeastern (SE) region and northwestern (NW) region of sub-Saharan Africa (Figure 1). The second principal component accounted for 12% of the variation and appeared to separate the DRC from all the rest. Analysis of all 16 populations based on seven loci yielded the same groups based on the first principal component (accounting for 58% of the variation), and placed Mangochi (Malawi) in the SE group and Cameroon in the NW group, as expected. The second principal component (accounting for 10% of the variation) did not separate any population (not shown). To check whether the explanation that genetic diversity is lower in the SE region is consistent with the original variables, the differences between regions of bootstrapped A and He means were tested in a multivariate analysis of variance and found to be highly significant (A, 6.6 versus 5.1; He, 69 versus 62; P <.001). Clearly this is not a test of the difference between regions, as populations were grouped into regions based on the PCA, but the results of the test are consistent with our interpretation of the PCA.
|
Differentiation Between Populations: Heterogeneity Among Loci
If populations represent distinct demes, significant differentiation is expected across the genome. Heterogeneity in allele frequency between pairwise population comparisons was found in 117 of 120 pairs using Fisher multisample tests. Insignificant heterogeneity was found in three comparisons between western Kenyan populations (Asembo versus Kisian, Kimilili versus Kisian, and Kimilili versus Awendo) located within 200 km of each other.
FST values among loci varied widely (range -0.02 to 0.40). Tables of FST values between all population pairs for every locus are available from T.L. upon request. To assess whether this variation reflected locus heterogeneity, we calculated jackknifed means for each locus for every population pair (excluding one locus at a time and calculating 11 means, each without a single locus) and calculated an influence index: (
)/Foriginal. This index estimates the relative change in FST after excluding a given locus. Figure 4A plots the FST value for all population pairs more than 200 km apart and genotyped at all 11 loci against the influence index for each locus. This index is not intended as a statistical test, as we do not know the limits of the distribution of the influence index under locus homogeneity. Rather it was used as an exploratory tool to identify putatively influential loci, defined as those whose removal of their single FST value (of 11 values) changed the mean FST by 6095% (upper right corner of Figure 4, top panel). The same values, all with
, are located 5 to 15 interquartile range units (IQR) away from their corresponding median (Figure 4, bottom panel). Values located more than three IQR from the median are typically considered as extreme outliers (SAS 1999). Notably, all 13 points (out of 396) which fell more than three IQR from their median (all values with influence larger than 35%) belonged to locus 678 or locus 79, which are mapped near the rDNA and inversion 2Rb, respectively (Figure 2; see below).
|
The extreme outlier values of locus 678 were found in comparisons between the M and S populations. Allele distributions at locus 678 (Figures 5 and 6) show the basis for this pattern. In Ghana (M genotype), the allele size range was 149163, with a 153 bp allele predominating (77%). In neighboring Nigeria, as for all S populations, the allele size range was 153211, and a 165 bp allele predominated (18%; Figure 5). Allele 153 was rare or absent in all S-genotype populations. In the DRC, the overall frequency of allele 153 was 45%, but was 65% in its M subpopulation and only 4% in its S subpopulation (Figure 6). There were 11 heterozygotes for this allele, of which 10 belonged to the M form. The distinct allelic profile at locus 678 of each molecular form resulted in extremely high FST values (0.20.4) and in the departure from HW expectations reported above.
|
|
The other extreme outlier values were associated with locus 79, in comparisons between Nigeria and three other populations (Gabon, DRC, and Asembo). Populations from Ghana, Senegal, and Nigeria had moderate to high frequencies of alleles 175 and 177 (25%48% and 17%26%, respectively), which were low elsewhere (<15% and <4%, respectively). These populations also lacked allele 176, which was at least moderately frequent elsewhere. Notably, allele 176 does not fit a series predicted by a 2 bp repeat motif and a constant flanking sequence (Figure 5). Alleles 175, 176, and 177 were absent from the M population in the DRC, although allele 176 was found in the S population there (13%). Unlike locus 678, locus 79 did not separate the M and S genotypes. Its location (subdivision 11A) near inversion 2Rb (Figure 2), which is under selection (Coluzzi et al. 1985; Toure et al. 1994, 1998), suggests an inversion-specific effect in accordance with Lanzaro et al. (1998).
High FST values were also observed for locus 29, but influence indices were less than 40% (Figure 4, top panel) and their location within the whiskers of the box plots (Figure 4, bottom panel) indicate no exceptional deviations. With only two alleles segregating, FST in locus 29 attains high values when one allele dominates, that is, is closer to fixation (Wright 1978). Besides the influential/outlier values of loci 678 and 79, the distribution of the jackknifed FST means of the remaining nine loci appears homogeneous, as only 3 of 326 values are located beyond the "whiskers" (Figure 4, bottom panel). These results strongly caution that variation between certain populations at loci 678 and 79 has been shaped not only by drift, migration, and mutation, but also by selection operating presumably on a linked locus. Because the homogeneity of loci was probably violated by including a locus with
with 10 loci whose mean is 0.02 (range -0.006 to 0.075), as exemplified by locus 678 between Asembo and Ghana (influence index 56%), we removed loci 678 and 79 from analyses aimed at capturing genome-wide patterns.
Differentiation Between Populations: Phylogeography
An unrooted neighbor-joining population tree was constructed based on a matrix of the mean FST across nine loci between all pairs of populations (Figure 7). Large FST distances with high bootstrap values (
) distinguished two groups of populations. One group included populations from eastern Kenya, Malawi, Tanzania, and possibly Zambia. The other group included all other populations. These groups correspond to those discerned by PCA (NW and SE; Figure 3) with the exception of Zambia, which was separated by equal FST distances from each division (branch length 0.03,
; Figure 7).
|
Within each division, populations separated by shorter geographic distances appear to be more related genetically, but this pattern was broken by the cluster of M populations (Figure 7). Of importance is that the M subpopulation from the DRC was clustered with the other more northerly M populations, a long genetic distance from its sympatric S subpopulation. As locus 678 was excluded, this cluster represents a genome-wide pattern consistent with a common descent for all M populations. Despite the clade's low bootstrap support (
), the assembly of all M populations regardless of geographic distance is unlikely to be a result of noise. Of interest is that the grouping of Senegal and Ghana apart from all the other populations (
; Figure 7) suggests a split between populations inhabiting the savanna and the forest. Substructure was greater in the geographically smaller SE subdivision, in which three nested clusters were discerned (Figure 7). These included two populations from eastern Kenya nested in a cluster containing Nsanje, which were nested in a cluster containing Mangochi. All populations in this cluster were nested within Tanzania.
The phylogeographic analysis supports three clear patterns: (1) a continental split between the NW and SE divisions, (2) a shallower, secondary split between the M and the S populations within the NW division, and (3) a series of nested population clusters in the SE division. Differentiation between the M and S populations was lower than that between several allopatric populations of the same molecular form, and the distinction between Forest (Cameroon, Gabon, DRC) and Savanna (all others) chromosomal forms added little to the classification of populations.
Differentiation Between Populations: Isolation by Distance
The distribution of A. gambiae across its range in sub-Saharan Africa is more or less continuous (Coetzee et al. 2000). Large areas inhospitable to this species that can isolate populations, such as the Rift Valley complex (Kamau et al. 1999; Lehmann et al. 1999, 2000) are few and migration around them may counteract their effect. The isolation by distance model (Wright 1951) is therefore a plausible one for A. gambiae. It assumes that migration is equally likely in all directions and that distance is the main determinant of differentiation.
The positive slope of the regression of FST/(
) on the logarithm of the distance (Rousset 1997), based on the mean FST (excluding loci 678 and 79), is consistent with the isolation by distance model (slope = 0.013,
; Table 2). To determine whether this pattern also occurs within divisions and subdivisions, we repeated this analysis within and between these units (Table 2). Positive slopes (
) were found in all analyses except between molecular forms in the NW division, within the M form, and when both forms were included in the NW division (Table 2). Rousset (1997) pointed out that the slope depends on the range of the logarithm of distance values. Large differences between the estimates of the slopes (Table 2) suggest heterogeneity of the slopes, but we must accommodate the disparity in distance ranges. Even after truncating the distance range of the S population in the NW division to match that of the SE division (NW, 302230 km; SE, 1902180 km), the slope for the S populations in the NW division (0.0024, 95% CI 0.0010.003) was significantly smaller than that in the SE division (0.0151, 95% CI 0.0050.025).
|
| Discussion |
|---|
|
|
|---|
Distinguishing locus-specific from genome-wide patterns is a central prerequisite for the description of A. gambiae population structure because of the effects of selection and inversions (della Torre et al. 2001; Gentile et al. 2001; Lanzaro et al. 1998; Mukabayire et al. 2001; Taylor et al. 2001). Locus homogeneity with respect to differentiation between populations was probably violated unless loci 678 and 79 were removed from the analyses of genome-wide patterns. The deepest split in the gene pool was measured between the SE and NW divisions (Figure 7). Other clusters of populations, such as the molecular forms, were separated by smaller genetic distances. Understanding the processes that have shaped this structure starts with the question of what might constitute the barriers among these units.
Between the NW and SE divisions there are a near-continuous series of highlands, deserts, and lakes along the Rift Valley system that are inhospitable for A. gambiae (Figure 7). In addition, an inhospitable dry savanna extends from northern Kenya between the narrow coastal plain and the highlands and spreads into central Tanzania (not depicted in Figure 7). This series of ecogeographic features probably constitutes the northern barrier between divisions, as was shown previously (Lehmann et al. 1999, 2000). No such geographic features are found between the DRC and Zambia or between Nsanje (Malawi) and Zambia (Figure 7). Alternatively, a recent bottleneck in the SE region explains both the high differentiation between divisions and the lower genetic diversity within the SE division. In the absence of geographical barriers, distance alone determines differentiation. For instance, the isolation by distance model (SE, S form; Table 2) is consistent with the differentiation between Zambia and both the DRC (S) and Nsanje (1340 km apart; expected
; observed mean
, respectively). Consistent with a northern barrier, however, distance fails to predict the differentiation between eastern and western Kenya (
,
; see Lehmann et al. 1999). These results suggest that migration between divisions occurs mostly in the south and are consistent with a bottleneck that shaped this structure.
The Role of Colonization in Establishing the SE Subdivision
The nested clusters of populations in the SE division may reflect a series of founder events associated with a population spreading from the northwest and progressing eastward and northward along the coast. Under a mutation-drift equilibrium with similar effective population size, each population of a given population pair has the same likelihood of possessing private alleles. Colonization events, particularly if associated with founder effects, will result in loss of private alleles in colony populations. Accordingly, most alleles of the colony population are expected to exist in the source population, but many alleles of the source population are absent in the colony due to the founder effect. To test for the existence of such a pattern in the SE division, we calculated an index of inclusiveness of alleles and used it as an ad hoc test statistic. The inclusiveness index (I) was calculated as the proportion of alleles in the colony (c) population that are shared with a putative source (s) population (Fc[s]) times the difference between this and the "reverse" fraction:
. Unlike differentiation indices, inclusiveness is typically asymmetric (
) and is asymptotically bounded between 1 and -1. Large positive values require that Fc[s] is large and that Fs[c] is considerably smaller, indicating that the data are consistent with the hypothesis that c and s represent a colony and a source, respectively. Near-zero values indicate that the two populations possess nearly the same alleles or very different alleles, but in either case the data do not conform to the "source-colony" hypothesis. Small negative values indicate that the relationships may be reversed (i.e., the putative colony is more likely to be the source population). To avoid bias due to differences in sample size, we used the same sample size (per locus) for each pair of populations by sorting specimens in order of collection and excluding those in excess. A priori hypotheses were tested by bootstrapping the matrix of inclusiveness values between all pairs of populations, after excluding the values under the hypothesis to determine whether they were higher than 95% of the bootstrapped values.
Our first hypothesis is based on the structure of the population tree (Figure 7): a series of colonization events took place starting with migrants from Zambia into Tanzania; migrants from Tanzania into Malawi; and last, migrants from Malawi into eastern Kenya. An alternative hypothesis is that all SE populations were colonized from one origin in a rather short time (not sequentially). Since western Kenya and the DRC are closest to the SE division, they were considered the most plausible source populations. In these tests, alleles across all 11 loci were used (a total of 219 alleles), the population from Asembo was arbitrarily selected to represent western Kenya (Kisian, Kimilili, and Awendo were excluded, although if their values are averaged the same patterns were obtained), and the S and M populations of the DRC were considered separately (see below).
The low inclusiveness values do not support the "sequential colonization" hypothesis, but high and significant values support the "one source" hypothesis (Table 3). These high values highlight the lack of private alleles in the SE populations and that most of the SE alleles are found in western Kenya (Table 3). To clarify the sources of the SE division, we tested Senegal, Ghana, Cameroon, Gabon, and Nigeria as possible source populations. They were all rejected as source populations (
), except Nigeria (
), which has an allele profile very similar to western Kenya (Figure 7; mean
). While the true origin cannot be determined, a colonization event from a population(s) most similar to western Kenya is supported by these results.
|
The Extent of Isolation Between the M and S Populations
The complex status of the molecular forms is highlighted by our results, some of which support recent reproductive isolation, but others support selection (Table 4). If reproductive isolation alone has generated the high differentiation at the rDNA and locus 678 (
), why is the genome-wide differentiation so much lower (
)? Heterogeneity among loci is a strong signature of selection. Further, the colocalization of the two high-differentiation loci in division 6 contrasts with other X-linked loci (1D1, 2A1, and 99) that show no differentiation between M and S forms in the DRC (
, exact tests) and indicate that loci 678 and the rDNA are not independent. Notably the genome-wide differentiation between the molecular forms is lower than that between some allopatric populations within forms, albeit the results above suggest that the differentiation between divisions is the result of a founder effect (see more below). Finally, contrary to expectations based on two panmictic units in the DRC, departures from HW expectations persisted on locus 678 in both forms, and additional departures, not seen in the pooled population, were detected in each form despite the lower sample size. Nevertheless, at least partial reproductive isolation between sympatric M and S populations was evident even from several autosomal loci associated with significant differentiation in the DRC (locus 46,
; locus 197,
; locus 577,
;
, exact tests) and in other localities (della Torre et al. 2001; Wondji et al. 2001). Further, a low frequency (<2%) of M/S matings was measured in Mali (Tripet et al. 2001) and a large difference in the frequency of the kdr mutation between sympatric M and S populations was reported (Weill et al. 2000). Neither selection alone nor recent isolation alone can explain these findings.
|
An explanation that accounts for all these findings involves fluctuating gene flow and selection. According to this hypothesis, the combined effects of assortative mating and selection against M/S heterozygotes severely limits gene flow between them under typical conditions. However, if selection is relaxed, in as yet unobserved conditions, then gene flow will reduce the differentiation genome-wide. The traces of gene flow will be quickly obscured in chromosomal regions affected by selection and hitchhiking. A different explanation is that heterozygote females (for an X-linked locus) may be at a disadvantage under natural conditions, but gene flow could continue via males; a situation which cannot be resolved with available samples, as they consist almost exclusively of females. Fluctuating selection and gene flow is a hypothesis that postulates unobserved events, but some support for it is found in the allele distributions at locus 678. Unlike the population in Ghana that is very distinct from its neighboring S population (Nigeria), the M population in the DRC shares alleles 167, 169, and 179 (all absent in Ghana) with the sympatric S population (Figure 6). Separated by a large number of repeats (three to nine) from the nearest M allele (161 bp), they probably originated by gene flow from the sympatric S population, although a common ancestral origin cannot be ruled out. Nucleotide data are needed to determine whether the M form is indeed monophyletic, as suggested by Figure 7.
Recent studies have suggested that differentiation between species can be high in several genes despite persistent gene flow and lower differentiation in other genes (e.g., Machado et al. 2002) and that divergence leading to incipient speciation can occur between populations that exchange genes at high rates (e.g., Beheregaray and Sunnucks 2001; Michalak et al. 2001). These conclusions are backed by male sterility (e.g., Machado et al. 2002) or by extensive phenotypic differences (e.g., Beheregaray and Sunnucks 2001; Michalak et al. 2001), which are not paralleled in the present knowledge of A. gambiae molecular forms. However, these findings strengthen the view that the molecular forms represent incipient species (della Torre et al. 2002 and references therein). Additional studies exploring the phenotypic differences between the forms are clearly needed. Regardless of the taxonomic debate, the molecular forms may represent distinct epidemiological entities, so treating them separately is operationally justified, at least until differences in their vectoral capacity and susceptibility to means of control are evaluated.
Gene Flow and Other Processes Shaping the Population Structure
Consistent with the isolation by distance model, differentiation increased with geographic distance, but the effect of distance was rather small. For example, the expected FST across 6,000 km using the steepest slope (of the SE division) is only 0.045. The large differences between slopes suggest that factors other than distance are involved in shaping differentiation. An accurate estimate of gene flow is beyond our grasp because of departure from the mutation-drift-migration equilibrium (Donnelly et al. 2001). Isolation by distance and the inferred colonization process further complicate the interpretation of gene flow.
The colonization of the SE division probably does not represent the original introduction of A. gambiae into this area, but a colonization following a large-scale bottleneck. Traces of a historical bottleneck were detected in the SE division (Jego), but not in the NW division (Asembo) based on the imbalance index (Donnelly et al. 2001). What could cause such a large-scale bottleneck? Severe droughts occur repeatedly in Africa, with the most profound effects in long dry spell regions, where the dry season normally lasts 6 months or longer (Goudie 1996). This climate prevails in large parts of the SE region and in the Sahel (Figure 7). The impact of severe drought is exemplified in the drought of 17701830, which totally dried up Lake Rukwa (Tanzania) and Lake Chiuta (Malawi), reduced the levels of Lake Malawi by more than 120 m, and brought famine that led to human migration (Nicholson 1998a,b; Owen et al. 1990). Such events probably reduced both the number of available breeding sites and hosts. The areas least affected by droughts are the equatorial forests and the belt of wet savanna (short dry spell) surrounding it (Goudie 1996). It is plausible therefore that the ultimate source populations of A. gambiae reside in the forest and the wet savannas, whereas populations residing in the dry savanna and the Sahel are doomed to experience extinction/bottlenecks every few hundred years. This process might help maintain the genetic homogeneity of A. gambiae across its range and reduce the "effective range" (loosely defined as the range of permanent populations) of the species to nearly half of its presently observed range. This effective range may be even smaller, because droughts "shrink" the forest and the ring of wet savanna around it. Why do populations in the Sahel (e.g., Barkedji, Senegal) show no sign of a bottleneck compared with those in the SE division (e.g., Jego)? The Sahel is a narrow belt, less than 500 km from the wet savanna, a relatively short distance that allows rapid colonization and replenishment of genetic diversity. In addition, the Sahel is populated by the relatively drought-tolerant M molecular form. The SE region, on the other hand, is partly separated by the Rift Valley and its associated highlands and by a longer distance, and so requires a longer period to replenish its genetic diversity. If periodic droughts shaped the deepest division of the structure of A. gambiae populations, then other drought-sensitive species in Africa with moderate mobility should present similar patterns. A remarkably similar pattern was observed in Anopheles funestus (Yan G, personal communication) and in Drosophila teissieri, showing reduced genetic diversity in three Tanzanian populations (SE) compared with those from Gabon and Ivory Coast (NW) (Cobb et al. 2000). Additional studies on relevant species will permit a more rigorous test of the effects of severe drought on A. gambiae in the context of regional phylogeography.
| Acknowledgments |
|---|
We are grateful to Mark Danga, Francis Atieli, George Olang', and Charles Mbogo (Kenya Medical Research Institute) for their help in field collections. We thank Fernando Monteiro, Martin Donnelly, Jean-Pierre Dujardin, and Nora Besansky for discussions on earlier versions of this manuscript. Thanks to the staff of the CDC Core Facility, Computer Support and Data Management for their help, and to Sue Dillard for her dedicated work on Figure 7. This investigation received financial assistance from the UNDP/World Bank/WHO Special Programme for Research and Training in Tropical Diseases (TDR).
| Footnotes |
|---|
Corresponding Editor: Stephen Schaeffer
| References |
|---|
|
|
|---|
-
Beheregaray LB, Sunnucks P, 2001. Fine-scale genetic structure, estuarine colonization and incipient speciation in the marine silverside fish Odontesthes argentinensis. Mol Ecol. 10:2849-2866.[CrossRef][Medline]
Besansky NJ, Lehmann T, Fahey GT, Fontenille D, Braack LE, Hawley WA, Collins FH, 1997. Patterns of mitochondrial variation within and between African malaria vectors, Anopheles gambiae and An. arabiensis, suggest extensive gene flow. Genetics. 147:1817-1828.[Abstract]
Black WC, Lanzaro GC, 2001. Distribution of genetic variation among chromosomal forms of Anopheles gambiae s.s: introgressive hybridization, adaptive inversions, or recent reproductive isolation? Insect Mol Biol. 10:3-7.[CrossRef][Web of Science][Medline]
Bryan JH, Di Deco MA, Petrarca V, 1982. Inversion polymorphism and incipient speciation in Anopheles gambiae s.s in the Gambia, West Africa. Genetica. 59:167-176.[CrossRef][Web of Science]
Callen DF, Thompson AD, Shen Y, Phillips HA, Richards RI, Mulley JC, Sutherland GR, 1993. Incidence and origin of "null" alleles in the (AC)n microsatellite markers. Am J Hum Genet. 52:922-927.[Web of Science][Medline]
Cobb M, Huet M, Lachaise D, Veuille M, 2000. Fragmented forests, evolving flies: molecular variation in African populations of Drosophila teissieri. Mol Ecol. 9:1591-1597.[CrossRef][Medline]
Coetzee M, Craig M, le Sueur D, 2000. Distribution of African malaria mosquitoes belonging to the Anopheles gambiae complex. Parasitol Today. 16:74-77.[CrossRef][Web of Science][Medline]
Coluzzi M, Petrarca V, Di Deco MA, 1985. Chromosomal inversion intergradation and incipient speciation in Anopheles gambiae. Boll Zool. 52:45-63.
Coluzzi M, Sabatini A, Petrarca V, Di Deco MA, 1979. Chromosomal differentiation and adaptation to human environments in the Anopheles gambiae complex. Trans R Soc Trop Med Hyg. 73:483-497.[CrossRef][Web of Science][Medline]
della Torre A, Costantini C, Besansky NJ, Cacccone A, Petrarca V, Powell JR, Coluzzi M, 2002. Speciation within Anopheles gambiaethe glass is half full. Science. 298:115-117.
della Torre A, Fanello C, Akogbeto M, Dossou-yovo J, Favia G, Petrarca V, Coluzzi M, 2001. Molecular evidence of incipient speciation within Anopheles gambiae s.s. in West Africa. Insect Mol Biol. 10:9-18.[CrossRef][Web of Science][Medline]
Donnelly MJ, Licht MC, Lehmann T, 2001. Evidence for recent population expansion in the evolutionary history of the malaria vectors Anopheles arabiensis and Anopheles gambiae. Mol Biol Evol. 18:1353-1364.
Donnelly MJ, Townson H, 2000. Evidence for extensive genetic differentiation among populations of the malaria vector Anopheles arabiensis in eastern Africa. Insect Mol Biol. 9:357-367.[CrossRef][Web of Science][Medline]
Favia G, della Torre A, Bagayoko M, Lanfrancotti A, Sagnon N, Toure YT, Coluzzi M, 1997. Molecular identification of sympatric chromosomal forms of Anopheles gambiae and further evidence of their reproductive isolation. Insect Mol Biol. 6:377-383.[CrossRef][Web of Science][Medline]
Felsenstein J, 1989. PHYLIPphylogeny inference package (version 3.2). Cladistics. 5:164-166.
Gentile G, Slotman M, Ketmaier V, Powell JR, Caccone A, 2001. Attempts to molecularly distinguish cryptic taxa in Anopheles gambiae s.s. Insect Mol Biol. 10:25-32.[CrossRef][Web of Science][Medline]
Goudie AS, 1996. Climate: past and present. In: The physical geography of Africa (Adams WM, Goudie AS, and Orme AR, eds). Oxford: Oxford University Press; 3559.
Hillis DM, Bull JJ, 1993. An empirical test of bootstrapping as a method for assessing confidence in phylogenic analysis. Syst Biol. 42:182-192.[CrossRef][Web of Science]
Holm S, 1979. A simple sequentially rejective multiple test procedure. Scand J Stat. 6:65-70.
Kamau L, Lehmann T, Hawley WA, Orago AS, Collins FH, 1998. Microgeographic genetic differentiation of Anopheles gambiae mosquitoes from Asembo Bay, western Kenya: a comparison with Kilifi in coastal Kenya. Am J Trop Med Hyg. 58:64-69.[Abstract]
Kamau L, Mukabana WR, Hawley WA, Lehmann T, Irungu LW, Orago AA, Collins FH, 1999. Analysis of genetic variability in Anopheles arabiensis and Anopheles gambiae using microsatellite loci. Insect Mol Biol. 8:287-297.[CrossRef][Web of Science][Medline]
Kingdon J, 1989. Island Africa. Princeton, NJ: Princeton University Press.
Kumar S, Tamura K, Nei M, 1993. MEGA: molecular evolutionary genetics analysis, version 1.01. University Park, PA: Pennsylvania State University.
Lanzaro GC, Toure YT, Carnahan J, Zheng L, Dolo G, Traore S, Petrarca V, Vernick KD, Taylor CE, 1998. Complexities in the genetic structure of Anopheles gambiae populations in west Africa as revealed by microsatellite DNA analysis. Proc Natl Acad Sci USA. 95:14260-14265.
Lehmann T, Besansky NJ, Hawley WA, Fahey TG, Kamau L, Collins FH, 1997. Microgeographic structure of Anopheles gambiae in western Kenya based on mtDNA and microsatellite loci. Mol Ecol. 6:243-253.[CrossRef][Medline]
Lehmann T, Blackston CR, Besansky NJ, Escalante AA, Collins FH, Hawley WA, 2000. The Rift Valley complex as a barrier to gene flow for Anopheles gambiae in Kenya: the mtDNA perspective. J Hered. 91:165-168.
Lehmann T, Hawley WA, Grebert H, Collins FH, 1998. The effective population size of Anopheles gambiae in Kenya: implications for population structure. Mol Biol Evol. 15:264-276.[Abstract]
Lehmann T, Hawley WA, Grebert H, Danga M, Atieli F, Collins FH, 1999. The Rift Valley complex as a barrier to gene flow for Anopheles gambiae in Kenya. J Hered. 90:613-621.
Lehmann T, Hawley WA, Kamau L, Fontenille D, Simard F, Collins FH, 1996. Genetic differentiation of Anopheles gambiae populations from east and west Africa: comparison of microsatellite and allozyme loci. Heredity. 77(pt 2):192-200.
Machado CA, Kliman RM, Markert JA, Hey J, 2002. Inferring the history of speciation from multilocus DNA sequence data: the case of Drosophila pseudoobscura and close relatives. Mol Biol Evol. 19:472-488.
Marshall E, 2000. Malaria. A renewed assault on an old and deadly foe. Science. 290:428-430.
Michalak P, Minkov I, Helin A, Lerman DN, Bettencourt BR, Feder ME, Korol AB, Nevo E, 2001. Genetic evidence for adaptation-driven incipient speciation of Drosophila melanogaster along a microclimatic contrast in "Evolution Canyon," Israel. Proc Natl Acad Sci USA. 98:13195-13200.
Mooney CZ, Duval RD, 1993. Bootstrapping: a nonparametric approach to statistical inference. New Berry, CA: Sage.
Mukabayire O, Caridi J, Wang X, Toure YT, Coluzzi M, Besansky NJ, 2001. Patterns of DNA sequence variation in chromosomally recognized taxa of Anopheles gambiae: evidence from rDNA and single-copy loci. Insect Mol Biol. 10:33-46.[CrossRef][Web of Science][Medline]
Nei M, 1987. Molecular evolutionary genetics. New York: Columbia University Press.
Nicholson SE, 1998a. Fluctuations of Rift Valley Lakes Malawi and Chilwa during historical times: a synthesis of geological, archeological and historical information. In: Environmental change and response in east African lakes (Lehman JT, ed). Amsterdam: Kluwer; 207231.
Nicholson SE, 1998b. Historical and modern fluctuations of Lakes Tanganyika and Rukwa and their relationship to rainfall variability. Amsterdam: Kluwer; 119.
Owen R, Crossley R, Johnson TC, Tweddle D, Kornfield I, Davison S, Eccles DH, Engstrom DE, 1990. Major low levels of Lake Malawi and their implications for speciation rates in cichlid fishes. Proc R Soc Lond B Biol Sci. 240:519-553.
Petrarca V, Beier JC, 1992. Intraspecific chromosomal polymorphism in the Anopheles gambiae complex as a factor affecting malaria transmission in the Kisumu area of Kenya. Am J Trop Med Hyg. 46:229-237.
Petrarca V, Nugud AD, Ahmed MA, Haridi AM, Di Deco MA, Coluzzi M, 2000. Cytogenetics of the Anopheles gambiae complex in Sudan, with special reference to An. arabiensis: relationships with east and west African populations. Med Vet Entomol. 14:149-164.[CrossRef][Web of Science][Medline]
Powell JR, Petrarca V, della Torre A, Caccone A, Coluzzi M, 1999. Population structure, speciation, and introgression in the Anopheles gambiae complex. Parassitologia. 41:101-113.[Medline]
Raymond M, Rousset F, 1995a. GENEPOP (version 1.2): population genetics software for exact tests and ecumenicism. J Hered. 86:248-249.
Raymond M, Rousset F, 1995b. Testing heterozygote excess and deficiency. Genetics. 140:1413-1419.[Abstract]
Rousset F, 1997. Genetic differentiation and estimation of gene flow from F-statistics under isolation by distance. Genetics. 145:1219-1228.[Abstract]
SAS,, 1990. SAS language: references, version 6. Cary, NC: SAS Institute.
Scott JA, Brogdon WG, Collins FH, 1993. Identification of single specimens of the Anopheles gambiae complex by the polymerase chain reaction. Am J Trop Med Hyg. 49:520-529.
Simard F, Lehmann T, Lemasson JJ, Diatta M, Fontenille D, 2000. Persistence of Anopheles arabiensis during the severe dry season conditions in Senegal: an indirect approach using microsatellite loci. Insect Mol Biol. 9:467-479.[CrossRef][Web of Science][Medline]
Slatkin M, 1995. A measure of population subdivision based on microsatellite allele frequencies. Genetics. 139:457-462.[Web of Science][Medline]
Smits A, Roelants P, Van Bortel W, Coosemans M, 1996. Enzyme polymorphisms in the Anopheles gambiae (Diptera:Culicidae) complex related to feeding and resting behavior in the Imbo Valley,. Burundi. J Med Entomol. 33:545-553.
Taylor C, Toure YT, Carnahan J, Norris DE, Dolo G, Traore SF, Edillo FE, Lanzaro GC, 2001. Gene flow among populations of the malaria vector, Anopheles gambiae, in Mali, West Africa. Genetics. 157:743-750.
Toure YT, Petrarca V, Traore SF, Coulibaly A, Maiga HM, Sankare O, Sow M, Di Deco MA, Coluzzi M, 1994. Ecological genetic studies in the chromosomal form Mopti of Anopheles gambiae s.str. in Mali, West Africa. Genetica. 94:213-223.[CrossRef][Web of Science][Medline]
Toure YT, Petrarca V, Traore SF, Coulibaly A, Maiga HM, Sankare O, Sow M, Di Deco MA, Coluzzi M, 1998. The distribution and inversion polymorphism of chromosomally recognized taxa of the Anopheles gambiae complex in Mali, West Africa. Parassitologia. 40:477-511.[Medline]
Tripet F, Toure YT, Taylor CE, Norris DE, Dolo G, Lanzaro GC, 2001. DNA analysis of transferred sperm reveals significant levels of gene flow between molecular forms of Anopheles gambiae. Mol Ecol. 10:1725-1732.[CrossRef][Medline]
Weill M, Chandre F, Brengues C, Manguin S, Akogbeto M, Pasteur N, Guillet P, Raymond M, 2000. The kdr mutation occurs in the Mopti form of Anopheles gambiae s.s. through introgression. Insect Mol Biol. 9:451-455.[CrossRef][Web of Science][Medline]
Weir BS, Cockerham CC, 1984. Estimating F-statistics for the analysis of population structure. Evolution. 38:1358-1370.[CrossRef][Web of Science]
Whitlock MC, McCauley DE, 1999. Indirect measures of gene flow and migration:
. Heredity. 82:117-125.
Wondji C, Simard F, Fontenille D, 2001. Evidence for genetic differentiation between molecular forms M and S within the forest chromosomal form of Anopheles gambiae in an area of sympatry. Insect Mol Biol.
Wright S, 1951. The genetical structure of populations. Ann Eugen. 15:323-354.
Wright S, 1978. Evolution and the genetics of populations: variability within and among natural populations. Chicago: University of Chicago Press.
Zheng L, Benedict MQ, Cornel AJ, Collins FH, Kafatos FC, 1996. An integrated genetic map of the African human malaria vector mosquito, Anopheles gambiae. Genetics. 143:941-952.[Abstract]
This article has been cited by other articles:
![]() |
J. R. Loaiza, M. E. Scott, E. Bermingham, J. Rovira, and J. E. Conn Evidence for Pleistocene Population Divergence and Expansion of Anopheles albimanus in Southern Central America Am J Trop Med Hyg, January 1, 2010; 82(1): 156 - 164. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Diabate, A. Dao, A. S. Yaro, A. Adamou, R. Gonzalez, N. C. Manoukis, S. F. Traore, R. W. Gwadz, and T. Lehmann Spatial swarm segregation and reproductive isolation between the molecular forms of Anopheles gambiae Proc R Soc B, December 7, 2009; 276(1676): 4215 - 4222. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Lefevre, L.-C. Gouagna, K. R. Dabire, E. Elguero, D. Fontenille, F. Renaud, C. Costantini, and F. Thomas Beyond Nature and Nurture: Phenotypic Plasticity in Blood-Feeding Behavior of Anopheles gambiae s.s. When Humans Are Not Readily Accessible Am J Trop Med Hyg, December 1, 2009; 81(6): 1023 - 1029. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Edillo, A. Kiszewski, J. Manjourides, M. Pagano, M. Hutchinson, A. Kyle, J. Arias, D. Gaines, R. Lampman, R. Novak, et al. Effects of Latitude and Longitude on the Population Structure of Culex pipiens s.l., Vectors of West Nile Virus in North America Am J Trop Med Hyg, November 1, 2009; 81(5): 842 - 848. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. Kent, S. Mharakurwa, and D. E. Norris Spatial and Temporal Genetic Structure of Anopheles arabiensis in Southern Zambia over Consecutive Wet and Drought Years Am J Trop Med Hyg, August 1, 2007; 77(2): 316 - 323. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Subramanian, P. Arensburger, P. W. Atkinson, and D. A. O'Brochta Transposable Element Dynamics of the hAT Element Herves in the Human Malaria Vector Anopheles gambiae s.s. Genetics, August 1, 2007; 176(4): 2477 - 2487. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. KENT, P. E. THUMA, S. MHARAKURWA, and D. E. NORRIS SEASONALITY, BLOOD FEEDING BEHAVIOR, AND TRANSMISSION OF PLASMODIUM FALCIPARUM BY ANOPHELES ARABIENSIS AFTER AN EXTENDED DROUGHT IN SOUTHERN ZAMBIA Am J Trop Med Hyg, February 1, 2007; 76(2): 267 - 274. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. E. Yawson, D. Weetman, M. D. Wilson, and M. J. Donnelly Ecological Zones Rather Than Molecular Forms Predict Genetic Differentiation in the Malaria Vector Anopheles gambiae s.s. in Ghana Genetics, February 1, 2007; 175(2): 751 - 761. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Slotman, L. J. Reimer, T. Thiemann, G. Dolo, E. Fondjo, and G. C. Lanzaro Reduced Recombination Rate and Genetic Differentiation Between the M and S Forms of Anopheles gambiae s.s. Genetics, December 1, 2006; 174(4): 2081 - 2093. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. P. Michel, O. Grushko, W. M. Guelbeogo, N. F. Lobo, N. Sagnon, C. Costantini, and N. J. Besansky Divergence With Gene Flow in Anopheles funestus From the Sudan Savanna of Burkina Faso, West Africa Genetics, July 1, 2006; 173(3): 1389 - 1395. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. E. CONN, J. H. VINEIS, J. P. BOLLBACK, D. Y. ONYABE, R. C. WILKERSON, and M. M. POVOA POPULATION STRUCTURE OF THE MALARIA VECTOR ANOPHELES DARLINGI IN A MALARIA-ENDEMIC REGION OF EASTERN AMAZONIAN BRAZIL. Am J Trop Med Hyg, May 1, 2006; 74(5): 798 - 806. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. M. Menge, D. Zhong, T. Guda, L. Gouagna, J. Githure, J. Beier, and G. Yan Quantitative Trait Loci Controlling Refractoriness to Plasmodium falciparum in Natural Anopheles gambiae Mosquitoes From a Malaria-Endemic Region in Western Kenya Genetics, May 1, 2006; 173(1): 235 - 241. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. SLOTMAN, M. M. MENDEZ, A. D. TORRE, G. DOLO, Y. T. TOURE, and A. CACCONE GENETIC DIFFERENTIATION BETWEEN THE BAMAKO AND SAVANNA CHROMOSOMAL FORMS OF ANOPHELES GAMBIAE AS INDICATED BY AMPLIFIED FRAGMENT LENGTH POLYMORPHISM ANALYSIS Am J Trop Med Hyg, April 1, 2006; 74(4): 641 - 648. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. J. Kennington, L. Partridge, and A. A. Hoffmann Patterns of Diversity and Linkage Disequilibrium Within the Cosmopolitan Inversion In(3R)Payne in Drosophila melanogaster Are Indicative of Coadaptation Genetics, March 1, 2006; 172(3): 1655 - 1663. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. SLOTMAN, A. D. TORRE, M. CALZETTA, and J. R. POWELL DIFFERENTIAL INTROGRESSION OF CHROMSOMAL REGIONS BETWEEN ANOPHELES GAMBIAE AND AN. ARABIENSIS Am J Trop Med Hyg, August 1, 2005; 73(2): 326 - 335. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. D. Stump, J. A. Shoener, C. Costantini, N. Sagnon, and N. J. Besansky Sex-Linked Differentiation Between Incipient Species of Anopheles gambiae Genetics, March 1, 2005; 169(3): 1509 - 1519. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Tripet, G. Dolo, and G. C. Lanzaro Multilevel Analyses of Genetic Differentiation in Anopheles gambiae s.s. Reveal Patterns of Gene Flow Important for Malaria-Fighting Mosquito Projects Genetics, January 1, 2005; 169(1): 313 - 324. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. MOLINA-CRUZ, A. M. P. DE MERIDA, K. MILLS, F. RODRIGUEZ, C. SCHOUA, M. M. YURRITA, E. MOLINA, M. PALMIERI, and W. C. BLACK IV GENE FLOW AMONG ANOPHELES ALBIMANUS POPULATIONS IN CENTRAL AMERICA, SOUTH AMERICA, AND THE CARIBBEAN ASSESSED BY MICROSATELLITES AND MITOCHONDRIAL DNA Am J Trop Med Hyg, September 1, 2004; 71(3): 350 - 359. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. D. STUMP, F. K. ATIELI, J. M. VULULE, and N. J. BESANSKY DYNAMICS OF THE PYRETHROID KNOCKDOWN RESISTANCE ALLELE IN WESTERN KENYAN POPULATIONS OF ANOPHELES GAMBIAE IN RESPONSE TO INSECTICIDE-TREATED BED NET TRIALS Am J Trop Med Hyg, June 1, 2004; 70(6): 591 - 596. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Sharakhov, O. Braginets, O. Grushko, A. Cohuet, W. M. Guelbeogo, D. Boccolini, M. Weill, C. Costantini, N'F. Sagnon, D. Fontenille, et al. A Microsatellite Map of the African Human Malaria Vector Anopheles funestus J. Hered., January 1, 2004; 95(1): 29 - 34. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. R. G. Nyanjom, H. Chen, T. Gebre-Michael, E. Bekele, J. Shililu, J. Githure, J. C. Beier, and G. Yan Population Genetic Structure of Anopheles arabiensis Mosquitoes in Ethiopia and Eritrea J. Hered., November 1, 2003; 94(6): 457 - 463. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. P. BRAGINETS, N. MINAKAWA, C. M. MBOGO, and G. YAN POPULATION GENETIC STRUCTURE OF THE AFRICAN MALARIA MOSQUITO ANOPHELES FUNESTUS IN KENYA Am J Trop Med Hyg, September 1, 2003; 69(3): 303 - 308. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||




, +, and , respectively. Population pairs associated with outlier and influential values (defined as described in the text) are shown based on the first three letters of their name (e.g., GHA denotes Ghana). Note: the point at the lower right caused a trivial change in the mean differentiation from 0.0008 to 0.003. In the box whisker plot, the box extends between the 25th and the 75th percentile, that is, across one IQR, and the whiskers extend up to the most extreme value, but not beyond 1.5 times the IQR. Values located more than two IQR from the median are shown





