Skip Navigation


Journal of Heredity Advance Access originally published online on April 20, 2005
Journal of Heredity 2005 96(4):363-367; doi:10.1093/jhered/esi062
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
96/4/363    most recent
esi062v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (2)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Lee, C.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Lee, C.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The American Genetic Association. 2005. All rights reserved. For Permissions, please email: journals.permissions@oupjournals.org.

Selection Bias in Quantitative Trait Loci Mapping

C. Lee

Laboratory of Statistical Genetics, Ilsong Institute of Life Science, Hallym University, Anyang, Kyonggi-do 431-060, Korea

Address correspondence to Chaeyoung Lee at the address above, or e-mail: clee{at}hallym.ac.kr.


    Abstract
 Top
 Abstract
 Materials and Methods
 Results
 Discussion
 References
 
A simulation study was performed to see whether selection affected quantitative trait loci (QTL) mapping. Populations under random selection, under selection among full-sib families, and under selection within a full-sib family were simulated each with heritability of 0.3, 0.5, and 0.7. They were analyzed with the marker spacing of 10 cM and 20 cM. The accuracy for QTL detection decreased for the populations under selection within full-sib family. Estimates of QTL effects and positions differed (P < .05) from their input values. The problems could be ignored when mapping a QTL for the populations under selection among full-sib families. A large heritability helped reduction of such problems. When the animals were selected within a full-sib family, the QTL was detected for the populations with heritability of 0.5 or larger using the marker spacing of 10 cM, and with heritability of 0.7 using the marker spacing of 20 cM. This study implied that when selection was introduced, the accuracy for QTL detection decreased and the estimates of QTL effects were biased. A caution was warranted on the decision of data (including selected animals to be genotyped) for QTL mapping.


The overwhelming number of molecular markers produced by genome projects provides a great opportunity to locate genes controlling the expression of quantitative traits. Various experimental designs have been utilized for quantitative trait loci (QTL) detection (Lynch and Walsh 1998). Inbred lines are, however, often limited in humans and animals, and in such cases QTL analyses are to utilize existing populations as they are. Either natural or artificial selection is usually introduced to the populations. Use of such populations exposes QTL detection to potential biases introduced by genotyping individuals, especially for economically important animals selected through breeding programs. The objectives of this study were to examine if such selection affects QTL mapping and to assess the impact on estimates of QTL effects and positions.


    Materials and Methods
 Top
 Abstract
 Materials and Methods
 Results
 Discussion
 References
 
Simulation
A population was produced beginning with 80 founder individuals randomly assigned with QTL genotypes (40 males and 40 females). Each male parent was mated to three rotationally selected female parents. That is, the first male was mated to the 1st, 2nd, and 3rd females, the second male to the 2nd, 3rd, and 4th females, through the 40th male to the 40th, 1st, and 2nd females. The 120 full-sib families were produced in the F1 generation. Ten individuals were generated in each full-sib family, and a total of 1200 F1 individuals were produced. Sex was randomly assigned, producing 600 males and 600 females. Phenotypes were simulated, and 180 males and 180 individuals were selected and genotyped based on three criteria of the phenotypes. One was random selection and another was to select 36 of 120 full-sib families based on the large phenotypic means of individuals in the families. The third criterion was to select three individuals with large phenotypic values in each full-sib family. For 120 males and 120 females randomly selected among them, each male was mated to a randomly assigned female, producing 120 full-sib families. For the other animals, 60 males and 60 females were backcrossed to a randomly selected parent, again producing 120 full-sib families. Ten individuals were generated in each full-sib family, and a total of 2400 individuals were produced in the F2 generation. Phenotypes were simulated, and based on the three criteria described above, 720 individuals were selected and genotyped.

One chromosome was simulated with a length of 120 cM. The markers were generated every 10 cM and 20 cM, and a QTL was simulated on the chromosome to control the expression of a quantitative trait. The QTL was located at position 77 cM on the chromosome. The allelic effects of the QTL were simulated based on input values of their variances.

Phenotypes were simulated by the following model:

where yijk is a phenotypic value, µ is the overall mean for the trait, and 15.0 was simulated. The sex effect was ignored in this simulation. The variable fi is the full-sib litter effect generated from N(0, 0.1). is the dummy variable for the additive QTL effect taking 1, 0, and –1 for genotypes QQ, Qq, and qq, respectively. is the dummy variable for the dominance effect taking 0, 1, and 0 for genotypes QQ, Qq, and qq. The a and d are regressions for the additive and dominance effects of QTL. The additive and dominance effects were generated, respectively, from and gk is polygenic effect, and the polygenic effect is defined as the genetic effect remaining after the QTL effect is accounted for. This polygenic effect consists of half of the parents' polygenic effects (gb for boar and gs for sow) and Mendelian sampling ({phi}k), that is, gk = 1/2gb + 1/2 gs + {phi}k. Mendelian sampling was simulated using a normal distribution with mean zero and variance equal to the polygenic variance unexplained by parents' polygenic effects. Since all the boars and sows in this population were not inbred, Mendelian sampling was generated from eijk is the random environmental effect and is generated from The input values for a0 and d0 were 2 and 1, respectively. The additive and dominance genetic variances and error variance were determined by the total heritability ( which was partitioned into the additive QTL heritability, the dominance QTL heritability, and the polygenic heritability. Hence the total heritability can be expressed with variance components as the following:

The polygenic variance is defined as the fraction of genetic variance remaining after the QTL effect is accounted for.

The additive and dominance variances were calculated with Falconer and Mackay's (1996) equations:


where pQ is the allele frequency of Q at the QTL.

Given a value of total heritability of the trait, the residual variance is expressed as

The input values of the total heritability in the simulation were 0.3, 0.5, and 0.7. A random number generator based on the Box-Muller method was used to generate random Gaussian deviates (Press et al. 1992). A total of nine populations were simulated, and 50 replicates were generated for each population. They were each analyzed with the marker spacing of 10 cM and 20 cM, and each analysis was coded as shown in Table 1.


View this table:
[in this window]
[in a new window]
 
Table 1.. Codes of analyses designed in the simulation study

 
QTL Mapping
The method for QTL analysis utilized in this study was devised first to estimate the marker genotype means using a mixed model that accounted for full-sib litter effects and then to estimate the QTL effects using a weighted least-squares analysis based on the conditional frequencies of QTL given marker genotypes. For details, see QTLMDSS (http://ilsongls.hallym.ac.kr/hwp/qtlmdss.doc).


    Results
 Top
 Abstract
 Materials and Methods
 Results
 Discussion
 References
 
QTL Detection
Detection of QTL was performed for 18 analyses designed in the simulation study. The average likelihood ratio was estimated for each marker interval from 50 replicates (Figures 1–4). With the marker spacing of 10 cM, the likelihood ratio estimates were consistently the largest at the interval of markers located at 70 cM and 80 cM where a QTL was generated in the simulation (Figures 1–3). It was also true with the marker spacing of 20 cM (shown in Figure 4), with the maximum likelihood ratios located between 60 cM and 80 cM (data are not shown for populations simulated with heritability of 0.3 and 0.7). Regardless of marker spacing and heritability, the likelihood estimates obtained from the populations under random selection were a little bit larger in the area of the QTL than those from the population under selection among full-sib families, and they were close in other areas.



View larger version (17K):
[in this window]
[in a new window]
 
Figure 1.. Likelihood ratio for the detection of a QTL with a marker spacing of 10 cM from the populations simulated with a heritability of 0.3. The dotted line (S0.05) represents the threshold value at a significance level of 0.05.

 


View larger version (18K):
[in this window]
[in a new window]
 
Figure 2.. Likelihood ratio for the detection of a QTL with a marker spacing of 10 cM from the populations simulated with a heritability of 0.5. The dotted line (S0.05) represents the threshold value at a significance level of 0.05.

 


View larger version (17K):
[in this window]
[in a new window]
 
Figure 3.. Likelihood ratio for the detection of a QTL with a marker spacing of 10 cM from the populations simulated with a heritability of 0.7. The dotted line (S0.05) represents the threshold value at a significance level of 0.05.

 


View larger version (16K):
[in this window]
[in a new window]
 
Figure 4.. Likelihood ratio for the detection of a QTL with a marker spacing of 20 cM from the populations simulated with a heritability of 0.5. The dotted line (S0.05) represents the threshold value at a significance level of 0.05.

 
On the other hand, the estimates from the populations under selection within the full-sib family were all considerably smaller than those from the other populations. Furthermore, QTL were not detected in the analyses of WS1H3, WS2H3, and WS2H5. Here we also observed that for a population simulated with a heritability of 0.5, the QTL was detected with a marker spacing of 10 cM (Figure 2), but not with a marker spacing of 20 cM (Figure 4). All the likelihood estimates drastically increased with a larger heritability.

QTL Effect and Position
The estimates of additive and dominance effects for the QTL are shown in Table 2. The estimates of its additive and dominance effects all corresponded to the input values for the populations under random selection and under selection among full-sib families (P > .05), but not for the populations under selection within full-sib family (P < .05). Their empirical standard errors were larger for the populations under selection within the full-sib family than for the other populations.


View this table:
[in this window]
[in a new window]
 
Table 2.. Estimates of QTL additive and dominance effects and its linkage position from analyses of simulated data

 
Table 2 also shows that the estimates of linkage positions differ from their input values for the populations under selection within the full-sib family (P < .05), but not for the populations under random selection and the populations under selection among full-sib families (P > .05). Their standard errors were larger for the populations under selection within the full-sib family than for the other populations. Within such a population group, marker spacing influenced the linkage position estimates more than heritability. For example, within the population group under random selection, the estimates along with the change of heritability ranged from 76.59 to 76.68 with a marker spacing of 10 cM and from 76.35 to 76.44 with a marker spacing of 20 cM. However, the estimates along with the change in marker spacing ranged from 76.35 to 76.59 with a heritability of 0.3, from 76.43 to 76.66 with a heritability of 0.5, and from 76.44 to 76.68 with a heritability of 0.7.


    Discussion
 Top
 Abstract
 Materials and Methods
 Results
 Discussion
 References
 
Selection is often applied to domestic livestock to improve economically important traits. However, various data generated by such selection should be analyzed with caution. For example, genetic groups have been incorporated with the animal model that can correctly explain the first and second moments of animals' genetic merits (Westell and Van Vleck 1987). Another example can be found in a simulation study where splitting data by gender affected estimates of variance and covariance components because analyzing records from only one sex did not account for selection in the other sex (Lee and Pollak 1997). This problem can be overcome by introducing a multiple trait model treating male and female records as different traits to explain the heterogeneity of the variance and covariance components by gender (Lee et al. 1997). The current study also points out potential selection bias introduced by selective genotyping in QTL mapping. However, the influence of selective genotyping on QTL mapping is not severe and is sometimes negligible (Ayoub and Mather 2002; Darvasi 1997; Ronin et al. 2003). This might be because, with selective genotyping, individuals from the high and low phenotypic extremes are usually genotyped.

The simulation study demonstrated that the accuracy for QTL detection decreased when selection was introduced. This problem was serious for the populations under selection within the full-sib family. Furthermore, estimates of QTL effects and positions were all different from their corresponding input values. However, these problems were negligible when mapping a QTL for the populations under selection among full-sib families. This might be because the full-sib litter effects confounded with selection effects could be largely explained by the analytical model introduced in this study. Although the selection and litter effects were confounded in the analytical model, they were not the effects of much interest in such QTL detection. I suggested that the analytical model could help to overcome the selection problem in this special situation. Cautions should be used for QTL detection from existing populations when genotyping individuals selected through various breeding schemes.

The heritability level is an important factor in QTL mapping for data under a selection program. An obvious trend of improved QTL detection with a large heritability was observed in the current study. This agreed with the previous findings of Belknap (1998), Kearsey and Farquhar (1998), and Wu (1999). Noor et al. (2001) demonstrated that narrow marker spacing as well as large heritability had salutary effects on QTL detection efficacy. The current study further indicates that when the animals were selected within the full-sib family, the QTL was detected for the populations with a heritability of 0.5 or greater using the marker spacing of 10 cM, and with a heritability of 0.7 using the marker spacing of 20 cM. Further investigation of such a threshold level of heritability associated with marker spacing would be beneficial to determine appropriate experimental designs and optimal sample sizes for QTL analysis.

In conclusion, the simulation study suggested the need for a careful decision on animals to be genotyped for QTL mapping. When analyzing data under selection, the accuracy for QTL detection was decreased and the estimates of additive and dominance effects were biased. This study also implied that genotyping only one sex might create similar problems for a population under selection, because selection bias can be introduced by partitioning data by gender (Lee and Pollak 1997). I suggest the analytical model used in the current study for mapping QTL with data under selection among full-sib families.


    Acknowledgments
 
This research was supported in parts by a grant from the BioGreen 21 Program, Rural Development Administration, Korea, and a research grant from Hallym University, Korea. The author would like to thank two anonymous reviewers for their valuable comments.


    Footnotes
 
Corresponding Editor: Sudhir Kumar

Received February 8, 2004
Accepted January 23, 2005


    References
 Top
 Abstract
 Materials and Methods
 Results
 Discussion
 References
 

    Ayoub M and Mather DE, 2002. Effectiveness of selective genotyping for detection of quantitative trait loci: an analysis of grain and malt quality in three barley populations. Genome 45:1116–1124.[Medline]

    Belknap JK, 1998. Effect of within-strain sample size on QTL detection and mapping using recombinant inbred mouse strains. Behav Genet 28:29–38.[CrossRef][Web of Science][Medline]

    Darvasi A, 1997. The effect of selective genotyping on QTL mapping accuracy. Mamm Genome 8:67–68.[CrossRef][Web of Science][Medline]

    Falconer DS and Mackay TFC, 1996. Introduction to quantitative genetics, 4th ed New York: Longman.

    Kearsey MJ and Farquhar AG, 1998. QTL analysis in plants; where are we now?. Heredity 80:137–142.

    Lee C and Pollak EJ, 1997. Influence of partitioning data by sex on genetic variance and covariance components for weaning weight in beef cattle. J Anim Sci 75:61–67.[Abstract/Free Full Text]

    Lee C, Van Tassell CP, and Pollak EJ, 1997. Estimation of genetic variance and covariance components for weaning weight in Simmental cattle. J Anim Sci 75:325–330.[Abstract/Free Full Text]

    Lynch M and Walsh B, 1998. Genetics and analysis of quantitative traits Sunderland, MA: Sinauer Associates.

    Noor MAF, Cunningham AL, and Larkin JC, 2001. Consequences of recombination rate variation on quantitative trait locus mapping studies: simulations based on the Drosophila melanogaster genome. Genetics 159:581–588.[Abstract/Free Full Text]

    Press WH, Teukolsky SA, Vetterling WT, and Flannery BP, 1992. Numerical recipes in FORTRAN, 2nd ed New York: Cambridge University Press.

    Ronin Y, Korol A, Shtemberg M, Nevo E, and Soller M, 2003. High resolution mapping of quantitative trait loci by selective recombinant genotyping. Genetics 164:1657–1666.[Abstract/Free Full Text]

    Westell RA and Van Vleck LD, 1987. Simultaneous genetic evaluation of sires and cows for a large population of dairy cattle. J Dairy Sci 70:1006–1017.[Abstract/Free Full Text]

    Wu RL, 1999. Mapping quantitative trait loci by genotyping haploid tissues. Genetics 152:1741–1752.[Abstract/Free Full Text]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
J HeredHome page
Y. Kim, M. Kong, Y. J. Nam, and C. Lee
A Quantitative Trait Locus for Oleic Fatty Acid Content on Sus scrofa Chromosome 7
J. Hered., September 1, 2006; 97(5): 535 - 537.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
96/4/363    most recent
esi062v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (2)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Lee, C.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Lee, C.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?