Effects of Introgression and Recombination on Haplotype Structure and Linkage Disequilibrium Surrounding a Locus Encoding Bymovirus Resistance in Barley

We present a detailed analysis of linkage disequilibrium (LD) in the physical and genetic context of the barley gene Hv-eIF4E, which confers resistance to the barley yellow mosaic virus (BYMV) complex. Eighty-three SNPs distributed over 132 kb of Hv-eIF4E and six additional fragments genetically mapped to its flanking region were used to derive haplotypes from 131 accessions. Three haplogroups were recognized, discriminating between the alleles rym4 and rym5, which each encode for a spectrum of resistance to BYMV. With increasing map distance, haplotypes of susceptible genotypes displayed diverse patterns driven mainly by recombination, whereas haplotype diversity within the subgroups of resistant genotypes was limited. We conclude that the breakdown of LD within 1 cM of the resistance gene was generated mainly by susceptible genotypes. Despite the LD decay, a significant association between haplotype and resistance to BYMV was detected up to a distance of 5.5 cM from the resistance gene. The LD pattern and the haplotype structure of the target chromosomal region are the result of interplay between low recombination and recent breeding history.

T HE ever-increasing availability of nucleotide sequence, and the concomitant improvement that this brings to our understanding of the organization of complex crop plant genomes, has created the opportunity to identify trait-related genes and to analyze their allelic diversity. The association between phenotype and genotype in diverse populations represents a powerful approach, but the applicability and design of such analyses depend critically on the extent and pattern of linkage disequilibrium (LD) present in the study population. Cultivated barley (Hordeum vulgare ssp. vulgare) has many of the hallmarks known to be associated with a high level of LD. Its effective recombination rate is dramatically reduced by its predominantly inbreeding habit, with an estimated outcrossing rate of 5% in winter barley and ,0.5% in spring barley (Giles et al. 1974;Doll 1987;Abdel-Ghani et al. 2005). Domestication and intensive selection have introduced major bottlenecks in genetic variation, and these are thought to be largely responsible for the perceived narrowness of the modern gene pool (Badr et al. 2000;Russell et al. 2000;Matus and Hayes 2002). LD in modern spring barley, as estimated from a whole-genome survey, extends over distances of at least 10 cM, indicative of an extensive conservation of the genetic identity of barley chromosomes (Kraakman et al. 2004). However, estimates of genomewide LD conceal localized variation, which, as has been shown for a number of species, can be substantial and independent of the mating system (Gupta et al. 2005). In the self-pollinating species Arabidopsis thaliana LD varies from ,10 kb in global (Tian et al. 2002) to 50-250 kb in local populations (Nordborg et al. , 2005Aranzana et al. 2005). Within a given set of maize (an outbreeding species) accessions, the extent of LD has been documented to vary widely around different genes and chromosomal regions (Remington et al. 2001). On chromosome 1, for example, it has been shown to decline within 0.1-0.2 kb (Tenaillon et al. 2001), while in a region under high selection pressure, it can extend up to 600 kb (Palaisa et al. 2004).
Little is known of the structure of LD in the physical and genetical vicinity of genes in cultivated barley (reviewed in Gupta et al. 2005). As the success of association studies in this major crop species depends critically on this knowledge, we have set out to evaluate LD in a well-characterized region of chromosome 3H, which harbors Hv-eIF4E, a gene encoding an important virus resistance. Two recessive alleles, rym4 and rym5, confer resistance to both soil-borne Bymoviruses, barley yellow mosaic virus (BaYMV) and barley mild mosaic virus (BaMMV) (Kanyuka et al. 2005;Stein et al. 2005). This combination of pathogens is commonly referred to as the barley yellow mosaic virus complex (BYMV). Carriers of rym4 are resistant to BaMMV, BaMMV-Sil, and BaYMV-1, but are susceptible to BaYMV-2, while those carrying rym5 are resistant to BaYMV-1, BaYMV-2, and BaMMV, but not to BaMMV-Sil (Kanyuka et al. 2004). Although at least seven independent loci conferring resistance to BYMV have been identified in barley to date (Ordon et al. 2005), European breeding has relied heavily on rym4 and rym5, both of which originate from single germplasm accessions. The Dalmatian landrace Ragusa is the source of rym4 (Huth 1985), while rym5 derives from the Chinese landrace Mokusekko 3 (Konishi et al. 1997;Graner et al. 1999;Friedt et al. 2000).
The cloning of Hv-eIF4E Wicker et al. 2005) provides a focus for an LD analysis. Taking into consideration that breeding for resistance in Europe did not start until the 1980s, and that the Hv-eIF4E locus has been subjected to a high selection pressure over a short timescale, this case study reveals much of how genome structure and dynamics can be shaped by plant breeding.

MATERIALS AND METHODS
Plant material: Included in the study were 127 cultivated barley cultivars and landraces originating from Europe, Asia, and America, along with four accessions of wild barley (ssp. spontaneum) collected in either Turkey or Israel. Geographical origins, pedigrees (where available), and phenotype with respect to reaction to BYMV are listed in supplemental Table  S1 at http:/ /www.genetics.org/supplemental/. Accessions carrying rym4 included Ragusa, 19 European cultivars, and four Asian landraces. With respect to rym5, five European and six Asian accessions, including the donor Mokusekko 3, were analyzed. All these accessions were tested in the field and by mechanical inoculation to verify their rym4/rym5 status (Götz and Friedt 1993;Ordon et al. 1993) (supplemental Table S1 at http:/ /www.genetics.org/supplemental/). The remaining resistant accessions have not been tested for allelism to rym4 and rym5 (rym?; Table S1 at http:/ /www.genetics.org/ supplemental/). Nevertheless, their resistance is likely due to the presence of genes other than rym4/rym5, since their haplotypes differed markedly from those of the verified rym4 and rym5 accessions. In the context of this study, these genotypes are referred to as noncarriers of rym4/rym5 and were merged with the group of susceptible cultivars. Genomic DNA was extracted from a single plant per accession as described elsewhere (Graner et al. 1991).
Population structure: Sixteen barley expressed-sequencetag (EST)-derived simple sequence repeat markers (SSRs) (EST-SSRs) were selected to characterize population structure (supplemental Table S2 at http:/ /www.genetics.org/ supplemental/) (Thiel et al. 2003). The selection of appropriate markers was based both on map position (to obtain an even distribution across the genome) and on informativeness (high polymorphic information content value) (Weber 1990;Anderson et al. 1993). The markers were organized into mul-tiplex sets comprising between three and six different primer pairs. Amplicons were generated with the Multiplex PCR kit (QIAGEN, Chatsworth, CA), using an amplification program of 95°/15 min, followed by 40 cycles of 94°/30 sec, 60°/30 sec, and 72°/15 sec, with a final extension step of 72°/10 min. PCR products were separated on 6% polyacrylamide gels using the ABI377 system (Applied Biosystems/Applera, Darmstadt, Germany), and profiles were analyzed with the software packages GeneScan 3.7.1 und Genotyper 3.7. The assessment of genetic population structure was performed within a Bayesian framework using a Markov chain Monte Carlo algorithm to sample from the joint posterior distribution of the subpopulation allele frequencies, and assignment of individuals to particular subgroups was effected with STRUCTURE version 2.1 (Pritchard et al. 2000a; http:/ /pritch.bsd.uchicago.edu/ structure.html). For a K setting of between 2 and 6, 10 independent simulations were performed, using the admixture model and a burn-in of 500,000 followed by 1,000,000 iterations.
To assess the applicability of an association analysis within modern breeding material, association tests were performed in 51 European winter barley cultivars, comprising 20 rym4 carriers and 31 susceptible accessions. To infer the population structure of this sample, the estimated individual membership coefficient in each subgroup was accounted for with a structured population association test (STRAT) (Pritchard et al. 2000b). The test statistic is constructed by computing the likelihood ratio to test the null hypothesis that allele frequencies of subpopulations at the candidate locus are independent of phenotype. Phenotypic association was tested by treating single haplotypes as multiallelic marker loci, including only single polymorphic sites at which the minor allele occurred at a frequency of at least 0.05. Empirical P-values of the observed test statistic per marker locus and per polymorphic site were calculated using 100,000 permutations.
The variation in phenotype due to population structure was assessed by a multiple linear regression model (PLABSTAT version 2H;Utz 1993). The estimated membership fractions for each genotype in K clusters of K ¼ 3 to K ¼ 6 in the entire set and K ¼ 2 for the subset of European winter cultivars were considered. The coefficient of determination from the regression (R 2 ) was applied to quantify the effect on the trait of the genetic background structure.
SNP discovery and genotyping: A number of mutations in Hv-eIF4E are known to confer resistance to BYMV. Thus the scan for polymorphic nucleotides within the set of 131 accessions was focused on an assembly of 12 genomic fragments within and surrounding Hv-eIF4E ( Figure 1). The size of each fragment was between 175 and 1039 bp, encompassing in total 6.9 kb. This represents a genetic interval of 5.5 cM proximal to, and 0.9 cM distal to, Hv-eIF4E (Table 1). The LD structure in relation to physical distance was analyzed on the basis of six fragments spanning the 132-kb BAC contig AY661558 established during the cloning of Hv-eIF4E Wicker et al. 2005). This included three genic fragments, covering all five exons (645 bp) and flanking intron regions (1840 bp), one fragment from the adjacent distally mapping gene Hv-MLL (marker No519; 1048 bp upstream), and one noncoding fragment mapping both distally (No969) and proximally (No1134) to the target. The proximal fragments GBR1843, GBS0526, and GBS0419 have been previously mapped in the Oregon Wolfe mapping population (OWB, Dom 3 Rec) (Costa et al. 2001), which consists of 94 F 1derived doubled haploid progeny (Stein et al. 2007). The distal fragments GBR1845, GBR1851, and GBS1020 were mapped in 115 segmental recombinant inbred lines of Alraune 3 W122.37, a population genetically equivalent to .4800 F 2 progeny (AW) . Because of a lack of polymorphism between the parents of both these mapping populations, it was not possible to combine all the markers onto a single map. Therefore, for the LD analysis, genetic map distances were considered separately for the OWB and AW populations.
Statistical analyses: For the purpose of statistical analyses, indels were regarded as single sites. Estimates for LD were obtained via the software package TASSEL (version 1.0.9, http:/ /www.maizegenetics.net/bioinformatics/tasselindex. htm), applying the measurement r 2 (squared correlation coefficient) (Hill and Robertson 1968). To estimate the strength of LD between haplotypes, we used the measure D9 (Lewontin 1964) modified for multiple alleles by calculating a weighted average of D9 value where the weights are the products of the corresponding allele frequencies (Hedrick 1987;Farnir et al. 2000). Significance of LD is determined by a two-sided Fisher's exact test for biallelic sites and by permutations (setting number 1000) for multiallelic sites. To preclude bias of low-frequency alleles on the LD calculation, polymorphic sites featuring allele(s) with a frequency of #0.05 were excluded. Values of r 2 and D9 were either plotted as a function of the pairwise distance between the polymorphic sites or displayed as an LD matrix as generated by TASSEL.
For the clustering of haplotypes across the 132-kb contig and the assignment to haplogroups, neighbor-joining trees (Saitoh et al. 2004) were constructed on the basis of Kimura two-parameter distances and pairwise deletions of gaps, applying MEGA version 2.1 (Kumar et al. 1994). To test the robustness of derived tree topologies, 1000 bootstraps were performed. To estimate gene genealogies, a haplotype network of Hv-eIF4E was obtained by statistical parsimony (Templeton et al. 1992), using the program TCS version 1.18 (Clement et al. 2000). This program calculates the probability of parsimony for all pairwise differences until the probability exceeds 0.95.
To assess genomic variability across the entire target region and to measure and compare the diversity within and across the designated haplogroups, both nucleotide (pi) and haplotype (Hd) diversities were estimated using DnaSP (Version 3.51; Rozas and Rozas 1999; http:/ /www.ub.es/dnasp/) software. For this purpose, indels were treated as single sites. Overall, 12 of the 127 H. vulgare genotypes were excluded from the analysis because of missing data.

RESULTS
Population structure: The 16 EST-SSR loci revealed 77 alleles, with 2-9 alleles/locus. The distribution of these alleles was in close accordance with the taxonomic and geographic subgroups defined in the germplasm collection, as illustrated by a Bayesian approach (Figure 2). However, it was difficult to fully determine the optimal number of subgroups, since the posterior probabilities for the number of clusters increased steadily. On the assumption of two subgroups (K ¼ 2), the population was divided into winter and spring types. A stepwise increase to K ¼ 6 led to the gradual separation of the Asian from the European accessions, of the two-row from the six-row spike types, and of the cultivated (vulgare) from the wild (spontaneum) types. Importantly, all rym4 carriers and all European rym5 carriers displayed allele frequencies similar to those of susceptible accessions belonging to different subgroups. Thus population structure could account for only 21% (K ¼ 3) to 29% (K ¼ 6) of the variation in BYMV resistance.
Structure of LD: The 12 genomic fragments encompassing the target region were resequenced across the collection of 131 accessions. A total of 83 polymorphic sites, comprising 78 SNPs and five indels, were identified. Of the 83 sites, 5 were triallelic, and 4 of these, in addition to 14 of the biallelic sites, had an allele frequency of #0.05 and were excluded from the subsequent analysis. Across the entire collection, a considerable degree of LD was evident within the physical contig ( Figure 3A), while at the genetic level, the r 2 value dropped sharply to ,0.3 within 1 cM of the target ( Figure 3B). Despite the diminishing r 2 value, a repeated increase of LD (r 2 . 0.3) was observed at a larger distance. This was not an artifact of population stratification, as the allele distribution of the corresponding pairs of polymorphic sites did not associate with population structure.
To compare the LD structure between rym4/rym5resistant and -susceptible genotypes (including noncarriers of rym4/rym5), r 2 was determined separately for these two subgroups (Figure 4). The extended structure of LD observed for the 132-kb contig was confirmed for all subgroups, but with varying strength. In the subgroup represented by the rym4 (n ¼ 24) and the rym5 (n ¼ 11) carriers, most pairs of polymorphic sites within the interval displayed complete LD (r 2 ¼ 1), while the susceptible subgroup (n ¼ 96) showed fewer high r 2 values. LD within the susceptible group fell, as it did for the entire set, within 1 cM of the target. On the other hand, the rym4/rym5 subgroup was characterized by a significantly inflated LD across the entire genetic region, demonstrating a high degree of conservation within resistant genotypes.
Haplotype structure: Haplotypes were generated on the basis of the polymorphic sites, and their structure across the entire genetic interval was analyzed. Considering each individual fragment, between three and seven haplotypes achieved a frequency .2% (Table  1). LD strength between haplotypes of a single DNA fragment within subgroups of resistant (rym4 and rym5 carriers) and susceptible genotypes revealed remarkable differences (supplemental Figure S1 at http:/ /www.genetics.org/supplemental/). Compared to the susceptible group, resistant accessions had higher r 2 values across the entire region. For the 132-kb contig, 22 haploytpes were identified on the basis of 54 polymorphic sites ( Figure 5). The rym4 carriers included two haplotypes (haplotype 1 rym4-A and haplotype 2 rym4-E), which matched the two broad geographical origins of Asia (A) and Europe (E). All rym5 carriers shared the same haplotype (haplotype 4 rym5). Noncarriers of rym4/rym5 were represented by 19 haplotypes. More than half of them (10/19) were singletons while .67% of those genotypes belonged to one of the three major haplotypes 3, 5a, and 17. Using a neighbor-joining method, the haplotypes clustered into three major clades, hereafter referred to as haplogroups I-III ( Figure 6). The formation of these haplogroups appeared to be independent of BYMV resistance (Figure 5). Thus, haplogroup I included three haplotypes comprising the European and Asian rym4 carriers (haplotypes 1 and 2) and a group of highly conserved susceptible two-row winter cultivars (haplotype 3). Only 6 of the 54 scored sites varied within this haplogroup. Haplogroup II contained rym5 carriers (haplotype 4) and geographically diverse accessions (haplotypes 5-12), which included both no, or as yet undetermined, resistance alleles/genes. A clear dimorphism between haplogroups I and II was obtained for half of the polymorphic sites of the contig. Dimorphic sites were located mainly in noncoding sequences, while 15 of the 16 polymorphic exon sites were shared between the two haplogroups. Only one cultivar (Posaune, haplotype 13) was intermediate between these groups, containing a signature of recombination both up-and downstream of the Hv-eIF4E exon 1. Haplogroup III (haplotypes 14-20) had a composite structure with at least three origins. The sequence data of No1134 are closely related to haplogroup I, with only two polymorphic sites present ( Figure 5, position 29084n and 29201n). Moreover, variation within Hv-eIF4E fragment 1 resulted in a sequence, which is clearly distinct from that present in haplogroups I and II. The pattern of polymorphic sites downstream of Hv-eIF4E fragment 1, however, resembled haplogroup II. This apparent patchwork on either side of fragment 1 may indicate the occurrence of several historical recombination events within Hv-eIF4E. The overwhelming reduction in haplotype and nucleotide diversity within the haplogroups, as compared to what is present across the entire collection, validates the grouping (supplemental Table S4 at http://www.genetics.org/supplemental/). A separate analysis indicates that, within a given haplogroup, less diversity is present in the noncoding than in the coding sequence. Importantly, all sites within the coding sequence of Hv-eIF4E generate an amino acid exchange(s) in the protein sequence.
A haplotype network for Hv-eIF4E (Figure 7), constructed with a statistical parsimony algorithm, illustrates the relationships between the haplotypes between and within the haplogroups and emphasizes the genetic distance between the haplogroups I and II/III, with the presumed recombinant haplotype 13 representing the only link between them. Haplotypes 5 and 17 are central within the network and have generated a number of descendant haplotypes. Both are present in a significant number of accessions and are broadly distributed with respect to growth habit and origin. Thus they are probably the most ancient in the germplasm set. In contrast, on the basis of the termini of the network, the two resistant haplotypes, rym4-E and rym5, must have emerged rather recently.
The genetic diversity within the European (n ¼ 80) and Asian (n ¼ 32) subgroups was compared across the physical contig. Despite the 2.5-fold excess of European material within the collection, the Asian accessions included a larger number of haplotypes. The more frequent occurrence of singletons in the latter set (seven vs. four) is largely responsible for its apparent wider diversity. With respect to common haplotypes (frequency within a subgroup $0.05), six were represented in the European set (95.5% of the total diversity) and seven in the Asian set (80.5% of the total diversity). Four of these common haplotypes were shared between the subgroups (haplotypes 4, 5a, 17, and 20; Figure 5). As a result, Hd and p-values were comparable between the European (Hd ¼ 0.806 6 0.022; pi ¼ 0.00484 6 0.00020) Association analysis: Since BYMV resistance was most frequent among the European winter barleys, a test for association between the candidate locus and resistance was carried out for the subgroup of 51 European winter cultivars, of which 20 are rym4 carriers and the remainder are susceptible accessions. To ascertain the population structure within this group, a model-based clustering algorithm was applied. The average-likelihood values from 10 runs reached a maximum at K ¼ 2 and fell for higher values (data not shown). The population explained 6% of the variation in BYMV resistance. Structured population association tests were carried out between BYMV resistance and haplotype at both the 12 marker fragments and at 55 of the 83 polymorphic sites (excluding those where the minimum allele frequency was ,0.05). The association was significant (P , 0.01) for all nine loci mapping between No1134 and GBS1020, comprising the physical contig across Hv-eIF4E and the distal 1-cM region (Table 2). At six loci in this interval (No1134, Hv-eIF4E 1-3, No519, GBR1845M), rym4 carriers possessed haplotypes that were not observed in any susceptible accessions. Locus GBR1843, located 6.5 cM proximally to the physical contig, showed only a weak association (P , 0.05) with resistance. Over 56% of the SNPs were significantly associated with the resistant phenotype (P , 0.01), confirming the outcome of the haplotype test (supplemental Table S5 at http://www. genetics.org/supplemental/).
As a control, association tests were carried out between BYMV resistance and alleles of the evenly distributed SSR markers across the genome (supplemental Table S5 at http://www.genetics.org/supplemental/).
Of 11 polymorphic SSR markers in the European winter barleys set, 10 were not associated with the phenotype and only one marker located on 6H showed a weak association (GBM1021; P ¼ 0.035). DISCUSSION We have described a detailed evaluation of LD and haplotype patterns surrounding the gene Hv-eIF4E, which encodes a heavily utilized virus resistance in European barley.
Population structure: Population structure has a major impact on patterns of LD and, consequently, on the outcome of association studies (Pritchard et al. 2000b). The diverse collection of cultivars and landraces from Europe, Asia, and America selected in this study could be clustered on the basis of growth habit, ear morphology, geographical origin, and subspecies and is similar in genetic breadth to collections used in other genomewide marker analyses in barley (Melchinger et al. 1994;Ordon et al. 1997;Thiel et al. 2003). Importantly, however, the population structure did not disturb the association between haplotype and BYMV resistance.  ) and noncoding regions are designated according to their position in the AY661558 sequence. Dots indicate sites identical to haplotype 1 (rym4-E). For all indels, the starting point is given (*1, 108 bp; *2, 2 bp; *3, 1 bp; *4, 10 bp). Considering all 54 polymorphic sites, haplotypes 1-20 can be identified. Haplotypes 5 and 7 were separated into ''a'' and ''b,'' which, although sharing an identical haplotype for all three Hv-eIF4E genic fragments, differed at a site in the flanking region (haplotype 5 at No519; haplotype 7 at No969). Haplogroups I, II, and III are indicated on the left. Growth habit (w, winter; s, spring) and origin (E, Europe; A, Asia; U, America) are given for each haplotype.
Linkage disequilibrium and genomic pattern: In self-pollinating species such as A. thaliana and rice, LD has been observed to extend several kilobases beyond a target gene (Hagenblad and Nordborg 2002;Nordborg et al. 2002;Garris et al. 2003;Hagenblad et al. 2004;Olsen et al. 2004), and can even reach the centimorgan range Zhu et al. 2003;Aranzana et al. 2005). Corresponding results apply at the Hv-eIF4E locus. The structure of the three conserved haplogroups in the physical vicinity of Hv-eIF4E reflected a high level of LD across the 132-kb interval. Similarly, a sustained level of LD has been revealed over a 212-kb stretch flanking the hardness locus in European elite barley cultivars (Caldwell et al. 2006). With respect to genetic distance, LD around Hv-eIF4E fell below the critical threshold of r 2 ¼ 0.3 within ,1 cM. This result is inconsistent with a genomewide estimate for LD of up to 10 cM, as reported among modern spring barley cultivars (Kraakman et al. 2004). However, it has been conclusively established for both plants and humans that LD is highly variable, reflecting the combined influences of population structure, genomic region under consideration, and the number of polymorphic sites available (Akey et al. 2003;Ke et al. 2004).
The drastic decay of LD at the genetic level, as it was observed in this study, was mainly attributable to the susceptible accessions. While in both rym4-and rym5resistant groups considerable haplotype conservation persisted, the haplotypes of susceptible accessions revealed a high level of recombination in the regions flanking the 132-kb fragment. Various forces can contribute to haplotype conservation, including (a) prior genetic bottlenecks resulting in a low effective population size, (b) introgression of rym4 and rym5 from a restricted number of sources, (c) a very recent history of intensive selection for resistance against BYMV, (d) a lack of recombination in the target region, and (e) gene function. While it is difficult to demonstrate a specific bottleneck affecting the accessions investigated in this study, the effect of severe domestication-related bottlenecks on haploytpe diversity between modern barley cultivars and wild ssp. spontaneum has been repeatedly described (Badr et al. 2000;Matus and Hayes 2002;Piffanelli et al. 2004). A comparison between the diversity at Hv-eIF4E within wild and cultivated barley should provide more information regarding the history of this locus. Generally, modern breeding has narrowed the genetic base by altering allele frequencies. With the increased use of ssp. spontaneum as a donor for resistance genes, novel alleles have been introgressed into the gene pool of cultivated barley. The typical outcome of such introgressions has been analyzed elsewhere (Ivandic et al. 1998;Pillen et al. 2004). While it initially generates a spike in genetic diversity, strong selection for the exotic allele will gradually erode the frequencies of the ''old'' alleles, even leading to their complete loss (Russell et al. 2000;Collins et al. 2001;Bundock and Henry 2004). Such a scenario is likely to have occurred during breeding for resistance to BYMV. The analysis of pedigree and molecular marker data provides evidence that there was only a single source each of rym4 and rym5 in European elite germ plasm (Huth 1985;Graner and Bauer 1993;Friedt et al. 2000). Thus, strong selection for resistance resulted in an enrichment of the corresponding alleles in European winter barley. A similar situation applies to the selection of yellow endosperm in maize, which started in the early 20th century, and has been traced to two independent introgression events. As a result, genetic variation in the vicinity of the target locus Y1 is very low (Palaisa et al. 2003(Palaisa et al. , 2004. The short timescale over which intensive selection for BYMV resistance has operated has provided as yet only limited opportunities for recombination around the resistance gene. However, targeted selection for rym4/ rym5 is probably not the only reason for the observed LD, as the pedigree triplets (parent1, parent2, offspring) formed by the susceptible cultivars Ursel, Ultra, Villa, and Volla, which are all represented in the collection, also showed no recombination across the entire region genotyped. Since susceptible alleles of Hv-eIF4E are not likely to be subjected to selection pressure, the observed pattern is more probably an outcome of related pedigrees and a limited number of meiotic events during the breeding process.
A low recombination rate in the target region has possibly also governed the size of the introgression segment and the LD pattern. In this regard, a comparison of genetic and physical distances in the region of Hv-eIF4E revealed marked differences in recombinational activity and, in particular, a reduction proximal to the gene and an increase distal to it . Except for marker fragment No1134, which Figure 7.-The Hv-eIF4E haplotype network. Numbers correspond to the haplotype designations in Figure 5. Lines represent mutational changes and solid circles indicate intermediate haplotypes. The 95% confidence interval is 13 steps. Haplotypes shared between more than one genotype are indicated by squares. Shaded squares correspond to rym4-E (1), rym4-A (2), and rym5 (4) haplotypes, respectively. Haplotypes showing strong independence from growth habit and origin are marked with a hatched pattern. Haplogroups I, II, and III are indicated.

TABLE 2
Structured population association tests between haplotype frequency at single-marker loci and rym4-encoded BYMV resistance borders an interval, characterized by a low ratio of physical-to-genetic distance (0.8-2.3 Mb/cM), the contig is located in a region with a high ratio (30-50 Mb/ cM). The low recombination frequencies are consistent with the large proportion of transposable element sequence present in the 439.7-kb BAC contig, which contains only one genic island of 10 kb harboring Hv-eIF4E and Hv-MLL (Wicker et al. 2005). Meiotic recombination in eukaryotes is confined mainly to genes (Thuriaux 1977;Civardi et al. 1994;Dooner and MartinezFerez 1997), and studies in maize have confirmed that genes located close to retrotransposons have less recombinational activity than those present within gene clusters (Fu et al. 2002). A possible additional factor contributing to LD relates to selection pressure on genes located on either side of the BAC contig. The maintenance of LD across several genes has been demonstrated in maize, although it was largely restricted to gene-rich regions (Palaisa et al. 2004). Further investigations are clearly needed to determine whether the genomic patterns observed can be attributed to a single factor such as limited sample size, limited effective population size, the small number of generations since introgression, or low recombination, or whether it is the result of a combination of some or all of these forces.
Haplotype network for the Hv-eIF4E locus: On the basis of the derived network model, it can be suggested that the resistant and susceptible alleles have descended relatively recently from a common ancestor. The separation between haplogroups I and II lends strength to the notion that both rym4 and rym5 have an independent evolutionary history. Evidence for a geographic pattern has been provided for several loci in wild barley (Morrell et al. 2003(Morrell et al. , 2005, but the strong dimorphism associated with rym4 and rym5 did not correspond to any geographic subdivision. The fact that the germplasm has been subjected to breeding, material exchange, and subsequent introgression events of course may obscure such a division. Thus, to gain a better understanding of the evolution of the haplogroups, comprehensive allele genotyping within a set of geographically widely distributed landraces and wild barleys will be required. Surprisingly, the strong clustering of haplotypes identified across Hv-eIF4E was exclusively attributable to polymorphic sites in noncoding sequence, while the genetic variation within each group was due mainly to amino acid replacement mutations in the exons. Why the dimorphism among the haplogroups is due largely to noncoding polymorphic sites remains unexplained. In particular, the haplotype diversity within haplogroup II is mostly attributable to rare polymorphic sites present in single Asian genotypes (haplotypes 8-12). Ongoing tests for allelism will show whether these genotypes carry new resistance alleles at the Hv-eIF4E locus, or whether the resistant phenotype is conferred by an independent locus.
Consequences for association studies: The maintenance of the resistant haplotype blocks, resulting from recent, nonrecombined introgression event(s), has resulted in the formation of significant associations between resistance and haplotypes up to a distance of at least 1 cM from the resistance gene. Such a coarse level of resolution is inadequate for map-based gene cloning. On the other hand, it does imply that a fairly low marker density is sufficient to detect associations between a target region and resistance. If this is typical for other genes within breeding germplasm, the prospects are good for identifying chromosomal segments associated with traits of interest. Further resolution may be achievable by using populations that have been intermated over many generations, thereby promoting the breakdown of linkage blocks. This situation is not common in breeding material and is more likely to be found in landraces and wild populations. For spp. spontaneum, levels of LD comparable to that of the outbreeding species maize have been reported (Lin et al. 2002;Morrell et al. 2005;Caldwell et al. 2006), and these should be sufficient to provide the genetic resolution necessary to identify the functional polymorphism associated with the trait variation.