| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Genetics, Vol. 171, 1305-1309, November 2005, Copyright © 2005
doi:10.1534/genetics.105.043661
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||





* Departament de Genètica Vegetal, Laboratori de Genètica Molecular Vegetal, CSIC-IRTA, 08348 Cabrils (Barcelona), Spain,
National Institute of Fruit Tree Science, Tsukuba, Japan,
INRA, Unité de Recherches sur les Espèces Fruitières et la Vigne, F-33 883 Villenave d'Ornon Cedex, France,
Dipartimento di Scienze Agrarie e Ambientali, University of Udine, 33100 Udine, Italy and ** Department of Genetics and Biochemistry, Clemson University, Clemson, South Carolina 29634
1 Corresponding author: Departament de Genètica Vegetal, Laboratori de Genètica Molecular Vegetal, CSIC-IRTA, Carretera de Cabrils s/n, 08348 Cabrils (Barcelona), Spain.
E-mail: pere.arus{at}irta.es
| ABSTRACT |
|---|
|
|
|---|
A strategy to improve the efficiency of mapping, named selective mapping, was proposed by VISION et al. (2000). It consists of a two-step process in which, first, a mapping population of usual size (N = 60250) is used to construct a saturated framework map with markers placed on it with high precision, and second, new markers are added to this map with lower precision using a selected subset of highly informative plants. The final objective is to lower the cost of genotyping new markers with a minimal loss of mapping precision. The selection of this subset of plants is based on the number and position of recombinational crossover sites (or breakpoints) detected with the framework marker data in each plant. The breakpoints identified by the ensemble of the selected plants define a set of bins, i.e., chromosome fragments bounded by two adjacent breakpoints or by a distal breakpoint and the telomere, characteristic of each subset (Figure 1). For a given marker, the joint genotype of the selected subset of plants ideally identifies a unique small bin in the genome. The optimal subset of a given size would have the maximum possible number of breakpoints evenly spaced throughout the genome, resulting in a high number of small bins of uniform size. VISION et al. (2000) developed methods and designed a software program (MapPop) to facilitate the selection of optimal (or nearly optimal) subsets from mapping populations.
|
| MATERIALS AND METHODS |
|---|
|
|
|---|
Our main criterion for selecting the set of plants for bin mapping (the bin set) was for the number of plants included in this set to be minimal. Additional criteria were a good combination of the following: the minimal number of joint genotypes that each correspond to more than one bin ("duplicate bins"), the smallest maximum bin length, and the highest number of bins (minimal average bin length). By visual inspection, we found that fewer than six plants would generate a high number of duplicate bins. Six was considered a desirable size, because a set of eight individuals (six plants of the F2 plus the two parents or one parent and the F1 hybrid) would be enough for bin mapping. Eight is a suitable unit of analysis as the plates used for PCR reactions are usually of 8 x 12 wells or multiples of this number. Two approaches were followed to find this set of six plants: (a) the Mappop v.1.0 software (VISION et al. 2000) and (b) selection by visual inspection. The algorithm used for selecting the bin set of MapPop, based on minimizing the expected and maximum bin lengths (VISION et al. 2000), is more efficient in finding optimal plant subsets than visual inspection, which lies essentially in finding a good combination of plants among those that have high numbers of breakpoints. In contrast, visual inspection allowed us a better control of the genotypically identical bins.
For these analyses we considered only those plants and codominant markers with at least 70% of the data points, which reduced the data set from 88 plants and 562 markers to 60 plants and 388 markers. Once a set of plants was selected, the final number of bins and their genotype were determined using all 562 loci.
For Mappop we used the 60 x 388 data set with the same notation as that of the Mapmaker mapping software (LANDER et al. 1987): A and B for homozygotes for female and male alleles, respectively, and H for heterozygotes. The commands used for the selection of the set of plants were "loadframe" with the typestring AB-CDH and "samplemax" for six plants. For visual inspection we selected 3 of the 14 plants with 11 or more breakpoints from the set of 60, which together detected a high number of bins. The other 3 were found by adding, to the first set, 13 additional plants with 9 or 10 recombination events and looking for combinations that complemented the first 3 and were within the selection criteria mentioned previously.
Given that a proportion of the markers is expected to be dominant and that these markers are less informative for bin mapping (Figure 1b), additional plants need to be included in the bin set to obtain a level of resolution similar to that found for codominant markers with the six plants of the codominant set (the AHB set). For that purpose, after selection of the AHB set, the set was complemented with two sets of six plants, one for markers where the dominant allele was that of the almond parent (the DB set) and the other for markers dominant for the peach allele (the AC set). The ensemble of these 12 plants (6 of the AHB set plus 6 more of the DB or AC sets) allowed us to map dominant loci with the required precision. Selection of these new plants was done visually and with criteria similar to those defined previously.
To minimize the size of the experiment, we limited the parental information to Earlygold and the MB1-73 hybrid plant. With two inbred parental lines, there is a simple interpretation of the results, but peach and almond cultivars are heterozygous at many loci. The Earlygold parent was chosen as we expected a higher level of homozygosis in peach than in almond (BYRNE 1990). If Earlygold has a heterozygous genotype for alleles of the same size as the hybrid, the assignment of the A and B genotypes to the sample of six plants is ambiguous, because two reciprocal interpretations are possible (i.e., HHBHAB or HHAHBA). We found this for three markers, but in all of them only one of the two possible interpretations corresponded to a bin, which was accepted as the correct one. A more common event, including bins covering 73 cM (14% of the T x E distance), occurs if the Earlygold allele is dominant, and the bin has no homozygotes for the almond allele. Here, the marker would be taken as monomorphic. When looking for markers for these specific regions, in the monomorphic cases it may be advisable to run Texas or other plants of the F2, known to have the A genotype.
Bins of the AHB set were coded with the linkage group number of the bin location, followed by a colon, and then a two-digit number, corresponding to the position of the last marker included in the bin according to the map of DIRLEWANGER et al. (2004a). For example, bin 7:48 ends with a marker 48 cM from the top of linkage group 7. Some dominant markers, or codominant markers with missing data, were in two contiguous bins. These markers were not considered when determining the position of each bin.
In total, 401 microsatellite primer pairs were assayed: 68 (with the letters M and MA) come from a peach (cv. Akatsuki) cDNA library (YAMAMOTO et al. 2001); 7, MD201aMD207a, were obtained from microsatellites within the peach gene sequences of GenBank accession nos. AF414988, AF317062, AJ271438, X96856, AF129074, AF129073, and X77231, respectively; 63 (the UDAp series) from an apricot (cv. Portici) genomic library, enriched for AG/CT repeats (MESSINA et al. 2004); 42 (the UDA series) from an almond (cv. Ferragnès) genomic DNA library enriched for AC/TG repeats (TESTOLIN et al. 2004); 14 (the CPSCT series) were obtained from an enriched (AG/CT repeats) genomic DNA library of Japanese plum cv. Santa Rosa (MNEJJA et al. 2004); and 15 (the UCD series) from a genomic DNA library of sweet cherry cv. Valerij Tschkalov (STRUSS et al. 2003). A total of 180 microsatellites were found in EST collections, 153 (the EPPCU series) obtained from Clemson University and from the GDR (http://www.genome.clemson.edu/gdr/), and 27 from the collection of ESTs of INRA-Bordeaux (the EPPB series). The four-digit number given to the EST-derived microsatellites corresponds to the last four numbers of the accession number of the sequences from which they were obtained. We started with 220 EST sequences containing a microsatellite, but 40 of them were duplicates of other sequences already included. Using the methodology described by GEORGI et al. (2002), nine SSRs (pchgms48, 49, 51, 55, 56, 57, 59, 60, and 61) were obtained by searching for microsatellite sequences in peach BAC clones, which contained RFLP probes detecting markers located in different genome regions (AG37, Pru1, AG2, AG12, AG44, AC43, AG56, AC55, and B4A9, respectively). An additional SSR, pchgms58, in the BAC "Nemared" clone 39B10, was also studied. Finally, the DREa microsatellite was found in the sequence of a dehydration-responsive element-binding protein homolog of Prunus. Sequences of all markers reported in this article are recorded in GenBank.
DNA was extracted in Cabrils as previously described (VIRUEL et al. 1995) and transferred to Udine, Tsukuba, and Bordeaux for analysis. Methods for PCR amplification, electrophoretic separation, and labeling were those currently used in the laboratories of the authors (YAMAMOTO et al. 2001; ARANZANA et al. 2003; DIRLEWANGER et al. 2004b; TESTOLIN et al. 2004). Data of Bordeaux, Tsukuba, and Udine were double checked at Cabrils, and those of Cabrils were checked independently by two of the authors. The joint genotype of each marker was used to map each marker by visually matching the joint genotype with that of the set of bins obtained in the framework map.
| RESULTS |
|---|
|
|
|---|
The final unit of analysis was Earlygold, the MB1-73 hybrid plant and the F2 plants 5, 12, 23, 30, 34, and 83 (Figure 2). When analyzed for the 401 SSR primer pairs, 253 (63%) were polymorphic, giving 264 loci. For 243 primer pairs, we found a single polymorphic locus, and 10 segregated for more than one locus. Nine of these resulted in two polymorphic loci and one resulted in three. From the 148 SSR primer pairs (37% of all SSR primer pairs used) that did not yield any scorable polymorphism, 97 (24%) produced a monomorphic band in the progeny studied, 8 (2%) produced a multi-banded pattern difficult to score, and 43 (11%) did not amplify. Of the 350 primer pairs that produced scorable patterns (97 monomorphic plus 253 polymorphic), 72% segregated in T x E. The characteristics of the bins identified and the positions of the markers added to this map are listed in Table S1 at http://www.genetics.org/supplemental/.
|
For the SSRs where the peach allele was dominant we selected the AC set, consisting of six more plants (15, 27, 57, 74, 102, and 117). The combined information of sets AHB and AC resulted in 66 bins, all corresponding to a different joint genotype, with a maximum bin size of 25.0 cM. The same was done with the almond allele dominant (BD set), and we found that plants 3, 7, 17, 64, 91 and 95, plus the AHB set, detected 59 different bins, the longest being 20.1 cM. The bins detected with these new sets did not always match perfectly the bins found with the AHB set (data not shown).
From the 35 dominant markers found, 18 were dominant for the almond allele and 17 for the peach allele. Fourteen dominant markers (40%) could be assigned either to a single bin (31%) or to two contiguous bins on the same linkage group (9%), using only the AHB set. With the six additional plants for 18 of the 21 dominant markers assigned to more than one linkage group, we found that all of them were in one or two contiguous bins of the AHB set.
New markers were found in 60 of the 67 bins. The bins without new markers were small, including only two to six markers of the previous T x E map. Those with the largest number of markers coincided with some of the longest and more populated bins in T x E. The bin where most new SSRs were located was 1:50, with 15 SSRs and an interval of 14.1 cM (30 markers in the previous T x E map), followed by bin 3:37 with 13 SSRs with an interval of 11.7 cM (27 markers), and bin 7:25, with 12 markers and the longest interval, of 24.7 cM (24 markers).
| DISCUSSION |
|---|
|
|
|---|
When using bin sets of small size, several bins (i.e., duplicate bins) may correspond to the same joint genotype. This is more likely to occur in maps with longer total distances when using the same bin set size, because more bins are expected, and the number of possible joint genotypes remains the same. This can be solved by increasing the bin set size proportionally to the map length. Duplicate bins were found in both the set of plants selected visually and that obtained with the Mappop software, but in the first case their number was lower. We adopted the visual approach, but it is time consuming and it cannot be discounted that a more efficient set of six plants exists in T x E. Improving the Mappop software to detect possible duplicate bins and to provide different sets of plants ordered by their efficiency in different respects (number of duplicate bins, maximum bin size, and average bin size), allowing the users to choose the set more appropriate for their needs, would be a solution.
A total of 264 new SSRs, obtained from 401 microsatellite primer pairs, could be placed on the T x E map with an average accuracy of 7.8 cM, using only 6 plants of the population instead of 88, i.e., with less than one-tenth of the effort. The current number of markers on this map is now 826, which, considering that the total distance is 524 cM (DIRLEWANGER et al. 2004a; this work), corresponds to an average map density of 0.63 cM/marker. The number of SSRs of T x E has increased from 185 to 449 (average map density of 1.2 cM/SSR). We found only two new bins, indicating that the coverage of the T x E map is almost complete.
The analysis of a reduced number of plants implies that any scoring errors could lead to gross mistakes in the assignment of the marker position. In our case, the number of possible bins obtained with a codominant marker in the set of six plants is 36 = 729. We found only 67 bins, and predicted 5 more, considering the five cases where two contiguous bins are separated by two recombination events instead of one, as in the other cases. Additional bins may be found in the extremes of each linkage group, although we considered this an event with low probability, given the high level of saturation of the T x E map. Thus, considering the 72 predicted and the 729 possible bins, erroneous interpretation would give a new bin 90% of the time. Submitting markers that detect new bins to a more exhaustive analysis, i.e., confirmation with a larger set of plants or the analysis of the whole population, would make bin mapping a very robust method against errors.
Some of the map positions of the markers were expected, such as those of seven mapped SSRs (the pchgms series) developed from BACs that contained another mapped marker, or for EPPCU2288 and EPPCU6309, that correspond to different SSRs located on the same gene. In all these cases, the pairs of markers expected to be in the same physical region were also located in the same bin. Thirty-three more SSRs have already been placed on other maps, 26 in an intraspecific peach F2 (YAMAMOTO et al. 2001; T. YAMAMOTO, unpublished data), and 17 in a three-way interspecific progeny, involving almond, peach, and myrobolan plum (DIRLEWANGER et al. 2004b). All but one of them fell in bins located in the expected linkage group and in a similar position to that found in these maps. The exception was MA040a on one of the ends of G3 of the myrobolan plum map (DIRLEWANGER et al. 2004b) while it was expected to be in bin 6:74. The same marker was located on the expected G6 region by T. YAMAMOTO (unpublished data) in the "Akame" x "Juseitou" peach F2. This may be because the MA040a primers detect an additional locus in myrobolan plum, the locus is misplaced in this map, or there is a small genetic rearrangement in this species.
One area where bin mapping would be particularly efficient is in the candidate gene approach, where a large number of possible candidates must be tested for colocation with specific genes or QTL. The use of bin mapping combined with the high polymorphism of T x E (72% of the scorable microsatellite primer pairs segregated) would facilitate this task, allowing the selection of only those candidates that fall into the target regions, which could be later studied in more detail in the whole T x E population or in other populations. Another important advantage of the bin mapping approach is to facilitate the scientific community access to a reference mapping population and to cooperate in placing new markers or characters by exchanging a limited number of vegetatively propagated plants or DNA samples.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
| LITERATURE CITED |
|---|
|
|
|---|
ARANZANA, M. J., A. PINEDA, P. COSSON, E. DIRLEWANGER, J. ASCASIBAR et al., 2003 A set of simple-sequence repeat (SSR) markers covering the Prunus genome. Theor. Appl. Genet. 106: 819825.[Medline]
BYRNE, D. H., 1990 Isozyme variability in four diploid stone fruits compared with other woody perennial plants. J. Hered. 81: 6871.
CIPRIANI, G., G. LOT, W. G. HUANG, M. T. MARRAZZO, E. PETERLUNGER et al., 1999 AC/GT and AG/CT microsatellite repeats in peach (Prunus persica (L) Batsch): isolation, characterisation and cross-species amplification in Prunus. Theor. Appl. Genet. 99: 6572.[CrossRef]
DIRLEWANGER, E., E. GRAZIANO, T. JOOBEUR, F. GARRIGA-CALDERÉ, P. COSSON et al., 2004a Comparative mapping and marker assisted selection in Rosaceae fruit crops. Proc. Natl. Acad. Sci. USA 101: 98919896.
DIRLEWANGER, E., P. COSSON, W. HOWAD, G. CAPDEVILLE, N. BOSSELU et al., 2004b Microsatellite genetic linkage maps of Myrobalan plum and an almond-peach hybridlocation of root-knot nematode resistance genes. Theor. Appl. Genet. 109: 827832.[CrossRef][Medline]
GEORGI, L. L., Y. WANG, D. YVERGNIAUX, T. ORMSBEE, M. IÑIGO et al., 2002 Construction of a BAC library and its application to the identification of simple sequence repeats in peach (Prunus persica (L.) Batsch). Theor. Appl. Genet. 105: 11511158.[CrossRef][Medline]
JOOBEUR, T., M. A. VIRUEL, M. C. DE VICENTE, B. JÁUREGUI, J. BALLESTER et al., 1998 Construction of a saturated linkage map for Prunus using an almond x peach F2 progeny. Theor. Appl. Genet. 97: 10341041.[CrossRef]
LANDER, E. S., P. GREEN, J. ABRAHAMSON, A. BARLOW, M. J. DALY et al., 1987 MAPMAKER: an interactive computer package for constructing primary genetic linkage maps of experimental and natural populations. Genomics 1: 174181.[CrossRef][Medline]
MESSINA, R., O. LAIN, M. T. MARRAZZO, G. CIPRIANI and R. TESTOLIN, 2004 New set of microsatellite loci isolated in apricot. Mol. Ecol. Notes 4: 432434.[CrossRef]
MNEJJA, M., J. GARCIA-MAS, W. HOWAD, M. L. BADENES and P. ARÚS, 2004 Simple-sequence repeat (SSR) markers of Japanese plum (Prunus salicina Lindl.) are highly polymorphic and transferable to peach and almond. Mol. Ecol. Notes 4: 163165.
STRUSS, D., R. AHMAD, S. M. SOUTHWICK and M. BORITZKI, 2003 Analysis of sweet cherry (Prunus avium L.) cultivars using SSR and AFLP markers. J. Am. Soc. Hortic. Sci. 128: 904909.
TANKSLEY, S. D., N. D. YOUNG, A. H. PATERSON and M. W. BONIERBALE, 1989 RFLP mapping in plant breeding: new tools for an old science. BioTechnology 7: 257263.[CrossRef]
TESTOLIN, R., R. MESSINA, O. LAIN, M. T. MARRAZZO, W. H. HUANG et al., 2004 Microsatellites isolated in almond from an AC-repeat enriched library. Mol. Ecol. Notes 4: 459461.[CrossRef]
VIRUEL, M. A., R. MESSEGUER, M. C. DE VICENTE, J. GARCIA-MAS, P. PUIGDOMÈNECH et al., 1995 A linkage map with RFLP and isozyme markers for almond. Theor. Appl. Genet. 91: 964971.
VISION, T. J., D. G. BROWN, D. B. SHMOYS, R. T. DURRETT and S. D. TANKSLEY, 2000 Selective mapping: a strategy for optimizing the construction of high-density linkage maps. Genetics 155: 407420.
YAMAMOTO, T., T. SHIMADA, T. IMAI, H. YAEGAKI, T. HAJI et al., 2001 Characterization of morphological traits based on a genetic linkage map in peach. Breed. Sci. 51: 271278.[CrossRef]
Communicating editor: S. R. MCCOUCHThis article has been cited by other articles:
![]() |
S. Jung, M. Staton, T. Lee, A. Blenda, R. Svancara, A. Abbott, and D. Main GDR (Genome Database for Rosaceae): integrated web-database for Rosaceae genomics and genetics data Nucleic Acids Res., January 11, 2008; 36(suppl_1): D1034 - D1040. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |