- Open Access
Genetic diversity and structure of Elymus tangutorum accessions from western China as unraveled by AFLP markers
Hereditas volume 156, Article number: 8 (2019)
Understanding genetic diversity of wild plant germplasm and the relationships between ecogeographic and genetic characteristics may provide insights for better utilizing and conserving genetic resources. Elymus tangutorum (Nevski) Hand.-Mazz, a cool-season hexaploid perennial, is an important pasture bunchgrass species used for forages and grassland restoration in Qinghai-Tibet Plateau and northwest China. In this study, 27 E. tangutorum accessions from diverse origins of western China were evaluated using AFLP markers in an effort to delve into the genetic relationships among them. The effects of eco-environmental factors and geographical isolation on the genetic diversity and population structure were also elucidated.
On account of 554 polymorphic fragments amplified with 14 primer combinations, the mean values of some marker parameters including polymorphic information content, resolving power and marker index were 0.2504, 14.10 and 23.07, respectively, validating the high efficiency and reliability of the markers selected. Genetic dissimilarity index values among accessions ranged from 0.1024 to 0.7137 with a mean of 0.2773. STRUCTURE, UPGMA clustering and PCoA analyses showed that all accessions could be divided into the three main clusters; however, this results do not exactly coincide with geographic groups. We found medium differentiation (FST = 0.162) between Qinghai-Tibet Plateau (QTP) and Xinjiang (XJC), and high differentiation (FST = 0.188) among three Bayesian subgroups. A significant correlation (r = 0.312) between genetic and geographical distance was observed by Mantel test in the species level, while the weak correlation was detected between genetic and environmental distance for all accessions and most of geographical groups. In addition, a significant ecological influence of average annual precipitation on genetic distance was revealed in XJC group and the Bayesian subgroup A.
This study indicates that AFLP technique are a useful tool to measure genetic diversity in E. tangutorum, showing that geographical and environmental factors (especially precipitation) together, play a crucial role in genetic differentiation patterns. These findings underline the importance of local adaptation in shaping patterns of genetic variability and population structure in E. tangutorum germplasm collected in Western China.
Elymus Linn. is the largest and most widely distributed genus in the tribe Triticeae, with 150 species grown in temperate regions of the world . This genus has a closer phylogenetic relationship with some of the important cereal crops, such as wheat, barley, rye and triticale . Therefore, it may serve as a valuable natural gene pool of desirable traits for the improvement of these crops [2, 3]. Besides, many Elymus species are also used as fodder grasses and ecological protection . Elymus tangutorum (Nevski) Hand.-Mazz, a perennial hexaploid species with the StYH genome (2n = 6x = 42), along with E. duhuricus Turcz. ex Griseb., E. excelsus Turcz. ex Griseb. and E. ivoroschilowii Probat. constitute the E. dahuricus complex [4, 5]. E. tangutorum differs morphologically from E. dahuricus Turcz. in the light of the short upward awns and its distribution in the Qinghai-Tibet Plateau (QTP), Xinjiang province of China and the Alpine regions of Central Asia. In QTP, E. tangutorum is widely used for the degraded grassland restoration, due to high productivity, drought resistance, cold tolerance, and adaptability . As one of the more important forage grasses in QTP, E. tangutorum has been widely studied. Most of studies mainly focused on the domestication, cultivation, phylogenesis and genome constitution [4, 6].
Understanding genetic diversity of wild plant germplasm and the relationships between ecogeographic and genetic characteristics may provide insights for better utilizing and conserving genetic resources . Investigation of germplasm diversity can be implemented via morphological and molecular means. Previous analyses based on agro-morphological characters and geographical origin indicated that a wide range of phenotypic divergence occurred among Elymus accessions of ecotypes, and/or cultivars [8, 9]. DNA-based markers have been regarded as practical tools with high efficiency and wide genome coverage in illuminating the pattern of genetic diversity and phylogenetic relationships in plant germplasm resources, with various advantages over phenotypic traits such as being not subject to environmental influences, potential of unlimited numbers, and assay from any development stage. In Elymus species, diverse PCR-based markers, employed in diversity and phylogeny studies, such as RAPD , ISSR , SRAP , ScoT [9, 13], and AFLP [14,15,16] have become prevalent as any prior sequence information is not required. Among different marker systems available at present, AFLP method could explore variation throughout the entire genome, including both coding and non-coding DNA regions and may therefore genome-wide variation was allowed. Moreover, AFLP offers high reproducibility and multilocus polymorphisms simultaneously identified using a single assay. These characteristics make the AFLPs extremely appropriate for molecular characterization of germplasm collection. Although previous work has provided preliminary data to characterize genetic diversity between and within two E. tangutorum populations from Tibet province of China by AFLP analyses [4, 5], until today there is no detailed study has been conducted for molecular characterization using a larger germplasm collection.
Genetic diversity in natural plant populations is held especially by the spatio-temporal environmental heterogeneity, i.e., genetic differentiation is strongly influenced through isolation by distance or local adaptation [17,18,19]. To estimate the impact of environmental factors on the genetic diversity is indispensable to gain a deeper insight into such evolutionary forces [20,21,22]. Thus, the combination analysis of molecular markers and eco-geographical data can provide beneficial information for taking up suitable strategies for utilization and conservation of wild plant germplasms [19, 23, 24].
Here, AFLP markers were used to analyze, on a regional scale, genetic diversity and structure of 27 wild E. tangutorum germplasm accessions indigenous from two contrasting climatic zones of Western China, namely Qinghai-Tibet plateau and Xinjiang province (Fig. 1). The goals of the current study were (1) to evaluate and compare the genetic diversity and population structure of E. tangutorum germplasm from different areas, and (2) to verify the potential influence of spatial and environmental factors on the patterns of detected population structure.
Plant materials and DNA extraction
Twenty seeds of each accession of E. tangutorum were germinated in vermiculite under identical conditions (25 °C, 300 μmols·m2·s− 1; 16-h photoperiod) in a plant growth chamber. Genomic DNA was extracted from bulked young leaves from five plants per accession employing a CTAB procedure . The quality and quantity of extracted DNA was assayed by 1% agarose gel electrophoresis and NanoDrop® spectrophotometer and was diluted with sterile distilled water to give a final concentration of 100 ng/μL.
AFLP fingerprinting was carried out following Vos et al. (1995) with minor modifications [26, 27], but the primers were labeled with 6-FAM fluorescent dye at the 5’end. Briefly, 300 ng of total genomic DNA was digested using the two restrictive enzymes EcoRI and MseI, following ligation of proper adapters to the restricted DNA fragments. After ligation reactions, pre-selective amplification was performed with (EcoRI + 1)/(MseI + 1) primers. Then the products from the previous step were applied to selective amplification using (EcoRI + 3)/(MseI + 3) selective primers. Initially, 48 selective primer combinations were screened on six accessions, and 14 pairs were chosen based on total number of fragments detected, fragment robustness and polymorphism (Additional file 1: Table S1). The automated separation and detection were fulfilled on an ABI 3730xl sequencer (Applied Biosystems) with the assistance of GeneScan 500 ROX as internal size standard. Genemarker 2.2 (SoftGenetics) was used to score fragments of 60–500 bp with a peak height above or equal to 100 reflective fluorescent units (RFUs). Then AFLP data from all primers was transferred into a binary matrix, scored as“1” (presence of fragment) and “0” (absence of fragment).
The discriminatory power of each AFLP primer combination was evaluated by calculating polymorphic information content (PIC), marker index (MI), and resolving power (RP) with polymorphic bands only. The formula to calculate PIC value of each primer combination is: PICi = 2 fi (1 − fi), where PICi is the PIC of ith marker, fi stand for the frequency of the present fragments and 1 − fi represent the frequency of absent fragments . Marker index (MI) was calculated as MI = PIC × EMR, where EMR (effective multiple ratio) is defined as the product of the proportion of polymorphic loci (PP) and the number of polymorphic loci (NPF), namely the total number of polymorphic fragments per primer . The resolving power (RP) of each primer was estimated as Rp = ∑Ib, where Ib (fragment informativeness) defined as Ib = 1–2 × |0.5 − p|, where, p is the proportion of accessions in which the fragment is present .
To understand the genetic relationship among all 27 accessions, pair-wise genetic distance (GD) were calculated on the basis of the Dice’s coefficient by NTSYS-pc v2.10 software. To obtain a more detailed view of the distribution of genetic variation within and between different geographical groups (XJ vs QTP), mean GD among accessions belonging to the same and/or different groups was also calculated. Based on GD matrix, the principal coordinate analysis (PCoA) and Unweighted Pair Group Method with Arithmetic (UPGMA) clustering analysis were performed. The robustness of the dendrograms was assessed with 1000 bootstrap iterations by Freetree software . The Bayesian model-based clustering method was applied to explore the population structure within the studied accessions or genetically diverse population (or groups) using the STRUCTURE software v.2.3.4 (http://web.stanford.edu/group/pritchardlab/structure.html). Following the suggestion by Forsberg et al.  for outcrossing species, the admixture model was adopted and the heterozygous loci were treated as missing data. Population structure analysis was performed on AFLP data of all 27 accessions without any prior classification based on the Bayesian model-based method. Evanno’s ad hoc ΔK statistic was used to determine the most probable clustering number of subgroups . We set the number of clusters (K) from 1 to 5 (10 times for each K) for accurate assignments of accessions. For each run, the burn-in period of 50,000 iterations and a run length of 500,000 MCMC replications were implemented. Considering that the indication of the most likely number of groups, namely LnP(D) usually plateaus or increases slightly after the “right K” is reached , we used Structure Harvester online software (http://taylor0.biology.ucla.edu/struct_harvest/) to detect optimum value of K based on ΔK method . The output files for the selected K based on ten independent runs were then processed in CLUMMP  using the LargeKGreedy algorithm and the G’pairwise similarities. Results were exported to DISTRUCT  for better graphical presentation. We also executed spatial population analysis using TESS 2.3.1 , which has been described as carrying high percentage of correct inference than other Bayesian clustering algorithm under certain conditions . Geographical coordinates of each accession were contained in AFLP dataset as prior information, and the MCMC algorithm was run under BYM admixture model with 50,000 sweeps and10 000 burn-in. Fifty independent iteration analyses were performed at each value of clusters K (2~5).The value of K with the highest likelihood was determined according to the deviance information criterion (DIC), and the results of runs with 20% lowest DIC values were processed using CLUMMP and DISTRUCT, respectively, as did in the STRUCTURE analysis. Tess outputs was further visualized in a geographical perspective using an R script (POPSutilities) written by Flora Jay .
To validate the genetic variation among and within inferred subgroups according to geographic origins or STRUCTURE results, analysis of molecular variance (AMOVA) based on PhiPT (analogue of FST) value was using GenAlEx 6.5 was performed. The observed (Na) and effective number (Ne) of alleles, expected heterozygosity (HE), Shannon’s information index (I) , and Nei’s genetic distance among geographic groups were also calculated by GenAlEx.
Based on geographical location of each accession, the corresponding environmental data (precipitation and temperature parameters) was obtained by DIVA-GIS (version 188.8.131.52, http://swww.diva-gis.org/). The geographic and environmental distance matrix was calculated based on the Euclidean distance between pairs of collection sites using NTSYS-pc v2.10. Correlation between geographical and genetic distances (Nei-Li coefficients) between accessions was measured by Mantel test with 9999 permutations using IBD (Isolation by Distance) software v1.53 . A similar analysis was conducted to compare the genetic distance and environmental distance (altitude, precipitation and temperature) for accessions in different geographic groups.
The AFLP fingerprinting was performed on 27 wild E. tangutorum accessions using 14 primer combinations. A total of 554 scorable fragments were detected with a mean of 40 per assay unit, of which 509 (91.7%) were polymorphic and 70 (12.6%) were unique (Tables 1 and 2). The size range of fragments scored across accessions was 60 to 500 bp. The total number of amplified fragments (TNF) per primer pair varied from 30 to 47 and the number of polymorphic fragments per primer pair (NPF) ranged from 26 (E46M51, E-ATT/M-CCA) to 41 (E43M55, E-ATA/M-CGA). The number of unique fragments per primer pair (NFU) ranged from one (for E46M60) (NUF = 1) to 10 (for E40M59 and E50M72). The percentage polymorphism varied from 80.95 to 97.56% with an average of 91.77% per primer combination. This result suggests that E. tangutorum germplasm from western China could be characterized well by above AFLP dataset retaining abundant variation. Polymorphism information content (PIC) of each primer pair being 0.5 for AFLP as the dominant marker, 38% of polymorphic loci (195) possessed PIC values above 0.25, implying high discriminatory power of AFLP markers for E. tangutorum germplasm. In addition, the marker index (MI) and resolving power (RP) values for individual primer pair were recorded. There were positive correlations between PIC, RP and MI (r = 0.98, 0.90 and 0.87, respectively). The most discriminatory primer combinations among 14 primers were E-ATG/M-CTC (E45M60, PIC = 0.3355, MI = 32.73, RP = 20.74) followed by E-AAC/M-GAC (E32M64, PIC = 0.3099, MI = 29.32, RP = 17.63).
Population structure analysis
The results of population using the STRUCTURE software showed a clear peak of the ΔK values when K was set at 2 (Fig. 2),which indicate that the sampled accessions belonged to two inferred genome fraction (Fig. 3). Samples with a inferred genome fraction value (membership proportion) of 0.8 or more to a subgroups are considered as pure, and less than 0.80, as admixture . Considering individual’s membership proportion (Qi), all 27 accessions were assembled into three Bayesian subgroups designated as subgroup A (pure, blue fraction ≥0.8), subgroup B (admixture, 0.2 ≤ blue fraction ≤0.8) and subgroup C (red fraction ≥0.2). The three subgroups included 10, 8 and 9 accessions, respectively. Geographically, most of accessions from Xingjiang (XJC) (10, 71.43%) belonged to subgroup A and the rest resided in subgroup B (Et20) and C (Et04 and Et13). Six accessions from Sichuan (SCC) were part of subgroup B and the five remaining accessions were part of subgroup C. For the three accessions from Gansu (GSC), Et18 consisted in subgroups B and the remaining two belonged to subgroup C.
The Bayesian clustering results by TESS 2.3.1 had lowest DIC values (highest probability) at Kmax = 4, indicating that the sampled accessions belonged to four inferred genome fraction (Additional file 2: Figure S1). This pattern is slightly different from the result from Structure runs in which the optimal K = 2. However, Considering the proportion of individual’s membership (Qi), the Bayesian subgroups based on TESS was almost exactly equivalent to the Structure result, both of which have the two dominant ancestry coefficients (Fig. 3). On the other hand, the geographic map displaying the Q matrix spatially by R script  also exhibited a general trend that the samples from XJC is obviously different from the samples from QTP, although the number of collection sites were limited and the spatial mapping for each accession is modeled over the whole Asia area rather than just considered at actual sampling sites (Additional file 3: Figure S2).
Cluster and principal coordinate analysis
Based on the 509 polymorphic fragments, estimates of the genetic distances (Dice’s coefficient) ranged from 0.1024 to 0.7137 with an average of 0.2773, implying a largely genetic divergence in the germplasm collections studied. Accessions Et08 and Et11 were most related due to a minimum genetic distance of 0.1024, while Et01 and Et23 were most divergent with the maximum genetic distance of 0.7137. The rest of accessions have exhibited different middle levels of genetic distances. Compared with 0.2611 of inter-group dissimilarity, the accessions from Xinjiang were more homogenous with lower dissimilarity values (0.2214), whereas the QTP accessions were more variable genetically with higher dissimilarity values (0.2785). These results were also confirmed by STRUCTURE analysis that most of Xinjiang accessions had pure type of genome fraction. The genetic distance matrix was used to build a hierarchical dendogram of genetic relationships based on the UPGMA clustering method (Fig. 3), in which three major clades were clearly categorized with further internal subgroupings. These three clades were identified at the 0.259 dissimilarity level. Grouping of most of accessions was congruent with their respective geographical origins. However, the accession Et20 and Et13 from Xinjiang did not separate clearly from QTP accessions. The bootstrap support of all major branches were higher than 50%, and a majority of them ranged from 70 to 99% revealing reliability of data and clustering results. Despite some differences, there was coherence between Bayesian method and hierarchical clustering analysis (Fig. 3). The UPGMA dendrogram could separate well three Bayesian subgroups, except for the accessions Et27, Et19 and Et23 were revealed as outliers.
A similar clustering patter of the studied germplasm was demonstrated by the principal coordinate analysis (PCoA) (Fig. 4) illustrating that scatter-plot dipiction of PCoA is comparable to either of the hierarchical and Bayesian cluster analysis. That is, most of studied accessions were separated according to their geographical groups and/or the locations sites then put in the same cluster. PCoA was performed based on Nei-Li distances and confirmed division of 27 accessions into three major clusters: ClusterI, II and III. The first two principal coordinates represented the total of 69.1% of the total molecular variation (60.25, and 8.65%, respectively). The first axis (PC1), which contributed 60.25% of the variance, separated most of accessions, which have been assigned to three UPGMA clusters. The second axis (PC2),explained 8.65% of the variation, could further distinguish the accessions from cluster I and III.
Genetic diversity estimates
Geographically, the region of Gansu and Sichuan Province are both part of the QTP region. Hence, we merged the GSC and the SCC group into a new group named QTP group. The total gene diversity across different ecogeographical regions was estimated based on 509 polymorphic fragments (Additional file 4: Table S2). The percentage of polymorphic loci (PP) was found to be higher in accessions of QTP (73.29%) than those of Xinjiang (70.76%). The Xinjiang group (XJC) showed high Nei’s indices of gene diversity (He) of 0.24 and Shannon’s diversity (I) of 0.36, whereas QTP group displayed a moderate levels of diversity (He = 0.21and I = 0.32). For the three Bayesian subgroups, the PP value was found to be highest among accessions from subgroup A (67.69%) followed by subgroup C (65.16%). The markers were least informative for assessment of genetic variability of accessions from subgroup B with PP restricted to 53.25%. Similarly, subgroup A showed the highest intra-group diversity (I = 0.36 and He = 0.24), whereas subgroup B had the lowest intra-group diversity (I = 0.01 and He = 0.0.01).
The two independent AMOVA analyses implemented relying on geographic origins and STRUCTURE clustering results and a highly significant variation (P < 0.01) was detected (Table 3). For regional groups, 16.24% of the total variation was due to differences among populations and 73.90% variation was due to divergence within groups. Both of the pairwise FST values of two regional groups were significant (P ≤ 0.01) which were tested by GenAlex 6.5. The total FST = 0.162 indicated that the genetic variability is moderate between QTP and XJC groups. The three subgroups inferred from STRUCTURE showed 81 and 19% of total variance within and among the groups. This means that the total FST was 0.188 (P ≤ 0.01) and the derived three subgroups are highly structured by Bayesian inference panel. Significant differences in the FST were observed for all the pairwise comparisons of three subgroups inferred from STRUCTURE analysis (Table 4). The maximum pairwise FST (0.274) found between subgroup A and subgroup C was confirmed by their great difference of inferred genome fraction (Table 4). In addition, AMOVA analysis indicate that the level of gene flow either between geographic areas or between Bayesian clusters was low and insufficient to make homogenization in the E. tangutorum collections of western China.
To verify the eco-geographical factors influencing genetic structuring, Mantel’s test was used to estimate matrix correlations between genetic distance and geographical, elevation, average annual temperature, and average annual precipitation distances. A weak but significant pattern of isolation by geographic distance was detected within all accessions (r = 0.312; P = 0.001). A similar result was found in each group except GSC (Table 5). This pattern derives largely from measurable differentiation among geographic groups seen in STRUCTURE and UPGMA clustering analysis. Furthermore, there was a faint or non-significant correlation between distance matrices of genetic and environmental factors for all accessions. However, a significantly positive correlation was demonstrated between genetic distance and the average annual precipitation distance in XJC (r = 0.685; P = 0.01) and Subgroup A (r = 0.594; P = 0.05) (Table 5). In addition, relatively high Mantel r-value were observed between environmental factors and genetic distance for Gansu accessions, all of which were non-significant due to a small group size (only 3 accessions).
AFLP polymorphism and discriminating capacity of the assays
The practical availability of molecular markers in plant germplasm characterization depends on their power to discriminate the genotypically different accessions analyzed. In the present paper, the efficiency of AFLPs were assessed by recording different marker parameters concerning polymorphism, PIC (Polymorphic Information Content), MI (marker index) and RP (resolving power). In total, 509 AFLP polymorphic fragments with 91.77% of polymorphism were observed in the current work, which was obviously higher than previous works by AFLP in E. tangutorum and other related Elymus species. In the studies of E. dahuricus complex diversity, 69.35, 58.79, 29.65 and 27.14% of polymorphic fragments for E.excelsus, E.dahuricus, E.woroschilowii and E. tangutorum was obtained respectively . In three Elymus species indigenous to East Asia, 34.2, 30.1 and 51.5% of polymorphism was detected for E. tsukushiensis, E. humidus and E. dahuricus respectively . So far, the primer combinations used in this study successfully established high levels of genetic distinctness among the studied E. tangutorum accessions using AFLP fingerprints. The level of polymorphism obtained here with AFLP markers is also higher than that reported for other Elymus germplasm in western China by other types of dominant markers such as E. sibiricus by ScoT (89%)  and E. nutans by ISSR (91.4%) . In all, the participation of polymorphic fragments largely relies on the species, origin ecogeographic area and the differentiation degree of sampled accessions, amount of primers and even type of molecular marker used [42, 43]. Moreover, as all primer combinations produced unique fragments (on average, 5 per primer), those primers could be used to differentiate particular accessions .
The polymorphic information content (PIC) values of 0.25 in current study demonstrated good marker discriminatory power in view of PIC value starting from 0 to 0.5 for dominant marker like AFLPs [45, 46]. This values also agree with AFLP studies done with other forage grass such as Dactylis glomerata  and Phalaris aquatica .The Marker index (MI) and resolving power (RP), two importantly alternative parameters in selecting informative markers for diversity studies [49, 50], had also displayed high efficiency of AFLP markers in unveiling polymorphisms in E. tangutorum germplasm. A very strong and positive correlation observed among PIC, RP and MI value in present work indicates the use of any of three parameters to screen the informative primer combinations ; although the relationship between the informativeness of molecular markers for identifying germplasms and these marker parameters is not totally clear . Primer combinations E-ATG/M-CTC and E-AAC/M-GAC should be the most informative due to the highest records on PIC, RP and MI. Therefore, they are recommended for use in germplasm diversity analyses of other E. tangutorum collections. The Shannon index is an accurate alternative measure of diversity due to no need for estimate of allele frequencies under Hardy-Weinberg equilibrium. The Shannon diversity detected in this study (0.34) was higher than those of E. sibiricus germplasm from western China by SCoT (I = 0.285) and EST-SSR (I = 0.237) markers [13, 51], also indicating a high level of genetic diversity in the accessions of E. tangutorum (Table 2).
Clustering pattern and genetic structure
It is attractive for breeders and germplasm curators to know the width of genetic diversity within a plant species of great importance . The spatial genetic diversity analysis was conducted to verify the relationship between Elymus tangutorum accessions and biogeographical patterns by different clustering approaches. The UPGMA tree, the PCoA scatter plot, and the STRUCTURE analyses revealed a similar or identical membership among 27 wild E. tangutorum accessions, with a general grouping pattern including three genetically distinct and consistent groups. When these accessions are grouped according to clusters distinguished(Fig. 3), a greater range of Nei-Li distances is observed, which is comparable to previous studies for other Elymus germplasm with similar collection areas [13, 53, 54]. The first cluster derived from UPGMA tree consisted of ten accessions from Xinjiang Province in southwest of China, having a typical temperate continental climate. Meanwhile, Cluster I had the distinctly pure membership based on Bayesian clustering. The second cluster was primarily formed by accessions from southeast of Qinghai-Tibet Plateau (QTP), a region greatly far from Xinjiang, having a typical plateau mountain climate. Cluster II was corresponding well to the admixture subgroup STRUCTUE by analysis. The third cluster was also chiefly comprised by the accessions from southeast of QTP, with another distinctly pure membership. In short, the above results were in agreement with the geographical proximity of studied accessions and their genetic relationships.
Exceptions to this pattern of clustering were three accessions (Et04, Et13 and Et20) from Xinjiang,which were grouped with QTP accessions in Clade II and Clade III respectively. Since E. tangutorum is used in natural grassland restoration projects , it is possible that these three accessions are not native accessions from QTP and might have been introduced through domestic trade of seeds from Xinjiang. By contrasting the figures of UPGMA, PCoA and STRUCTURE analyses, two accessions (Et19 and Et27) assigned to Bayesian Subgroup II, fall into outgroups of UPGMA analyses. The possible reason causing such relationships may be STRUCTURE assumes that the loci within a population are under Hardy–Weinberg equilibrium (HWE) and it is susceptible to many factors such as non-random mating, genetic drift, mutations, gene flow and selection . Nonetheless, in different clustering analyses, it was largely found that separate clusters were often obtained in accordance with the respective geographic origin of accessions. Therefore the cluster pattern in present work might be due, in part, to differences in the adaptation of ecotypes to various selection forces from local conditions . These results are in agreement with earlier studies which showed that geographical separation of other Elymus species germplasm did not always result in greater genetic differentiation [12, 57].
Indeed, the genetic diversity and population structure are determined by joint effects of many factors including geographic distribution, life cycle, mating system, selection and adaptation . Considering the genetic differences among studied regions, the accessions from Xinjiang province showed higher diversity (I = 0.24, He = 0.36) than those from QTP (I = 0.21, He = 0.32). With regard to the Bayesian subgroups, higher diversity was found among accessions in subgroup A, which correspond to accessions from Xinjiang province. Based on a membership probability threshold of 0.80, most accessions (66.67%) such as subgroup A and C appeared to have a pure population ancestry in their genome component, which might result from limited gene flow among wild accessions with distantly geographical origin . This explains why we have observed high diversity within geographical groups. However, more than half of the accessions from QTP were composed of admixture of genome fraction. Such case could be supported by the fact that E. tangutorum was frequently used to grassland restoration in Qinghai-Tibetan Plateau as mentioned above [6, 60]. Therefore, human activities may lead to frequently natural crossing by wind pollination between adjacent stands.
Results of AMOVA analysis showed low genetic variation among regions accounted for only 16.2% of the germplasm collection, whereas within-region variation accounts for 83.8% of the total variability. Similarly, only 19% of variation was detected for Bayesian groups,. These results on variation among accessions within geographical groups were confirmed with previous studies for E. nutans and E. sibiricus collections with similar origins [12, 57]. However, our results are more extreme. This could be a consequence of high gene flow resulting from grassland restoration project or the limited number of studied accessions for each region. The low among-group variance components indicated that the genetic background attributable to the geographical origin contributes slightly to the observed genetic diversity. Besides, the high gene flow and differentiation within E. tangutorum, supported by clustering and AMOVA, was also probably driven by its mating system, which is one of the important life-history feature that strongly influence genetic diversity and population structure . As reported in genetic diversity study for Elymus dahuricus complex , E. tangutorum is probably a predominantly self-pollinated species with 0.9 of FST value. In their review, Hamrick and Godt  claimed that short-lived, self-pollinating species have most of their genetic diversity partitioned among populations rather than within populations. Similarly, Nybom  assumed that short-lived self-pollinating species allocate most of their genetic variability within populations. Each E. tangutorum accession in present investigation could be treated as a population consisting of seeds of neighboring individual plants collected from differently distant locations . This situation may limit gene flow between accessions by geographic isolation barriers (i.e. mountains, rivers), thus the variation will maintain its characteristics within each accession. Although the within-accession variability is not determined by individual-based analysis, we have indeed observed the high degree of differentiation within this species according to UPGMA and PCoA analysis based on bulked DNA samples. Hence it is not surprising that the vast proportion of AFLP variation detected in E. tangutorum was present rather within than among groups, since the large-scale geographical group contained numerous accessions with distinctly genetic architecture. These results could be confirmed by that the totally average genetic distance (0.277) between all of accessions was higher than that (0.261) between accessions from the two main geographical groups (QTP and Xinjiang).
Correlation of genetic diversity and environmental variables
In virtue of the evolutionary forces, the decisive factor to influence the maintenance of genetic variability is the spatial variation of environment and the ecological difference between habitats . Remarkably, the Mantel test between genetic distance and geographical distance in all E. tangutorum accessions and most groups was weakly correlated (Table 5) and similar results were reported in previous studies of Elymus species [2, 57]. Within accessions from QTP group, we found lower r-values of geographical distance (r = 0.131) compared with the XJC group (r = 0.313), which was not significantly different from zero and might be related to the highly heterogeneous topography nature of the QTP collection. The complex topographic features and climate will work together to form varieties niches to influence the genetic variability . Thus, spatial distribution cannot be solely accounted for by a simple isolation-by-distance model and requires additional factors that influence the observed genetic population structure, such as environmental factors . In the Mantel test between genetic distance and environmental distance, we found a significantly positive correlation between genetic distance and average annual precipitation (r = 0.685; P = 0.01 and r = 0.594; P = 0.05) in the XJC group and Bayesian subgroup A. A possible explanation for this results is that in Xinjiang with arid and/or semi-arid temperate climate, the lower precipitation hinders seed germination and lead to decrease plants’ genetic diversity . In the diversity study of E. nutans germplasm from western China, the environmental divergence like elevation is also related to genetic distance (r = 0.695; P = 0.01) . On the other hand, the relationship between environmental factor and genetic distance is difficult to be fully resolved in complex environment . This could be responsible for the weak and non-significant correlation between genetic distance and environmental distance found in QTP group. Besides, although the accessions of GSC group had high Mantel r-value (Table 5) between genetic distance and environmental distance except the average annual temperature (r = 0.309), the correlation between the genetic diversity and environmental factor was still unraveled due to the small population size.
This study indicates that AFLP markers are a powerful tool to measure genetic diversity in E. tangutorum and geographical and environmental factors (especially precipitation) together plays a crucial role in genetic differentiation patterns. These findings highlight the importance of local adaptation in shaping patterns of genetic structure inferred in E. tangutorum accessions fromWestern China. Therefore, collecting and assessing E. tangutorum germplasm from major geographic regions and special ecogeographic environment such as Qinghai-Tibet Plateau, will help to expand the genetic base and sample the full extent of available variation.
Amplified fragment length polymorphism
Analysis of molecular variance
Hexadecyl trimethyl ammonium Bromid
Shannon information index
Inter-simple sequence repeat
Number of polymorphic loci
Polymerase chain reaction
Polymorphism information content
Principal coordinate analysis
Random Amplified Polymorphic DNA
Started codon targeted polymorphism
Sequence—related amplified polymorphism
Simple sequence repeat
Total number of amplified fragments
Unweighted pair-group method with arithmetic means
Dewey DR. The genomic system of classification as a guide to intergeneric hybridization with the perennial Triticeae. In: Gustafson JP, editor. Gene Manipulation in Plant Improvement. Boston: Springer; 1984. p. 209–79.
Díaz O, Sun GL, Salomon B, Bothmer RV. Levels and distribution of allozyme and RAPD variation in populations of Elymus fibrosus (Schrenk) Tzvel. (Poaceae). Genet Resour Crop Ev. 2000;47(1):11–24.
Kuo BZ, Wang JS, Li JH. Studies on morphological evolution, classification and distribution of the Triticeae in China. Acta Phytotaxon Sin. 1985;23(3):161–9.
Yang CR, Zhang HQ, Zhao FQ, Liu XY, Fan X, Sha LN, Kang HY, Wang Y, Zhou YH. Genome constitution of Elymus tangutorum (Poaceae: Triticeae) inferred from meiotic pairing behavior and genomic in situ hybridization. J Syst Evol. 2015;53(6):529–34.
Agafonov AV, Baum BR, Bailey LG. Differentiation in the Elymus dahuricus complex (Poaceae): evidence from grain proteins, DNA, and crossability. Hereditas. 2001;135(2–3):277–89.
Wang YL. Study on cultivation and domestication of Elymus tangutorum. Chin Qinghai J Anim Vet Sci. 2012;3(2):1–2.
Zhang DY, Jiang XH. Mating system evolution, resource allocation, and genetic diversity in plants. Acta Phytotaxon Sin. 2001;25(2):130–43.
Berdahl JD, Mayland HF, Asay KH, Jefferson PG. Variation in agronomic and morphological traits among Russian wildrye accessions. Crop Sci. 1999;39(6):1890–5.
Guo ZH, Yang HY, Zhang CL, Sun M, Fu KX, Zhang XQ, Ma X. Identification of four Elymus sibiricus cultivar(strains)from northwest plateau of Sichuan using morphological traits and SSR markers. Pratacult Sci. 2016;33(9):1718–27.
Sun GL, Díaz O, Salomon BVon BR. Genetic diversity in Elymus caninus as revealed by isozyme, RAPD, and microsatellite markers. Genome. 1999;42(3):420.
Li Y, Li S, Li L, Yang X, Li X. Comparison of genetic diversity of twelvel Elymus species using ISSR and SSR markers. Scientia Agri Sin. 2005;38(08):1522–7.
Chen ZH, Miao JM, Zhong JC, Ma X, Chen SY, Zhang XQ. Genetic diversity of wild Elymus nutans germplasm detected by SRAP markers. Acta Pratacul Sin. 2010;30(1):94–100.
Xie WG, Zhang JC, Zhao XH, Zhang JQ, Wang YR. Siberian wild rye ( Elymus sibiricus L.): Genetic diversity of germplasm determined using DNA fingerprinting and SCoT markers. Biochem Syst Ecol. 2015;60:186–92.
Sasanuma T, Endo TR, Ban T. Genetic diversity of three Elymus species indigenous to Japan and East Asia (E. Tsukushiensis, E. Humidus and E. Dahuricus) detected by AFLP. Genes Genet Syst. 2002;77(6):429–38.
Szczepaniak M, Cieślak EBednarek PT. Morphological and AFLP variation of Elymus repens (L.) Gould (Poaceae). Cell Mol Biol Lett. 2002;7(2A):547–58.
Szczepaniak M, Cieślak E. Assessment of genetic variation in Elymus repens and E. hispidus using AFLP markers and its implications for intraspecific taxonomy. In: Frey L, editor. Problems of grass biology. Kraków: W. Szafer Institute of Botany, Polish Academy of Sciences; 2003. p. 271–86.
Zhang HX, Zhang ML. Genetic structure of the Delphinium naviculare species group tracks Pleistocene climatic oscillations in the Tianshan Mountains, arid Central Asia. Palaeogeogr Palaeocl. 2012;353-355(5):93–103.
Wu ZG, Yu D, Li X, Xu XW. Influence of geography and environment on patterns of genetic differentiation in a widespread submerged macrophyte, Eurasian watermilfoil (Myriophyllum spicatum L., Haloragaceae). Ecol Evol. 2016;6(2):460–8.
Ozbek O, Millet E, Anikster Y, Arslan OFeldman M. Spatio-temporal genetic variation in populations of wild emmer wheat, Triticum turgidum ssp. dicoccoides , as revealed by AFLP analysis. Theor Appl Genet. 2007;115(1):19–26.
Dequigiovanni G, Ramos SLF, Alves-Pereira A, Fabri EG, Picanço-Rodrigues D, Clement CR, Gepts PVeasey EA. Highly structured genetic diversity of Bixa orellana var. urucurana, the wild ancestor of annatto, in Brazilian Amazonia. Plos One. 2018;13(6):e0198593.
Hübner S, HÖfken M, Oren E, Haseneyer G, Stein N, Graner A, Schmid K, Fridman E. Strong correlation of wild barley (Hordeum spontaneum) population structure with temperature and precipitation variation. Mol Ecol. 2009;18(7):1523–36.
Nevo E. Evolution of genome-phenome diversity under environmental stress. Proc Natl Acad Sci U S A. 2001;98(11):6233–40.
Castillo A, Dorado G, Feuillet C, Sourdille PHernandez P. Genetic structure and ecogeographical adaptation in wild barley ( Hordeum chilense Roemer et Schultes) as revealed by microsatellite markers. BMC Plant Biol. 2010;10(1):266.
Shaaf S, Sharma R, Kilian B, Walther A, Özkan H, Karami EMohammadi B. Genetic structure and eco-geographical adaptation of garlic landraces ( Allium sativum L.) in Iran. Genet Resour Crop Ev. 2014;61(8):1565–80.
Del SG, Manfioletti GSchneider C. The CTAB-DNA precipitation method: a common mini-scale preparation of template DNA from phagemids, phages or plasmids suitable for sequencing. BioTechniques. 1989;7(5):514–20.
Janssen P, Coopman R, Huys G, Swings J, Bleeker M, Vos P, Zabeau MKersters K. Evaluation of the DNA fingerprinting method AFLP as an new tool in bacterial taxonomy. Microbiology. 1996;142(7):1881.
Vos P, Hogers R, Bleeker M, Reijans M, Lee TVD, Hornes M, Friters A, Pot J, Paleman JKuiper M. AFLP: a new technique for DNA fingerprinting. Nucleic Acids Res. 1995;23(21):4407.
Roldanruiz I, Dendauw J, Evan B, Depicker AMde L. AFLP markers reveal high polymorphic rates in ryegrasses (Lolium spp.). Mol Breeding. 2000;6(2):125–34.
Powell W, Morgante M, Andre C, Hanafey M, Vogel J, Tingey SRafalski A. The comparison of RFLP, RAPD, AFLP and SSR (microsatellite) markers for germplasm analysis. Mol Breeding. 1996;2(3):225–38.
Ramakishana W, Lagu MD, Gupta VS, Ranjekar PK. DNA fingerprinting in rice using oligonucleotide probes specific for simple repetitive DNA sequences. Theor Appl Genet. 1994;88(3–4):402–6.
Hampl V, Pavlícek A, Flegr J. Construction and bootstrap analysis of DNA fingerprinting-based phylogenetic trees with the freeware program FreeTree: application to trichomonad parasites. Int J Syst Evol Microbiol. 2001;51(Pt 3):731.
Forsberg NEG, Russell J, Macaulay M, Leino MW, Hagenblad J. Farmers without borders genetic structuring in century old barley (Hordeum vulgare). Heredity. 2015;114(2):195.
Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol. 2005;14(8):2611–20.
Jakobsson M, Rosenberg NA. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics. 2007;23(14):1801–6.
Rosenberg NA. Distruct: a program for the graphical display of population structure. Mol. Ecol. 2004;4(1):137–8.
Chibiao C, Eric D, Florence F, Olivier F. Bayesian clustering algorithms ascertaining spatial population structure: a new computer program and a comparison study. Mol Ecol Notes. 2007;7(5):747–56.
Safner T, Miller MP, Mcrae BH, Fortin MJ, Manel S. Comparison of Bayesian clustering and edge detection methods for inferring boundaries in landscape genetics. Int J Mol Sci. 2011;12(2):865–89.
Holderegger P, Taberlet O. François. Forecasting changes in population genetic structure of alpine plants in response to global warming. Mol Ecol. 2012;21(10):2354–68.
Peakall R, Smouse PE. GenAlEx 6.5: genetic analysis in excel. Population genetic software for teaching and research—an update. Bioinformatics. 2012;28(28):2537–9.
Bohonak AJ. IBD (isolation by distance): a program for analyses of isolation by distance. J Hered. 2002;93(2):153–4.
Chen SY, Ma X, Zhang XQ, Chen ZH. Genetic variation and geographical divergence in Elymus nutans Griseb. (Poaceae: Triticeae) from West China. Biochem Syst Ecol. 2009;37(6):716–22.
Tarczyk B. Genetic diversity among Polish landraces of common oat (Avena sativa L.). Genet Resour Crop Ev. 2013;60(7):2157–69.
Blanco EZ, Bajay MM, Siqueira MV, Zucchi MI, Pinheiro JB. Genetic diversity and structure of Brazilian ginger germplasm (Zingiber officinale) revealed by AFLP markers. Genetica. 2016;144(6):1–12.
Fernández M, Figueiras A, Benito C. The use of ISSR and RAPD markers for detecting DNA polymorphism, genotype identification and genetic diversity among barley cultivars with known origin. Theor Appl Genet. 2002;104(5):845–51.
Varshney RK, Chabane K, Hendre PS, Aggarwal RK, Graner A. Comparative assessment of EST-SSR, EST-SNP and AFLP markers for evaluation of genetic diversity and conservation of genetic resources using wild, cultivated and elite barleys. Plant Sci. 2007;173(6):638–49.
Laurentin H, Karlovsky P. AFLP fingerprinting of sesame (Sesamum indicum L.) cultivars: identification, genetic relationship and comparison of AFLP informativeness parameters. Genet Resour Crop Ev. 2007;54(7):1437–46.
Sun M, Zhang CL, Zhang XQ, Fan Y, Fu KX, Wu WD, Bai SQ, Zhang JB, Peng Y, Huang LK. AFLP assessment of genetic variability and relationships in an Asian wild germplasm collection of Dactylis glomerata L. C R Biol. 2017;340(3):145–55.
Mian MAR, Zwonitzer JC, Chen Y, Saha MC, Hopkins AA. AFLP diversity within and among Hardinggrass populations. Crop Sci. 2005;45(6):2591–7.
Ghehsareh MG, Salehi H, Khosh-Khui M, Niazi A. Application of ISSR markers to analyze molecular relationships in Iranian jasmine ( Jasminum spp.) accessions. Mol Biotechnol. 2015;57(1):65–74.
Gilbert JE, Lewis RV, Wilkinson MJ, Caligari PDS. Developing an appropriate strategy to assess genetic variability in plant germplasm collections. Theor Appl Genet. 1999;98(6–7):1125–31.
Xie WG, Zhao XH, Zhang JQ, Wang YR, Liu WX. Assessment of genetic diversity of Siberian wild rye ( Elymus sibiricus L.) germplasms with variation of seed shattering and implication for future genetic improvement. Biochem Syst Ecol. 2015;58:211–8.
Maul J, Mirsky S, Emche S, Devine T. Evaluating a germplasm collection of the cover crop hairy vetch for use in sustainable farming systems. Crop Sci. 2011;51(6):2615.
Zhang JC, Xie WG, Wang YR, Zhao XH. Potential of start codon targeted (SCoT) markers to estimate genetic diversity and relationships among Chinese Elymus sibiricus accessions. Molecules. 2015;20(4):5987–6001.
Zhou Q, Luo D, Ma L, Xie W, Wang Y, Wang YLiu Z. Development and cross-species transferability of EST-SSR markers in Siberian wildrye (Elymus sibiricus L.) using Illumina sequencing. Sci Rep. 2016;6:20549.
Benor S, Hammer KBlattner FR. Genetic diversity and relationships in Corchorus olitorius (Malvaceae s.L.) inferred from molecular and morphological data. Genet Resour Crop Ev. 2012;59(6):1125–46.
Chen SY, Zhang XQ, Ma X, Huang LK. Assessment of genetic diversity and differentiation of Elymus nutans indigenous to Qinghai-Tibet plateau using simple sequence repeats markers. Can J Plant Sci. 2013;93(6):1089–96.
Ma X, Zhang XQ, Zhou YH, Bai SQ, Liu W. Assessing genetic diversity of Elymus sibiricus (Poaceae: Triticeae) populations from Qinghai-Tibet plateau by ISSR markers. Biochem Syst Ecol. 2008;36(7):514–22.
Nybom H. Comparison of different nuclear DNA markers for estimating intraspecific genetic diversity in plants. Mol Ecol. 2004;13(5):1143–55.
Greene SL, Gritsenko M, Vandemark G. Relating morphologic and RAPD marker Varlation to collection site environment in wild populations of red clover (Trifolium Pratense L.). Genet Resour Crop Ev. 2004;51(6):643–53.
Wang YL. Effect of row spacing and seeding rate on the forage and seed yield of Elymus tagutorum. Chin Qinghai J Anim Vet Sci. 2015;45(3):7–9.
Loveless MD, Hamrick JL. Ecological determinants of genetic structure in plant populations. Annu Rev Ecol Syst. 1984;15(1):65–95.
Hamrick JL, Godt MJW. Effects of life history traits on genetic diversity in plant species. Phil. Trans. Royal Soc. of London, Ser. B. 1996;351(1345):1291–8.
Reed BM, Engelmann F, Dulloo ME, Engels JMM. Technical guidelines for the management of field and in vitro germplasm collections. In: IPGRI Handbooks for Genebanks No. 7. Rome: Bioversity International; 2004. p. 95–6.
Apelbaumelkaher I. Natural selection causes microscale allozyme diversity in wild barley and a lichen at 'Evolution Canyon', Mt. Carmel, Israel. Heredity. 1997;78(4):373–82.
This study was funded by the Key Laboratory of Superior Forage Germplasm in the Qinghai-Tibetan plateau (2017-ZJ-Y12) and the earmarked fund for Modern Agro-industry Technology Research System (No.CARS-34). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Availability of data and materials
The datasets generated during the current study are available from the corresponding author on reasonable request.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Table S1. Adaptor and primer sequences used for AFLP analysis. (DOCX 17 kb)
Figure S1. Bayesian clustering results for 27 accessions of Elymus tangutorum using the program TESS from K = 2 to K = 5. (TIF 686 kb)
Figure S2. Maps representing the mean membership proportions for K clusters of Elymus tangutorum. (TIF 80 kb)
Table S2. Different genetic diversity estimates for geographical groups and Bayesian subgroups of E. tangutorum based on AFLP results. (DOCX 18 kb)
About this article
Cite this article
Wu, W., Liu, W., Sun, M. et al. Genetic diversity and structure of Elymus tangutorum accessions from western China as unraveled by AFLP markers. Hereditas 156, 8 (2019). https://doi.org/10.1186/s41065-019-0082-z
- Elymus tangutorum
- Genetic diversity
- Geographic groups
- Eco-environmental factors