Most cited article - PubMed ID 25637298
To safeguard bread wheat against pests and diseases, breeders have introduced over 200 resistance genes into its genome, thus nearly doubling the number of designated resistance genes in the wheat gene pool [1]. Isolating these genes facilitates their fast-tracking in breeding programs and incorporation into polygene stacks for more durable resistance. We cloned the stem rust resistance gene Sr43, which was crossed into bread wheat from the wild grass Thinopyrum elongatum [2,3]. Sr43 encodes an active protein kinase fused to two domains of unknown function. The gene, which is unique to the Triticeae, appears to have arisen through a gene fusion event 6.7 to 11.6 million years ago. Transgenic expression of Sr43 in wheat conferred high levels of resistance to a wide range of isolates of the pathogen causing stem rust, highlighting the potential value of Sr43 in resistance breeding and engineering.
The wild relatives and progenitors of wheat have been widely used as sources of disease resistance (R) genes. Molecular identification and characterization of these R genes facilitates their manipulation and tracking in breeding programmes. Here, we develop a reference-quality genome assembly of the wild diploid wheat relative Aegilops sharonensis and use positional mapping, mutagenesis, RNA-Seq and transgenesis to identify the stem rust resistance gene Sr62, which has also been transferred to common wheat. This gene encodes a tandem kinase, homologues of which exist across multiple taxa in the plant kingdom. Stable Sr62 transgenic wheat lines show high levels of resistance against diverse isolates of the stem rust pathogen, highlighting the utility of Sr62 for deployment as part of a polygenic stack to maximize the durability of stem rust resistance.
Sequence assembly of large and repeat-rich plant genomes has been challenging, requiring substantial computational resources and often several complementary sequence assembly and genome mapping approaches. The recent development of fast and accurate long-read sequencing by circular consensus sequencing (CCS) on the PacBio platform may greatly increase the scope of plant pan-genome projects. Here, we compare current long-read sequencing platforms regarding their ability to rapidly generate contiguous sequence assemblies in pan-genome studies of barley (Hordeum vulgare). Most long-read assemblies are clearly superior to the current barley reference sequence based on short reads. Assemblies derived from accurate long reads excel in most metrics, but the CCS approach was the most cost-effective strategy for assembling tens of barley genomes. A downsampling analysis indicated that 20-fold CCS coverage can yield very good sequence assemblies, while even five-fold CCS data may capture the complete sequence of most genes. We present an updated reference genome assembly for barley with near-complete representation of the repeat-rich intergenic space. Long-read assembly can underpin the construction of accurate and complete sequences of multiple genomes of a species to build pan-genome infrastructures in Triticeae crops and their wild relatives.
- MeSH
- Molecular Sequence Annotation MeSH
- Genome, Plant MeSH
- Genomics methods MeSH
- DNA, Intergenic MeSH
- Hordeum genetics MeSH
- Terminal Repeat Sequences MeSH
- Retroelements MeSH
- Sequence Analysis, DNA MeSH
- Computational Biology methods MeSH
- High-Throughput Nucleotide Sequencing methods MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Substance names
- DNA, Intergenic MeSH
- Retroelements MeSH
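The downsampling analysis mentioned in the barley long-read abstract above (20-fold versus five-fold CCS coverage) can be illustrated with a minimal sketch. It assumes coverage is defined as total retained read bases divided by genome size and that reads are sampled uniformly at random. The function name `downsample_to_coverage`, the toy genome size and the read lengths are hypothetical and not taken from the study.

```python
import random


def downsample_to_coverage(read_lengths, genome_size, target_coverage, seed=0):
    """Randomly keep reads until retained bases reach target_coverage x genome_size.

    Returns the sorted indices of the retained reads and the achieved coverage.
    """
    rng = random.Random(seed)           # fixed seed keeps the sketch reproducible
    order = list(range(len(read_lengths)))
    rng.shuffle(order)                  # uniform random read order
    target_bases = target_coverage * genome_size
    kept, total = [], 0
    for idx in order:
        if total >= target_bases:       # stop once the target depth is reached
            break
        kept.append(idx)
        total += read_lengths[idx]
    return sorted(kept), total / genome_size


# Toy data (hypothetical): 2,000 reads of 15 kb on a 1 Mb genome = 30-fold coverage.
toy_lengths = [15_000] * 2_000
kept_20x, cov_20x = downsample_to_coverage(toy_lengths, 1_000_000, 20)
kept_5x, cov_5x = downsample_to_coverage(toy_lengths, 1_000_000, 5)
print(len(kept_20x), cov_20x, len(kept_5x), cov_5x)
```

Each downsampled read set would then be assembled separately and the assemblies compared. That step depends on the assembler and is outside this sketch.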
Wheat is one of the most important staple crops worldwide and also an excellent model species for crop evolution and polyploidization studies. The breakthrough of sequencing the bread wheat genome and progenitor genomes lays the foundation to decipher the complexity of wheat origin and evolutionary process as well as the genetic consequences of polyploidization. In this study, we sequenced 3286 BACs from chromosome 7DL of bread wheat cv. Chinese Spring and integrated the unmapped contigs from IWGSC v1 and available PacBio sequences to close gaps present in the 7DL assembly. In total, 8043 out of 12 825 gaps, representing 3 491 264 bp, were closed. We then used the improved assembly of 7DL to perform comparative genomic analysis of bread wheat (Ta7DL) and its D donor, Aegilops tauschii (At7DL), to identify domestication signatures. Results showed a strong syntenic relationship between Ta7DL and At7DL, although some small rearrangements were detected at the distal regions. A total of 53 genes appear to have been lost during wheat polyploidization, 23% of which (12 genes) are disease resistance gene analogues (RGAs). Furthermore, 86 positively selected genes (PSGs) were identified and are considered domestication-related candidates. Finally, the overlap between QTLs obtained from GWAS analysis and PSGs indicated that TraesCS7D02G321000 may be one of the domestication genes involved in grain morphology. This study provides comparative information on the sequence, structure and organization between bread wheat and Ae. tauschii from the perspective of the 7DL chromosome, which contributes to a better understanding of the evolution of wheat and supports wheat crop improvement.
- Keywords
- 7DL chromosome arm, BAC by BAC, domestication, gene loss, physical mapping, wheat
- MeSH
- Aegilops genetics MeSH
- Biological Evolution * MeSH
- Chromosomes, Plant genetics MeSH
- Genome, Plant * MeSH
- Quantitative Trait Loci MeSH
- Triticum genetics MeSH
- Comparative Genomic Hybridization MeSH
- Synteny MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
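The gap-closure and gene-loss figures in the 7DL abstract above can be checked with a few lines of arithmetic. This is only a worked calculation on the numbers the abstract reports. The variable names are illustrative, and the mean closed-gap size is a derived quantity that the abstract does not state.

```python
# Figures quoted in the 7DL abstract.
gaps_total = 12_825
gaps_closed = 8_043
closed_bp = 3_491_264
lost_genes = 53
lost_rga = 12

closed_fraction = gaps_closed / gaps_total       # share of gaps that were closed
mean_closed_gap_bp = closed_bp / gaps_closed     # derived: average size of a closed gap
rga_share = lost_rga / lost_genes                # share of lost genes that are RGAs

print(f"gaps closed: {closed_fraction:.1%}")
print(f"mean closed gap: {mean_closed_gap_bp:.0f} bp")
print(f"RGAs among lost genes: {rga_share:.1%}")
```

Roughly 63% of the gaps were closed, at an average of about 434 bp per closed gap, and 12 of 53 lost genes (22.6%) rounds to the 23% reported.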
BACKGROUND: Numerous scaffold-level sequences for wheat are now being released and, in this context, we report on a strategy for improving the overall assembly to a level comparable to that of the human genome. RESULTS: Using chromosome 7A of wheat as a model, sequence-finished megabase-scale sections of this chromosome were established by combining a new independent assembly using a bacterial artificial chromosome (BAC)-based physical map, BAC pool paired-end sequencing, chromosome-arm-specific mate-pair sequencing and Bionano optical mapping with the International Wheat Genome Sequencing Consortium RefSeq v1.0 sequence and its underlying raw data. The combined assembly results in 18 super-scaffolds across the chromosome. The value of finished genome regions is demonstrated for two approximately 2.5 Mb regions associated with yield and the grain quality phenotype of fructan carbohydrate grain levels. In addition, the 50 Mb centromere region analysis incorporates cytological data highlighting the importance of non-sequence data in the assembly of this complex genome region. CONCLUSIONS: Sufficient genome sequence information is shown to now be available for the wheat community to produce sequence-finished releases of each chromosome of the reference genome. The high-level completion identified that an array of seven fructosyl transferase genes underpins grain quality and that yield attributes are affected by five F-box-only-protein-ubiquitin ligase domain and four root-specific lipid transfer domain genes. The completed sequence also includes the centromere.
- Keywords
- Megabase-scale integration, Optical/physical maps, Grain quality, Wheat sequence finishing, Yield
- MeSH
- Centromere metabolism MeSH
- Chromosomes, Plant genetics MeSH
- Fructans analysis MeSH
- Physical Chromosome Mapping methods MeSH
- Genome, Plant * MeSH
- Optical Phenomena * MeSH
- Triticum genetics MeSH
- Seeds genetics MeSH
- Chromosomes, Artificial, Bacterial genetics MeSH
- Agriculture * MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Research Support, U.S. Gov't, Non-P.H.S. MeSH
- Substance names
- Fructans MeSH
BACKGROUND: The IWGSC strategy for construction of the reference sequence of the bread wheat genome is based on first obtaining physical maps of the individual chromosomes. Our aim is to develop and use the physical map for analysis of the organization of the short arm of wheat chromosome 5B (5BS) which bears a number of agronomically important genes, including genes conferring resistance to fungal diseases. RESULTS: A physical map of the 5BS arm (290 Mbp) was constructed using restriction fingerprinting and LTC software for contig assembly of 43,776 BAC clones. The resulting physical map covered ~ 99% of the 5BS chromosome arm (111 scaffolds, N50 = 3.078 Mb). SSR, ISBP and zipper markers were employed for anchoring the BAC clones, and from these 722 novel markers were developed based on previously obtained data from partial sequencing of 5BS. The markers were mapped using a set of Chinese Spring (CS) deletion lines, and F2 and RICL populations from a cross of CS and CS-5B dicoccoides. Three approaches have been used for anchoring BAC contigs on the 5BS chromosome, including clone-by-clone screening of BACs, GenomeZipper analysis, and comparison of BAC-fingerprints with in silico fingerprinting of 5B pseudomolecules of T. dicoccoides. These approaches allowed us to reach a high level of BAC contig anchoring: 96% of 5BS BAC contigs were located on 5BS. An interesting pattern was revealed in the distribution of contigs along the chromosome. Short contigs (200-999 kb) containing markers for the regions interrupted by tandem repeats were mainly localized to the 5BS subtelomeric block, whereas the distribution of larger 1000-3500 kb contigs along the chromosome better correlated with the distribution of the regions syntenic to rice, Brachypodium, and sorghum, as detected by the Zipper approach.
CONCLUSION: The high fingerprinting quality, LTC software and large number of BAC clones selected by the informative markers in screening of the 43,776 clones allowed us to significantly increase the BAC scaffold length when compared with the published physical maps for other wheat chromosomes. The genetic and bioinformatics resources developed in this study provide new possibilities for exploring chromosome organization and for breeding applications.
- Keywords
- Chromosome 5BS, Genetic markers, Hexaploid wheat, Physical mapping, Sequencing, Synteny, Triticum aestivum
- MeSH
- Bread * MeSH
- Chromosomes, Plant genetics MeSH
- Physical Chromosome Mapping * MeSH
- Polymerase Chain Reaction MeSH
- Triticum genetics MeSH
- Chromosomes, Artificial, Bacterial genetics MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
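The 5BS abstract above mentions comparing BAC fingerprints with in silico fingerprints of pseudomolecules. The general idea can be sketched as follows, under loudly stated assumptions. The abstract does not name the enzymes or the matching rule, so the single enzyme (EcoRI, recognition site GAATTC, cutting after the first base), the function names `in_silico_digest` and `shared_fragments`, and the greedy size matching are all illustrative choices, not the authors' pipeline.

```python
def in_silico_digest(sequence, site="GAATTC", cut_offset=1):
    """Return fragment lengths after cutting `sequence` at every occurrence of `site`.

    `cut_offset` is the cut position inside the site (1 = after the first base,
    as for EcoRI, G^AATTC). Only the given strand is scanned, which is enough
    for a palindromic site such as GAATTC.
    """
    seq = sequence.upper()
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = seq.find(site, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    return [end - start for start, end in zip(bounds, bounds[1:]) if end > start]


def shared_fragments(fingerprint_a, fingerprint_b, tolerance=0):
    """Count fragments in A that match an unused fragment in B within `tolerance` bp."""
    remaining = sorted(fingerprint_b)
    shared = 0
    for size in sorted(fingerprint_a):
        for i, other in enumerate(remaining):
            if abs(size - other) <= tolerance:
                shared += 1
                del remaining[i]        # each fragment in B may be used once
                break
    return shared


# Toy sequence (hypothetical) with two EcoRI sites.
toy = "AAA" + "GAATTC" + "CCCC" + "GAATTC" + "TT"
fingerprint = in_silico_digest(toy)
print(fingerprint, shared_fragments(fingerprint, [10, 7, 5]))
```

In practice a clone fingerprint would be compared against a sliding window of the pseudomolecule digest. The greedy matcher here is only meant to show what "shared fragments" means.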
Barley (Hordeum vulgare L.) is a cereal grass mainly used as animal fodder and raw material for the malting industry. The map-based reference genome sequence of barley cv. 'Morex' was constructed by the International Barley Genome Sequencing Consortium (IBSC) using hierarchical shotgun sequencing. Here, we report the experimental and computational procedures to (i) sequence and assemble more than 80,000 bacterial artificial chromosome (BAC) clones along the minimum tiling path of a genome-wide physical map, (ii) find and validate overlaps between adjacent BACs, (iii) construct 4,265 non-redundant sequence scaffolds representing clusters of overlapping BACs, and (iv) order and orient these BAC clusters along the seven barley chromosomes using positional information provided by dense genetic maps, an optical map and chromosome conformation capture sequencing (Hi-C). Integrative access to these sequence and mapping resources is provided by the barley genome explorer (BARLEX).
The capacity of the bread wheat (Triticum aestivum) genome to tolerate introgression from related genomes can be exploited for wheat improvement. A resistance to powdery mildew expressed by a derivative of a cross between bread wheat cv. Tähti and T. militinae (Tm) is known to be due to the incorporation of a Tm segment into the long arm of chromosome 4A. Here, a newly developed in silico method termed rearrangement identification and characterization (RICh) has been applied to characterize the introgression. A virtual gene order, assembled using the GenomeZipper approach, was obtained for the native copy of chromosome 4A; it incorporated 570 4A DArTseq markers to produce a zipper comprising 2132 loci. A comparison between the native and introgressed forms of the 4AL chromosome arm showed that the introgressed region is located at the distal part of the arm. The Tm segment, derived from chromosome 7G, harbours 131 homoeologs of the 357 genes present on the corresponding region of Chinese Spring 4AL. The estimated number of Tm genes transferred along with the disease resistance gene was 169. Characterizing the introgression's position, gene content and internal gene order should not only facilitate gene isolation, but may also be informative with respect to chromatin structure and behaviour studies.
- Keywords
- GenomeZipper, alien introgression, chromosome rearrangement, chromosome translocation, comparative analysis, linkage drag
- MeSH
- Ascomycota pathogenicity MeSH
- Bread MeSH
- Chromosomes, Plant genetics metabolism MeSH
- DNA, Plant genetics MeSH
- Genetic Markers MeSH
- Chromosome Mapping MeSH
- Microsatellite Repeats MeSH
- Plant Diseases genetics microbiology MeSH
- Disease Resistance MeSH
- Computer Simulation MeSH
- Triticum genetics microbiology MeSH
- Genes, Plant MeSH
- Base Sequence MeSH
- Translocation, Genetic MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Substance names
- DNA, Plant MeSH
- Genetic Markers MeSH
The hexaploid wheat genotype Chinese Spring (CS) has been used worldwide as the reference base for wheat genetics and genomics, and significant resources have been used by the international community to generate a reference wheat genome based on this genotype. By sequencing flow-sorted 3B chromosome from the hexaploid wheat genotype CRNIL1A and comparing the obtained sequences with those available for CS, we found that a large number of sequences in the former were missing in the latter. If the distribution of such sequences in the hexaploid wheat genome is random, CRNIL1A sequences missing in CS could be as much as 159.3 Mb even if only fragments of 50 bp or longer were considered. Analysing RNA sequences available in the public domain also revealed that dispensable genes are common in hexaploid wheat. Together with the extensive intra- and interchromosomal rearrangements in CS, the existence of such dispensable genes is another factor highlighting potential issues with the use of reference genomes in various studies. Strong deviation in distributions of these dispensable sequences among genotypes with different geographical origins provided the first evidence indicating that they could be associated with adaptation in wheat.
The assembly of a reference genome sequence of bread wheat is challenging due to its specific features such as the genome size of 17 Gbp, polyploid nature and prevalence of repetitive sequences. BAC-by-BAC sequencing based on chromosomal physical maps, adopted by the International Wheat Genome Sequencing Consortium as the key strategy, reduces problems caused by the genome complexity and polyploidy, but the repeat content still hampers the sequence assembly. Availability of a high-resolution genomic map to guide sequence scaffolding and validate physical map and sequence assemblies would be highly beneficial to obtaining an accurate and complete genome sequence. Here, we chose the short arm of chromosome 7D (7DS) as a model to demonstrate for the first time that it is possible to couple chromosome flow sorting with genome mapping in nanochannel arrays and create a de novo genome map of a wheat chromosome. We constructed a high-resolution chromosome map composed of 371 contigs with an N50 of 1.3 Mb. Long DNA molecules achieved by our approach facilitated chromosome-scale analysis of repetitive sequences and revealed a ~800-kb array of tandem repeats intractable to current DNA sequencing technologies. Anchoring 7DS sequence assemblies obtained by clone-by-clone sequencing to the 7DS genome map provided a valuable tool to improve the BAC-contig physical map and validate sequence assembly on a chromosome-arm scale. Our results indicate that creating genome maps for the whole wheat genome in a chromosome-by-chromosome manner is feasible and that they will be an affordable tool to support the production of improved pseudomolecules.
- Keywords
- chromosomes, flow sorting, optical mapping, physical map, sequencing, wheat
- MeSH
- Biotechnology methods MeSH
- Chromosomes, Plant genetics MeSH
- Genome, Plant * MeSH
- Chromosome Mapping methods MeSH
- Triticum genetics MeSH
- Sequence Analysis, DNA methods MeSH
- Tandem Repeat Sequences MeSH
- Chromosomes, Artificial, Bacterial MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
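Several abstracts in this list report contiguity as N50 (for example, 371 genome-map contigs with an N50 of 1.3 Mb for 7DS). As a reminder of how that statistic is usually computed, here is a minimal sketch using the common definition: the length of the shortest contig in the smallest set of longest contigs that together cover at least half of the total length. The function name `n50` and the toy contig lengths are illustrative and not data from the studies.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths."""
    ordered = sorted(lengths, reverse=True)   # longest first
    if not ordered:
        raise ValueError("n50() requires at least one length")
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:             # half of the assembly is now covered
            return length


# Toy contig lengths in kb (hypothetical).
toy_contigs_kb = [2, 2, 2, 3, 3, 4, 8, 8]
print(n50(toy_contigs_kb))
```

Because N50 depends only on the length distribution, it says nothing about correctness. This is why the abstracts above pair it with anchoring rates and independent maps.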