Nejvíce citovaný článek - PubMed ID 25711827
New insights into the wheat chromosome 4D structure and virtual gene order, revealed by survey pyrosequencing
Wheat is one of the most important staple crops worldwide and also an excellent model species for crop evolution and polyploidization studies. The breakthrough of sequencing the bread wheat genome and progenitor genomes lays the foundation to decipher the complexity of wheat origin and evolutionary process as well as the genetic consequences of polyploidization. In this study, we sequenced 3286 BACs from chromosome 7DL of bread wheat cv. Chinese Spring and integrated the unmapped contigs from IWGSC v1 and available PacBio sequences to close gaps present in the 7DL assembly. In total, 8043 out of 12 825 gaps, representing 3 491 264 bp, were closed. We then used the improved assembly of 7DL to perform comparative genomic analysis of bread wheat (Ta7DL) and its D donor, Aegilops tauschii (At7DL), to identify domestication signatures. Results showed a strong syntenic relationship between Ta7DL and At7DL, although some small rearrangements were detected at the distal regions. A total of 53 genes appear to be lost genes during wheat polyploidization, with 23% (12 genes) as RGA (disease resistance gene analogue). Furthermore, 86 positively selected genes (PSGs) were identified, considered to be domestication-related candidates. Finally, overlapping of QTLs obtained from GWAS analysis and PSGs indicated that TraesCS7D02G321000 may be one of the domestication genes involved in grain morphology. This study provides comparative information on the sequence, structure and organization between bread wheat and Ae. tauschii from the perspective of the 7DL chromosome, which contribute to better understanding of the evolution of wheat, and supports wheat crop improvement.
- Klíčová slova
- 7DL chromosome arm, BAC by BAC, domestication, gene loss, physical mapping, wheat,
- MeSH
- Aegilops genetika MeSH
- biologická evoluce * MeSH
- chromozomy rostlin genetika MeSH
- genom rostlinný * MeSH
- lokus kvantitativního znaku MeSH
- pšenice genetika MeSH
- srovnávací genomová hybridizace MeSH
- syntenie MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
BACKGROUND: Haynaldia villosa (H. villosa) has been recognized as a species potentially useful for wheat improvement. The availability of its genomic sequences will boost its research and application. RESULTS: In this work, the short arm of H. villosa chromosome 4V (4VS) was sorted by flow cytometry and sequenced using Illumina platform. About 170.6 Mb assembled sequences were obtained. Further analysis showed that repetitive elements accounted for about 64.6% of 4VS, while the coding fraction, which is corresponding to 1977 annotated genes, represented 1.5% of the arm. The syntenic regions of the 4VS were searched and identified on wheat group 4 chromosomes 4AL, 4BS, 4DS, Brachypodium chromosomes 1 and 4, rice chromosomes 3 and 11, and sorghum chromosomes 1, 5 and 8. Based on genome-zipper analysis, a virtual gene order comprising 735 gene loci on 4VS genome was built by referring to the Brachypodium genome, which was relatively consistent with the scaffold order determined for Ae. tauschii chromosome 4D. The homologous alleles of several cloned genes on wheat group 4 chromosomes including Rht-1 gene were identified. CONCLUSIONS: The sequences provided valuable information for mapping and positional-cloning genes located on 4VS, such as the wheat yellow mosaic virus resistance gene Wss1. The work on 4VS provided detailed insights into the genome of H. villosa, and may also serve as a model for sequencing the remaining parts of H. villosa genome.
- Klíčová slova
- Chromosome arm 4VS, Flow sorting, Genome zipper, Haynaldia villosa, Scaffold,
- MeSH
- chromozomy rostlin genetika MeSH
- druhová specificita MeSH
- genomika MeSH
- lipnicovité genetika MeSH
- mapování chromozomů MeSH
- pořadí genů genetika MeSH
- repetitivní sekvence nukleových kyselin genetika MeSH
- sekvenční analýza DNA * MeSH
- Publikační typ
- časopisecké články MeSH
BACKGROUND: The number and complexity of repetitive elements varies between species, being in general most represented in those with larger genomes. Combining the flow-sorted chromosome arms approach to genome analysis with second generation DNA sequencing technologies provides a unique opportunity to study the repetitive portion of each chromosome, enabling comparisons among them. Additionally, different sequencing approaches may produce different depth of insight to repeatome content and structure. In this work we analyze and characterize the repetitive sequences of Triticum aestivum cv. Chinese Spring homeologous group 4 chromosome arms, obtained through Roche 454 and Illumina sequencing technologies, hereinafter marked by subscripts 454 and I, respectively. Repetitive sequences were identified with the RepeatMasker software using the interspersed repeat database mips-REdat_v9.0p. The input sequences consisted of our 4DS454 and 4DL454 scaffolds and 4ASI, 4ALI, 4BSI, 4BLI, 4DSI and 4DLI contigs, downloaded from the International Wheat Genome Sequencing Consortium (IWGSC). RESULTS: Repetitive sequences content varied from 55% to 63% for all chromosome arm assemblies except for 4DLI, in which the repeat content was 38%. Transposable elements, small RNA, satellites, simple repeats and low complexity sequences were analyzed. SSR frequency was found one per 24 to 27 kb for all chromosome assemblies except 4DLI, where it was three times higher. Dinucleotides and trinucleotides were the most abundant SSR repeat units. (GA)n/(TC)n was the most abundant SSR except for 4DLI where the most frequently identified SSR was (CCG/CGG)n. Retrotransposons followed by DNA transposons were the most highly represented sequence repeats, mainly composed of CACTA/En-Spm and Gypsy superfamilies, respectively. This whole chromosome sequence analysis allowed identification of three new LTR retrotransposon families belonging to the Copia superfamily, one belonging to the Gypsy superfamily and two TRIM retrotransposon families. Their physical distribution in wheat genome was analyzed by fluorescent in situ hybridization (FISH) and one of them, the Carmen retrotransposon, was found specific for centromeric regions of all wheat chromosomes. CONCLUSION: The presented work is the first deep report of wheat repetitive sequences analyzed at the chromosome arm level, revealing the first insight into the repeatome of T. aestivum chromosomes of homeologous group 4.
- MeSH
- chromozomy rostlin genetika MeSH
- DNA rostlinná analýza MeSH
- fyzikální mapování chromozomů MeSH
- pšenice genetika MeSH
- repetitivní sekvence nukleových kyselin * MeSH
- sekvenční analýza DNA metody MeSH
- Publikační typ
- časopisecké články MeSH
- práce podpořená grantem MeSH
- Názvy látek
- DNA rostlinná MeSH