Scientific Data
2020-04-07 | journal-article
Abstract
As the human population grows from 7.8 billion to 10 billion over the next 30 years, breeders must do everything possible to create crops that are highly productive and nutritious while leaving a smaller environmental footprint. Rice will play a critical role in meeting this demand, and thus knowledge of the full repertoire of genetic diversity that exists in germplasm banks across the globe is required. To this end, we describe the generation, validation and preliminary analyses of transposable element and long-range structural variation content of 12 near-gap-free reference genome sequences (RefSeqs) from representatives of 12 of 15 subpopulations of cultivated Asian rice. When combined with 4 existing RefSeqs that represent the 3 remaining rice subpopulations and the largest admixed population, this collection of 16 Platinum Standard RefSeqs (PSRefSeq) can be used as a template to map resequencing data to detect virtually all standing natural variation that exists in the pan-genome of cultivated Asian rice.
Cite this article
Zhou Y#, Chebotarov D# (#co-first authors), Kudrna D, Llaca V, Lee S, Rajasekar S, Mohammed N, Al-Bader N, Sobel-Sorenson C, Parakkal P, Arbelaez L, Franco N, Alexandrov N, Hamilton N, Leung H, Mauleon R, Lorieux M, Zuccolo A*, McNally K, Zhang J* and Wing R* (*corresponding authors). A platinum standard pan-genome resource that represents the population structure of Asian rice. Scientific Data, 2020, 7:113. DOI: 10.1038/s41597-020-0438-2