The PanOryza pangene catalog of Asian cultivated rice

Genome Research

2025-12 | Journal article

Abstract

The rice genome underpins fundamental research and breeding, but the Nipponbare (japonica) reference does not fully encompass the genetic diversity of Asian rice. To address this gap, the Rice Population Reference Panel (RPRP) was developed, comprising high-quality assemblies of 16 rice cultivars to represent the japonicaindicaaus, and aromatic varietal groups. The RPRP has been consistently annotated and supported by extensive experimental data, and here, we report the computational assignment, characterization, and dissemination of stably identified pangenes, collectively called the PanOryza data set. We identify 25,178 core pangenes shared across all cultivars, alongside cultivar-specific and family-enriched genes. Core genes exhibit higher gene expression and proteomic evidence, higher confidence protein domains, and AlphaFold structures, whereas cultivar-specific genes are enriched for domains under selective breeding pressure, such as for disease resistance. We identify more than 5000 genes absent in the IRGSP rice reference genome and present in at least two other Oryzacultivars. We demonstrate the utility of this resource through various examples of pangenes and their protein domains. This resource, integrated into public databases, enables researchers to explore genetic and functional diversity via a population-aware “reference guide” across rice genomes, advancing both basic and applied research.

Cite this article

Contreras-Moreira B#, Sharma E#, Saraf S#, Naamati G, Gupta P, Elser J, Chebotarov D, Chougule K, Lu Z, Wei S, Olson A, Tsang I, Lodha D, Zhou Y, Yu Z, Zhao W, Zhang J, Amberkar S, Sue-Ob K, Sun Z, Martin M, McNally K, Ware D, Deutsch E, Copetti D, Wing R, Jaiswal P, Dyer S, and Jones A*. The PanOryza pangene catalog of Asian cultivated rice. Genome Research, 2025, in press. DOI: 10.1101/gr.280790.125