Decoding Rice Seed Storage Proteins: From Gene Identification to Structural Prediction
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Rice seed storage proteins (SSPs) serve as the primary source of dietary protein, energy, and nutrition; however, our understanding of these proteins remains limited, hindering efforts aimed at enhancing grain nutritional quality. To address this, we conducted a comprehensive genome-wide analysis of rice SSPs, encompassing phylogenetic relationships, domain and motif characterization, promoter cis -element mapping, expression profiling, and 3D structural modeling. Through homology- and domain-based searches, we identified 65 SSP-encoding genes, including 19 previously uncharacterized members. Phylogenetic analysis and domain comparisons revealed evolutionary proximity between albumins and prolamins; and between globulins and glutelins. Albumins, glutelins and prolamins clustered tandemly implying that duplication has driven SSP expansion. Transcriptomic evidence indicated that albumins, globulins, and glutelins were transcriptionally active from S2 seed development stage, whereas prolamins initiated expression from S3 stage. Promoter analysis revealed several seed-specific cis -regulatory elements, with CAATBOX1, EBOXBNNAPA, and DOFCOREZM being the most prevalent. Structural modeling showed that albumins and prolamins predominantly adopted α-helical conformations, whereas globulins and glutelins were enriched in β-strands and coils. This integrated analysis offers valuable insights into the classification, evolution, regulation, and structure of rice SSPs, providing a foundational resource for advancing grain nutritional improvement strategies.