Diversity-driven biochemical survey reveals dimeric structural origin of rubisco
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Rubisco is the entry point of nearly all organic carbon into the biosphere and is present in all domains of life. Despite its global importance, biochemical studies of this enzyme superfamily have been limited to a relatively narrow set of subclades. Recent advances in metagenomics have dramatically reshaped our understanding of both microbial and rubisco diversity; however, biochemical characterization of these sequences has not kept pace with the exponential growth in sequence data. To better survey the functional and structural diversity of rubisco, we systematically sampled and synthesized a library of diverse rubisco sequences with an emphasis on clades that have previously not been characterized. Our updated phylogenetic analysis reveals that many deep‑branching rubiscos assemble as dimers, supporting a dimeric origin for the superfamily -- in contrast to the ecologically dominant hexadecameric form I. Additionally, we discover and structurally characterize the largest rubisco described to date, originating from a cryptic, early-branching subclade with novel structural folds that have previously not been observed in the rubisco superfamily. By integrating biochemical data with an updated phylogenetic framework, we propose a revised nomenclature for the rubisco protein family that reflects current insights and will better accommodate future discoveries.