Functional diversity across families of bacterial metalloregulators: what can we learn about specificity from sequence similarity?

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

The exponential growth of sequence databases, driven by large-scale genome sequencing, has created a major challenge for the functional annotation of proteins, particularly within highly divergent families such as bacterial metal-responsive transcription factors. In these systems, low sequence identity places many proteins within the so-called “twilight zone” of the proteome, where homology inference and functional assignment become unreliable. Here, we integrate sequence similarity networks (SSNs) with structural approaches to explore the functional diversity of twelve metalloregulatory families. Using SSNs, we partition each family into putative isofunctional clusters and map available experimental annotations onto network topology, revealing substantial heterogeneity in both sequence diversity and functional characterization across families. While some families, such as ArsR and CsoR, display relatively well-defined functional landscapes, others, including LysR, TetR, and GntR, remain largely unexplored despite their large sequence space. Structural comparisons further show that, despite extensive sequence divergence, conserved architectural features underpin DNA recognition and regulatory mechanisms across families. Focusing on the MerR and Fur families, we identify conserved residues associated with inducer binding and DNA recognition, and uncover distinct functional subgroups, including metal-specific sensors and regulators with alternative signaling mechanisms. Finally, we demonstrate that cluster-derived HMM profiles enable sensitive detection of candidate regulators in non-model genomes, revealing lineage-specific expansions and diversification of metal-sensing repertoires. Together, our results provide a framework for mapping functional diversity in highly divergent protein families and highlight the potential of combining SSNs and structural information to guide the discovery of novel transcriptional sensors.

Article activity feed