GROQ-seq Datasets Across Transcription Factors (LacI, RamR, VanR), T7 RNA Polymerase and TEV Protease

Aviv Spinner
Shwetha Sreenivasan
James R. McLellan
Svetlana Ikonomova
Dana Cortade
Simon d’Oelsnitz
Kristen Sheldon
Olga Vasilyeva
Nina Y Alperovich
Anjali Chadha
Lily Nematollahi
Andi Dhroso
Zach Sisson
Corey M. Hudson
Erika DeBenedictis
Peter J. Kelly
Amanda Reider Apel
David Ross
Catherine Baranowski

Abstract

Predicting any protein’s function from its sequence alone would be a significant breakthrough in molecular biology. Although machine learning approaches have sought to tackle this, their limited generalizability reflects the absence of sufficiently large, open, diverse, and unified datasets. To address this data gap, we developed a high-throughput experimental platform called GROQ-seq (Growth-based Quantitative Sequencing). In GROQ-seq, a protein’s function can be linked to a sequencing-based readout that enables scalable characterization of large variant libraries in Escherichia coli. Here, we present pilot datasets demonstrating its performance across three distinct protein function classes: transcription factors, polymerases, and proteases. The objective of this report is to present the datasets and to provide users with a clear and transparent characterization of their properties, including both the strengths and limitations.

Version published to 10.64898/2026.04.15.718744 on bioRxiv
Apr 18, 2026

Functional Profiling of Thousands of Sequence-Diverse Protease Homologs with GROQ-seq

This article has 9 authors:
1. James R. McLellan
2. Svetlana Ikonomova
3. Shwetha Sreenivasan
4. Alan N. Amin
5. Catherine Baranowski
6. Amanda Reider Apel
7. Peter Kelly
8. David Ross
9. Aviv Spinner
This article has no evaluations. Latest version: May 5, 2026

GROQ-seq Enables Cross-site Reproducibility for High-Throughput Measurement of Protein Function

This article has 13 authors:
1. Aviv Spinner
2. David Ross
3. Dana Cortade
4. Svetlana Ikonomova
5. Catherine Baranowski
6. Andi Dhroso
7. Amanda Reider Apel
8. Kristen Sheldon
9. Courtney Duquette
10. Douglas Densmore
11. Peter J Kelly
12. Erika DeBenedictis
13. Corey M. Hudson
This article has no evaluations. Latest version: Apr 9, 2026

Systematic Validation of AlphaFold-Predicted Interactomes with LUCIA

This article has 6 authors:
1. Tianyu Zhang
2. Julian Kraft
3. Timothy K. Soh
4. Malte Kansy
5. Ingvar Jonsson
6. Jens B. Bosse
This article has no evaluations. Latest version: Apr 4, 2026
