Gen2: Building a Reviewer-Defensible Benchmark for Binding Hypothesis Triage in Cryptic Pocket Discovery

shakeel Hoosdally

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Benchmarking in cryptic-pocket and allosteric discovery is often weakened by forcing heterogeneous case studies into pooled scoring despite ambiguous labels, unstable site assignment, missing row-level outputs, or mismatched evidential standards. Here, we present Gen2 as a governance-first benchmark framework for binding hypothesis triage in cryptic pocket discovery. Rather than treating benchmark assembly as a secondary administrative step, Gen2 treats it as part of the scientific method: each candidate slice is screened against frozen evidential rules, assigned a bounded role, and either admitted, parked, excluded, or retained as calibration or falsification material before pooled evaluation is considered. Applying this framework to the current panel produced a preserved no-active-slice-open checkpoint. Under these rules, HIF-2α remained policy-closed, TP53 Y220C remained calibration-only, CK2 was retained as falsification material, and KRAS G12D and PTP1B remained non-row-ready for different reasons. The principal result is therefore not pooled benchmark performance, but demonstration that Gen2 prevents invalid pooled claims by blocking premature scoring and preserving only reviewer-defensible evaluable units. This establishes a reproducible benchmark-construction layer for future multi-slice evaluation once row-ready systems and explicit row mappings are available.

Version published to 10.21203/rs.3.rs-9336311/v1 on Research Square
Apr 9, 2026

Gen2-Allostery: A Replicate-Aware Framework for Evaluating Allosteric Binding-Site Hypotheses

This article has 1 author:
1. shakeel Hoosdally
This article has no evaluationsLatest version Apr 15, 2026
Three Classes of Confound in Gene-Regulatory-Network Inference: A Systematic Audit and Open-Source Diagnostic Toolkit

This article has 1 author:
1. Ihor Kendiukhov
This article has no evaluationsLatest version Mar 26, 2026
Preserve, don't prune: why strategic dormancy is an architectural necessity for clinical AI governance

This article has 1 author:
1. Florian Odi Stummer
This article has no evaluationsLatest version Apr 7, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Gen2-Allostery: A Replicate-Aware Framework for Evaluating Allosteric Binding-Site Hypotheses

Three Classes of Confound in Gene-Regulatory-Network Inference: A Systematic Audit and Open-Source Diagnostic Toolkit

Preserve, don't prune: why strategic dormancy is an architectural necessity for clinical AI governance