AXIS: A Lab-in-the-Loop Machine Learning approach for generalized detection of macromolecular crystals

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Macromolecular crystallography provides mechanistic understanding of biological processes and can be applied in drug design. Nowadays, the use of robotic systems for crystal growth and diffraction analysis is widespread and high throughput protein-to-structure pipelines for ligand and fragment screening are revolutionizing the field. However, the identification of crystals is still largely carried out through manual inspection, sometimes involving tens of thousands of images, which represents a bottleneck in an otherwise highly automated process. Here we describe AXIS , an A I-based C rystal I dentification S ystem combining the DINOv2 computer vision model, state-of-the-art transfer learning and MARCO, the largest crystallization dataset available to date, for automated crystal detection. AXIS can operate both with visible and UV light images and integrates a Lab-In-The-Loop approach combining ML and expert inputs for continuous learning and specialization. AXIS enables automated annotation of large crystallization image datasets with performance and accuracy comparable to that of human experts and the Lab-In-The-Loop approach introduced here enables efficient adaptation to local conditions facilitating widespread application, which has been a major limitation to date. AXIS can help correct human errors in image annotation and removes critical bottlenecks, particularly in the context of extensive crystallization screens or high throughput applications like fragment and ligand screening unlocking the potential for higher levels of automation that are key both in fundamental and translational research.

Appendix A.

Graphical Abstract

Article activity feed