Optimizing Image Data Management: A Workflow-Driven Approach for FAIR and Reusable High-Content Screening pipelines with OMERO
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
High-content screening (HCS) bioimaging is a powerful approach to studying biological processes, enabling the acquisition of large amounts of images from biological samples. However, it generates massive amounts of metadata, making HCS experiments a unique data management challenge. This data includes images, reagents, protocols, analytic outputs, and phenotypes, all of which must be stored, linked, and made accessible to users, scientists, collaborators, and the broader community to ensure sharable results. This study showcases different approaches using Workflow Management Systems (WMS) to create reusable semi-automatic workflows for HCS bioimaging data management, leveraging the image data management platform OMERO. The three developed workflows demonstrate the transition from a local file-based storage system to an automated and agile image data management framework. These workflows facilitate the management of large amounts of data, reduce the risk of human error, and improve the efficiency and effectiveness of image data management. We illustrate how applying WMS to HCS data management enables us to consistently transfer images across different locations in a structured and reproducible manner, reducing the risk of errors and increasing data consistency and reproducibility. Furthermore, we suggest future research direction, including developing new workflows and integrating machine learning algorithms for automated image analysis. This study provides a blueprint for developing efficient and effective image data management systems for HCS experiments and demonstrates how different WMS approaches can be applied to create reusable, semi-automated workflows for HCS bioimaging data management using OMERO.