Zero-Shot, Big-Shot, Active-Shot - How to estimate cell confluence, lazily
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Mesenchymal stem cell therapy shows promising results for difficult-to-treat diseases, but standardized manufacturing requires robust quality control through automated cell confluence monitoring. While deep learning can automate confluence estimation, research on cost-effective dataset curation and the role of foundation models in this task remains limited. We systematically investigate the most effective strategies for confluence estimation, focusing on active learning-based dataset curation, goal-specific labeling, and leveraging foundation models for zero-shot inference. Here, we show that zero-shot inference with the Segment Anything Model (SAM) achieves excellent confluence estimation without any task-specific training, outperforming fine-tuned smaller models. Further, our findings demonstrate that active learning does not significantly improve model dataset curation compared to random selection in homogeneous cell datasets. We show that goal-specific, simplified labeling strategies perform comparably to precise annotations while substantially reducing annotation effort. These results challenge common assumptions about dataset curation: neither active learning nor extensive fine-tuning provided significant benefits for our specific use case. Instead, we found that leveraging SAM’s zero-shot capabilities and targeted labeling strategies offers the most cost-effective approach to automated confluence estimation. Our work provides practical guidelines for implementing automated cell monitoring in MSC manufacturing, demonstrating that extensive dataset curation may be unnecessary when foundation models can effectively handle the task out of the box.