charisma: An R package to perform reproducible color characterization of digital images for biological studies
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Advances in digital imaging and software tools have provided increasingly accessible datasets and methods for analyzing color evolution. Despite the variety of computational packages available, most rely on color classification before running analyses. Previous methods limit the ability to analyze large-scale image databases, decreases the accuracy of downstream analyses, and is not always representative of biologically relevant color classes. Here, we present charisma, an R package designed to characterize the distribution of distinct color classes in images suitable for large-scale studies of biological organisms. Here, we demonstrate the utility of our package through an analysis of color evolution in a sample of diverse and charismatic birds, tanagers, in the subfamily Thraupinae. We show that charisma can quickly and accurately classify every pixel in an image and validate these results using pre-identified, canonical color swatches. We find that charisma color classifications are consistent with those made by color-pattern experts in the field. Applying charisma to tanager color evolution, we find that charisma outputs seamlessly integrate with downstream evolutionary analyses. Our results demonstrate that using charisma to manually curate and characterize colors in images provides a standardized, reliable, and reproducible framework for high-throughput color classification.