Ensuring Transparency and Trust in Supervised Machine Learning Studies: A Checklist for Psychological Researchers
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Machine learning (ML) algorithms are being rapidly incorporated into the work of psychologists, given their capability and flexibility in analyzing large-scale, complex, or otherwise messy datasets. In this context, and in the spirit of open science, ML research should be conducted in a transparent, understandable, and ethical manner. However, publications by psychology researchers and practitioners show a troubling lack of consistency in reporting ML information. Given that ML offers a wide range of analytical options, this article addresses an important need by providing a comprehensive, open-science checklist that specifies the information researchers should disclose at each stage of a supervised ML project—from data collection and preprocessing to model selection, evaluation, interpretation, and code sharing. We hope that psychological researchers will benefit from this checklist when reporting ML results and will adapt and extend this checklist further in the future.