Beyond age gates: a brief behavioral validity screen as a risk-tiering layer for social AI companions

Pawel Szczesny

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Current safety protocols for social AI companions rely on binary age thresholds (e.g., "18+") or self-reported declarations of maturity. However, chronological age poorly predicts psychosocial capacity to navigate high-intimacy AI systems, and self-report is easily gamed by users seeking unrestricted access. Adolescence is characterized by significant heterogeneity in social cognition and regulatory capacity; a 16-year-old and an 18-year-old may possess comparable capabilities regarding impulse control, rejection tolerance, and boundary reasoning. This paper argues that high-intimacy AI affordances require a safety layer beyond age verification: a brief, validity-informed behavioral screening battery. This screening tiers access to high-risk features (e.g., erotic roleplay, exclusivity cues) based on vulnerability-relevant metrics, such as inhibitory control or mentalization. Performance validity testing (PVT) informed checks help detect invalid responding/gaming, while task performance provides probabilistic vulnerability signals. An operational framework for risk-tiering is presented, alongside evaluation metrics, privacy safeguards to prevent discriminatory deployment and example implementation as a Chrome browser extension.

Version published to 10.31235/osf.io/gq2ah_v1 on OSF Preprints
Feb 26, 2026

You Must Not Fool Yourself: Feynman, Neurodiversity, and Honest AI in Digital Mental Health

This article has 1 author:
1. David Ruttenberg
This article has no evaluationsLatest version Feb 28, 2026
Loneliness biases social engagement decisions: Evidence from value-based choice models

This article has 2 authors:
1. Błażej Misiak
2. Wiktor Grzegorzewicz
This article has no evaluationsLatest version Apr 3, 2026
Acceptance Is Not Enough: Toward a Psychology of Calibrated GenAI use

This article has 5 authors:
1. Nicolas Bassan
2. Louise Blart
3. Charles Ayoubi
4. Sandra Najem
5. Philippe Roussiere
This article has no evaluationsLatest version Mar 15, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

You Must Not Fool Yourself: Feynman, Neurodiversity, and Honest AI in Digital Mental Health

Loneliness biases social engagement decisions: Evidence from value-based choice models

Acceptance Is Not Enough: Toward a Psychology of Calibrated GenAI use