Uncensored AI in the Wild: Tracking Publicly Available and Locally Deployable LLMs
Abstract
Open-weight generative large language models (LLMs) can be freely downloaded and modified, yet little empirical evidence exists on how these models are systematically altered and redistributed. This study presents the first large-scale analysis of safety-modified open-weight LLMs, examining 8,608 model repositories scraped from Hugging Face to identify a growing population of uncensored models adapted to bypass alignment safeguards. Selected modified models are evaluated on unsafe prompts spanning election disinformation, criminal instruction, and regulatory evasion. The results demonstrate a complete safety inversion: while unmodified models complied with only 18.8% of unsafe requests, modified variants complied at a mean rate of 74.1%. Modification effectiveness was independent of model size, with smaller 14-billion-parameter variants sometimes matching or exceeding the compliance levels of 70-billion-parameter versions. The ecosystem is highly concentrated yet structurally decentralized: the top 5% of providers account for over 60% of downloads, and the top 20 providers for nearly 86%. Moreover, more than half of the identified models use GGUF packaging, which is optimized for consumer hardware, and 4-bit quantization methods proliferate widely, though full-precision and 16-bit models remain the most downloaded. These findings show how locally deployable, modified LLMs represent a paradigm shift for Internet safety governance, calling for new regulatory approaches suited to decentralized AI.
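For readers who want a sense of how such a repository census can be assembled, a minimal sketch follows, assuming the `huggingface_hub` Python client; the search keyword, sort order, and result limit are illustrative assumptions, not the study's actual crawl parameters.

```python
from huggingface_hub import HfApi

# Hypothetical discovery pass: enumerate public model repos whose names
# suggest safety modification. The keyword and limit are illustrative,
# not the study's actual query set.
api = HfApi()
candidates = api.list_models(search="uncensored", sort="downloads",
                             direction=-1, limit=500)

for model in candidates:
    provider = model.id.split("/")[0]  # provider namespace before the slash
    print(provider, model.id, model.downloads)
```

Reaching the study's 8,608 repositories would require a broader keyword set and repeated queries; the sketch only shows the shape of a single discovery pass.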
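The headline compliance figures reduce mechanically to a proportion over binary per-response labels. A toy scorer is sketched below; the refusal-marker heuristic is an assumption for illustration (published safety evaluations typically rely on human raters or classifier graders), not the paper's grading method.

```python
# Toy compliance scoring: a response counts as "compliant" if it does not
# open with a recognizable refusal. The marker list is an illustrative
# assumption, not the study's grading procedure.
REFUSAL_MARKERS = ("i can't", "i cannot", "i won't", "i'm sorry", "as an ai")

def is_compliant(response: str) -> bool:
    return not response.strip().lower().startswith(REFUSAL_MARKERS)

def compliance_rate(responses: list[str]) -> float:
    # Fraction of responses that comply, e.g. 0.188 (unmodified) vs a
    # mean of 0.741 (modified) in the study's results.
    return sum(is_compliant(r) for r in responses) / len(responses)
```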
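The concentration statistics are, in effect, cumulative download shares over per-provider totals. A sketch follows, assuming a mapping from provider name to total downloads (the data structure and function name are assumptions for illustration).

```python
import math

def top_share(downloads_by_provider: dict[str, int], fraction: float) -> float:
    """Share of total downloads held by the top `fraction` of providers."""
    totals = sorted(downloads_by_provider.values(), reverse=True)
    k = max(1, math.ceil(fraction * len(totals)))  # size of the top slice
    return sum(totals[:k]) / sum(totals)

# On the study's data, top_share(counts, 0.05) would land above 0.60,
# matching the reported concentration among the top 5% of providers.
```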
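Packaging and quantization can often be inferred from repository filenames: GGUF community files conventionally embed the scheme (e.g. `Q4_K_M` for 4-bit, `F16` for 16-bit) in the name. The pattern below is an assumption based on that naming convention, not the paper's actual classifier.

```python
import re

# Match a trailing quantization tag in a GGUF filename, such as
# "model-Q4_K_M.gguf" or "model.F16.gguf". Convention-based assumption.
QUANT_RE = re.compile(r"[.-](Q\d(?:_[A-Z0-9]+)*|F16|F32)\.gguf$", re.IGNORECASE)

def quant_level(filename: str) -> str | None:
    m = QUANT_RE.search(filename)
    return m.group(1).upper() if m else None
```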