The Illusion of Intelligence: Evaluating Large Language Models Against Grounded Criteria of Artificial General Intelligence



Abstract

As large language models (LLMs) become central to AI applications, their perceived intelligence often masks critical limitations. While LLMs demonstrate fluent language use and problem-solving, they fundamentally lack context-awareness, self-reflection, and the ability to act under constraints. This paper identifies a core issue: current LLMs produce seemingly intelligent outputs without possessing the internal mechanisms that constitute true intelligence. They fail to recognize or address their own limitations—such as hallucinations, inefficiency, and lack of common sense—and have not autonomously developed tools to enhance their performance. To address this gap, we propose a novel three-step benchmark for Artificial General Intelligence (AGI): Audit, Generate, Implement (AGI). This framework evaluates whether an AI system can autonomously assess its own failures, generate alternative strategies, and implement optimal solutions—all within fixed resource constraints. This approach reflects the way humans solve problems efficiently and adaptively, beyond mere pattern recognition. Our findings show that scaling models alone is insufficient for AGI. We emphasize that genuine intelligence requires meta-cognition, resource management, and tool creation—traits absent in current LLMs. This work offers a new direction and evaluative standard for future AI research, emphasizing cognitive depth over superficial linguistic performance.
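The abstract describes the Audit, Generate, Implement loop only conceptually. The following is a minimal, hypothetical sketch of how such a loop might be operationalized as a budgeted self-correction procedure; every name here (`Budget`, `audit`, `generate`, `implement`, `agi_loop`) is an illustrative assumption, not an API defined by the paper.

```python
# Hypothetical sketch of the proposed Audit-Generate-Implement (AGI) loop.
# The paper defines the framework conceptually; all names here are illustrative.
from dataclasses import dataclass


@dataclass
class Budget:
    """Fixed resource constraint: each loop iteration costs one unit."""
    units: int

    def spend(self, n: int = 1) -> bool:
        # Returns False once the budget is exhausted.
        if self.units < n:
            return False
        self.units -= n
        return True


def audit(answer: str, checks: dict) -> list:
    """Step 1 (Audit): self-assess by running diagnostic checks on the answer."""
    return [name for name, check in checks.items() if not check(answer)]


def generate(answer: str, failures: list, strategies: dict) -> list:
    """Step 2 (Generate): propose alternative candidates for each detected failure."""
    return [strategies[f](answer) for f in failures if f in strategies]


def implement(candidates: list, score) -> str:
    """Step 3 (Implement): commit to the best-scoring candidate."""
    return max(candidates, key=score)


def agi_loop(answer: str, checks: dict, strategies: dict, score, budget: Budget) -> str:
    """Iterate audit -> generate -> implement until the audit passes
    or the fixed budget runs out."""
    while budget.spend():
        failures = audit(answer, checks)
        if not failures:
            return answer  # audit passed within budget
        candidates = generate(answer, failures, strategies) or [answer]
        answer = implement(candidates, score)
    return answer  # budget exhausted: return best effort so far
```

A toy run on a string-normalization task (checks for lowercase text ending in a period) shows the key property the benchmark targets: the system must detect its own failures and repair them within a fixed budget, rather than relying on unbounded retries.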
