Cognitive Software Architectures for Multimodal Perception and Human-AI Interaction
Abstract
This paper proposes a novel cognitive software architecture that enhances multimodal perception and human-AI interaction by integrating deep learning techniques with hierarchical processing frameworks. The architecture employs a multi-stage perception pipeline that processes visual, auditory, and tactile inputs through specialized neural networks before fusing them into a unified representation. Experimental results demonstrate that our approach achieves 27% higher accuracy in multimodal scene understanding compared to state-of-the-art unimodal systems and improves human-AI collaborative task completion rates by 34%. The architecture's modular design facilitates knowledge transfer across modalities while maintaining interpretability, a critical feature for building trustworthy AI systems. Our findings suggest that cognitive architectures with hierarchical multimodal integration can significantly enhance AI systems' ability to perceive, reason, and interact with humans in complex real-world environments.
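To make the described pipeline concrete, the following is a minimal sketch of per-modality encoders feeding a fused, unified representation. It is an illustrative assumption of the general pattern only: the module names, dimensions, and late-fusion strategy (concatenate, then project) are placeholders, not the paper's actual implementation.

```python
# Illustrative sketch of a hierarchical multimodal perception pipeline.
# All names, dimensions, and the fusion strategy are assumptions for
# demonstration; they do not reflect the paper's implementation.
import torch
import torch.nn as nn


class ModalityEncoder(nn.Module):
    """Specialized encoder mapping one modality's features to a shared space."""

    def __init__(self, input_dim: int, embed_dim: int = 128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(input_dim, 256),
            nn.ReLU(),
            nn.Linear(256, embed_dim),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x)


class MultimodalPerception(nn.Module):
    """Fuses visual, auditory, and tactile embeddings into one representation."""

    def __init__(self, dims: dict, embed_dim: int = 128):
        super().__init__()
        self.encoders = nn.ModuleDict(
            {name: ModalityEncoder(d, embed_dim) for name, d in dims.items()}
        )
        # Late fusion: concatenate per-modality embeddings, then project
        # down to a single unified representation.
        self.fusion = nn.Linear(embed_dim * len(dims), embed_dim)

    def forward(self, inputs: dict) -> torch.Tensor:
        # Iterate over the encoders (not the input dict) so the
        # concatenation order is fixed regardless of caller ordering.
        embeddings = [enc(inputs[name]) for name, enc in self.encoders.items()]
        return self.fusion(torch.cat(embeddings, dim=-1))


if __name__ == "__main__":
    # Hypothetical per-modality feature sizes for a batch of 4 observations.
    model = MultimodalPerception({"visual": 2048, "auditory": 512, "tactile": 64})
    batch = {
        "visual": torch.randn(4, 2048),
        "auditory": torch.randn(4, 512),
        "tactile": torch.randn(4, 64),
    }
    fused = model(batch)  # unified multimodal representation
    print(fused.shape)    # torch.Size([4, 128])
```

Keeping each modality behind its own encoder module mirrors the modular design the abstract emphasizes: an encoder can be retrained or swapped independently, and the shared embedding space is what enables knowledge transfer across modalities.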