Dynamic Feature Engineering Through Reinforcement and Prompt-Based Learning

Abstract

Feature engineering is a critical stage of the machine learning pipeline that strongly influences model performance, interpretability, and overall effectiveness. Feature selection and transformation are typically performed with filter, wrapper, and embedded methods, but these often depend on manual heuristics and domain expertise, and they scale poorly in high-dimensional, complex settings. Recent studies have explored automated alternatives based on large language models (LLMs) and reinforcement learning (RL) to address these limitations. This paper presents a thorough, critical review of state-of-the-art research on RL-based feature selection, RL-driven feature generation, and LLM-guided feature optimization. Three primary methodological paradigms are identified. In the first, feature selection is framed as a collaborative or guided decision-making problem solved with interactive and multi-agent reinforcement learning: agents are assigned to individual features and optimize long-term rewards derived from domain-specific importance, redundancy, or model accuracy. The second paradigm builds on Combinatorial Multi-Armed Bandits (CMAB), a computationally efficient alternative that enables scalable and effective feature selection with minimal learning overhead. In the third, LLMs are used either to derive effective reward functions or to generate novel features, drawing on reasoning-based prompts, external knowledge repositories, and prototype alignment. The paper also examines unresolved issues in bias management, computational overhead, and generalization to unfamiliar domains, as well as underexplored gaps, including the need for hybrid frameworks that combine the exploration efficiency of reinforcement learning with the semantic reasoning of large language models.
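To make the CMAB paradigm concrete, the sketch below is a minimal, hypothetical illustration rather than the method of any specific paper surveyed here. It assumes the common UCB-style semi-bandit setup: each feature is an arm, a fixed-size subset of features forms a "super-arm" scored by cheap cross-validated accuracy, and every arm in the played subset shares the observed reward. The dataset, model, and hyperparameters (k, rounds) are all illustrative assumptions.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# Toy data; every name and hyperparameter below is an illustrative assumption.
X, y = make_classification(n_samples=300, n_features=20, n_informative=5,
                           random_state=0)

n_features, k, rounds = X.shape[1], 5, 40
counts = np.zeros(n_features)   # how often each feature (arm) has been played
values = np.zeros(n_features)   # running mean reward per feature

for t in range(1, rounds + 1):
    # UCB score per arm; never-played arms get infinite priority (exploration).
    bonus = np.sqrt(2.0 * np.log(t) / np.maximum(counts, 1))
    ucb = np.where(counts > 0, values + bonus, np.inf)
    subset = np.argsort(-ucb)[:k]   # greedy oracle: top-k arms form the super-arm
    # Semi-bandit reward: cross-validated accuracy of the chosen feature subset.
    reward = cross_val_score(LogisticRegression(max_iter=500),
                             X[:, subset], y, cv=3).mean()
    counts[subset] += 1
    values[subset] += (reward - values[subset]) / counts[subset]  # running mean

print("selected features:", sorted(np.argsort(-values)[:k].tolist()))
```

The learning overhead here is only two length-n arrays of statistics, which is the scalability advantage the abstract attributes to the CMAB family relative to full multi-agent RL formulations.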
