“Let’s Argue Both Sides”: Argument Generation Can Force Small Models to Utilize Previously Inaccessible Reasoning Capabilities
Abstract
Large Language Models (LLMs), despite achieving state-of-the-art results on many evaluation tasks, struggle to maintain their performance when strict logical reasoning is required to infer a prediction correctly. In this work, we propose _Argument Generation_ as a method of forcing models to utilize their reasoning capabilities when other approaches, such as chain-of-thought reasoning, prove insufficient. Our method involves generating an argument for each possible inference result and then asking the end model to rank the generated arguments. We show that _Argument Generation_ can serve as an appropriate substitute for zero-shot prompting techniques without adding layers of complexity. Furthermore, we argue that knowledge-probing techniques such as chain-of-thought reasoning and _Argument Generation_ are only useful when further reasoning is required to infer a prediction, making them auxiliary to more common zero-shot approaches. Finally, we demonstrate that our approach yields larger gains in smaller language models, revealing a complex relationship between model size and prompting methods in foundation models.
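As a rough illustration of the two-stage procedure the abstract describes, the sketch below generates an argument for each candidate answer and then asks the model to rank them. The function names, prompt wording, and the `complete` helper are hypothetical placeholders, not the paper's actual implementation.

```python
# Minimal sketch of Argument Generation under the assumptions stated above.
# `complete` stands in for any LLM text-completion call and must be wired
# to a real API; the prompts are illustrative, not taken from the paper.

def complete(prompt: str) -> str:
    """Hypothetical placeholder for a language-model completion call."""
    raise NotImplementedError("Connect this to an LLM API of your choice.")

def argument_generation(question: str, options: list[str]) -> str:
    # Stage 1: generate an argument in favor of each possible answer.
    arguments = []
    for option in options:
        prompt = (
            f"Question: {question}\n"
            f"Write the strongest possible argument that the answer is: {option}"
        )
        arguments.append(complete(prompt))

    # Stage 2: ask the end model to rank the generated arguments.
    numbered = "\n\n".join(
        f"Argument {i + 1} (for '{opt}'):\n{arg}"
        for i, (opt, arg) in enumerate(zip(options, arguments))
    )
    ranking_prompt = (
        f"Question: {question}\n\n{numbered}\n\n"
        "Rank these arguments from most to least convincing, then state the "
        "number of the single most convincing argument."
    )
    verdict = complete(ranking_prompt)

    # Naive parse: treat the first in-range digit as the winning argument.
    for ch in verdict:
        if ch.isdigit() and 1 <= int(ch) <= len(options):
            return options[int(ch) - 1]
    return options[0]  # fall back to the first option if parsing fails
```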