Comparative Analysis of Prompt Strategies for LLMs: Single-Task vs. Multitasking Prompts
Abstract
This study examines the impact of prompt engineering on large language models (LLMs), focusing on a comparison between multitasking and single-task prompts. Specifically, we explore whether a single prompt that handles multiple tasks, such as Named Entity Recognition (NER), sentiment analysis, and JSON output formatting, can achieve efficiency and accuracy comparable to dedicated single-task prompts. The evaluation combines several performance metrics to provide a comprehensive analysis of output quality. Experiments were conducted with a selection of open-source LLMs, including Llama 3.1 8B, Qwen2 7B, Mistral 7B, Phi3 Medium, and Gemma2 9B. Results show that single-task prompts do not consistently outperform multitasking prompts, highlighting the significant influence of a model's training data and architecture on performance.
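To make the comparison concrete, the sketch below illustrates the two prompt strategies being contrasted: a single multitask prompt that bundles NER, sentiment analysis, and JSON formatting into one request, next to dedicated single-task prompts covering the same work in separate requests. The wording, example text, and variable names are hypothetical; the exact prompts used in the experiments are not reproduced here.

```python
# Illustrative prompt templates only; the exact wording used in the study's
# experiments is not reproduced here, so treat these strings as hypothetical.

REVIEW = "The new firmware update bricked my router after two days."

# One prompt asking the model to perform every task at once and return JSON.
MULTITASK_PROMPT = f"""You are an information extraction assistant.
For the text below, do all of the following:
1. Extract named entities and label each as PERSON, ORG, LOC, or PRODUCT.
2. Classify the overall sentiment as positive, negative, or neutral.
3. Return the result strictly as JSON with keys "entities" and "sentiment".

Text: {REVIEW}"""

# Dedicated single-task prompts covering the same tasks in separate calls.
SINGLE_TASK_PROMPTS = {
    "ner": f"Extract named entities (PERSON, ORG, LOC, PRODUCT) from this text:\n{REVIEW}",
    "sentiment": f"Classify the sentiment of this text as positive, negative, or neutral:\n{REVIEW}",
    "formatting": "Combine the entity list and sentiment label into a JSON object "
                  'with keys "entities" and "sentiment".',
}

if __name__ == "__main__":
    print("--- Multitask prompt ---")
    print(MULTITASK_PROMPT)
    for name, prompt in SINGLE_TASK_PROMPTS.items():
        print(f"\n--- Single-task prompt: {name} ---")
        print(prompt)
```

The practical trade-off is visible in the structure itself: the multitask variant needs only one model call per input, while the single-task variants require one call per task, which is the efficiency difference weighed against output accuracy in the comparison above.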