Efficiency in Language Understanding and Generation: An Evaluation of Four Open-Source Large Language Models

Abstract

This study provides a comprehensive evaluation of the efficiency of Large Language Models (LLMs) across diverse language understanding and generation tasks. Through a systematic comparison of the open-source models GPT-Neo, BLOOM, FLAN-T5, and Mistral-7B, the research examines performance on widely recognized benchmarks such as GLUE, SuperGLUE, LAMBADA, and SQuAD. The findings reveal significant variation in accuracy, computational efficiency, scalability, and adaptability, underscoring the influence of model architecture and training paradigms on performance outcomes. The study identifies key factors contributing to each model's efficiency and offers insights into optimization strategies for enhancing applicability in real-world NLP applications. By highlighting the strengths and limitations of current LLMs, this research contributes to the ongoing development of more effective, efficient, and adaptable language models, paving the way for future advances in natural language processing.
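The benchmark comparison described in the abstract relies on standard accuracy metrics. For SQuAD-style reading comprehension, the conventional scores are exact match and token-level F1 over normalized answers; a minimal sketch of that scoring (the example answers here are illustrative, not taken from the study) might look like:

```python
import re
import string
from collections import Counter

def normalize(text: str) -> str:
    """SQuAD-style normalization: lowercase, drop punctuation,
    remove articles, and collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in set(string.punctuation))
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())

def exact_match(prediction: str, reference: str) -> float:
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))

def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between normalized prediction and reference."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    common = Counter(pred_tokens) & Counter(ref_tokens)
    overlap = sum(common.values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)

# Hypothetical model output vs. gold answer
print(exact_match("The Eiffel Tower", "Eiffel Tower"))       # 1.0 ("the" is dropped)
print(round(f1_score("Eiffel Tower in Paris", "Eiffel Tower"), 2))  # 0.67
```

Aggregating these per-example scores over a benchmark's evaluation split yields the accuracy figures that model comparisons of this kind typically report.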