Comparing Neural Architectures for English-Spanish Machine Translation: From LSTM to Transformer
Abstract
This paper presents a systematic comparison of neural architectures for English-to-Spanish machine translation. We implement and evaluate five model configurations, ranging from vanilla LSTM encoder-decoders to Transformer models with pretrained embeddings. Using the OPUS-100 corpus (1M training pairs) and the FLORES+ benchmark (2,009 test pairs), we evaluate translation quality with BLEU, chrF, and COMET. Our best Transformer model achieves a BLEU score of 20.26, closing approximately 60% of the performance gap between our strongest LSTM baseline (BLEU 10.66) and the state-of-the-art Helsinki-NLP model (BLEU 26.60). We analyze the impact of architectural choices, data scale, and pretrained embeddings on translation quality, providing insights into the trade-offs between model complexity and performance.
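As a concrete illustration of the evaluation setup summarized above, the sketch below scores a set of hypothesis translations with BLEU, chrF, and COMET. It assumes the sacrebleu and unbabel-comet packages and the Unbabel/wmt22-comet-da checkpoint; these are illustrative choices, not necessarily the exact configuration used in the paper.

# Minimal evaluation sketch (assumed tooling: sacrebleu, unbabel-comet).
import sacrebleu
from comet import download_model, load_from_checkpoint

def score_translations(sources, hypotheses, references):
    """Return corpus-level BLEU, chrF, and COMET scores for one test set."""
    # Corpus-level BLEU and chrF via sacrebleu; references are wrapped in a
    # list because sacrebleu supports multiple reference sets.
    bleu = sacrebleu.corpus_bleu(hypotheses, [references])
    chrf = sacrebleu.corpus_chrf(hypotheses, [references])

    # COMET needs source, hypothesis, and reference for each segment.
    # The checkpoint name is an assumption for illustration.
    comet_model = load_from_checkpoint(download_model("Unbabel/wmt22-comet-da"))
    comet_data = [
        {"src": s, "mt": h, "ref": r}
        for s, h, r in zip(sources, hypotheses, references)
    ]
    comet_output = comet_model.predict(comet_data, batch_size=32, gpus=1)

    return {
        "BLEU": bleu.score,
        "chrF": chrf.score,
        "COMET": comet_output.system_score,
    }

In practice the same function can be applied to each of the five model configurations on the FLORES+ test split, so that all systems are scored under identical conditions.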