Quality-Aware Evaluation for Journal Recommendation
Abstract
Objective: Journal recommendation systems analyze a manuscript’s content and suggest potential target journals. Most such systems are evaluated with simple Top-K accuracy metrics: a recommendation is counted as correct only if the exact target journal is predicted, and every other recommendation is counted as an error. This approach ignores journal quality: recommending a different journal of similar quality is very different from recommending one far below the appropriate quality level. We developed a Quality-Aware evaluation framework designed to assess whether journal recommendation tools suggest journals of an appropriate quality level, rather than simply whether they predict the exact target journal.

Methods: We developed specialty-specific journal recommendation models for medical fields using transformer-based architectures trained on nearly one million PubMed articles (2020–2024; 2020–2025 for Cardiology). Using SCImago quartiles as a proxy for journal quality, we evaluated the models with both standard metrics and novel Quality-Aware metrics (Quality Accuracy, Undersell Rate, Severe Undersell Rate). We also examined the relationship between these metric families to characterize when recommendation lists can be considered reliable.

Results: Across five specialties, mean accuracy@1 was 47.9% (range: 39.9%–55.0%), with a Mean Reciprocal Rank (MRR) of 0.60 and a Normalized Discounted Cumulative Gain (NDCG) of 0.68. However, Quality Accuracy@1 averaged 67.7%, exceeding raw accuracy by approximately 20 percentage points. This gap indicates that more than a third of all prediction “errors” under standard metrics involved journals of equivalent quality. Furthermore, Quality Consistency@3 averaged 54.0%, showing that, on average, a majority of the top-3 recommended journals aligned with the target journal’s quality tier.
Severe Undersell Rates (recommendations two or more quartiles below the ground truth) averaged 7.9%, with the best-performing models achieving rates as low as 5.2%.

Conclusion: Standard evaluation metrics for journal recommendation are insufficient because they treat all errors as equivalent. In this study, Quality Accuracy@1 substantially exceeded accuracy@1, indicating that a large proportion of apparent “errors” involved journals of equivalent quality. Quality-Aware metrics, combined with ranking-quality measures, provide a more complete assessment of whether a system produces reliable recommendations. We propose that journal recommendation systems report Quality-Aware metrics alongside traditional accuracy to better characterize real-world utility.

MSC Codes: 68T10 (Pattern recognition); 62P10 (Applications of statistics to biology and medical sciences)
JEL Codes: O32 (Management of Technological Innovation and R&D)
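The Quality-Aware metrics described above can be sketched directly from SCImago quartiles. The snippet below is a minimal illustration, not the authors' implementation: it assumes quartiles are coded with the SCImago convention (1 = Q1, the top tier, through 4 = Q4), that each article contributes one top-1 recommendation, and the function name `quality_metrics_at_1` is hypothetical.

```python
def quality_metrics_at_1(pred_quartiles, true_quartiles):
    """Sketch of Quality Accuracy@1, Undersell Rate, and Severe Undersell Rate.

    pred_quartiles: quartile (1-4) of each top-1 recommended journal
    true_quartiles: quartile (1-4) of the corresponding ground-truth journal
    """
    n = len(true_quartiles)
    # Quality Accuracy: recommended journal sits in the same quartile as the target,
    # regardless of whether it is the exact target journal.
    quality_acc = sum(p == t for p, t in zip(pred_quartiles, true_quartiles)) / n
    # Undersell: recommended journal is in a lower tier (numerically higher quartile).
    undersell = sum(p > t for p, t in zip(pred_quartiles, true_quartiles)) / n
    # Severe Undersell: two or more quartiles below the ground-truth journal.
    severe = sum(p - t >= 2 for p, t in zip(pred_quartiles, true_quartiles)) / n
    return quality_acc, undersell, severe


# Example: four articles with predicted vs. true quartiles.
qa, ur, sur = quality_metrics_at_1([1, 2, 4, 1], [1, 1, 2, 3])
print(qa, ur, sur)  # 0.25 0.5 0.25
```

Under this coding, an exact-journal match always counts toward Quality Accuracy (same journal, same quartile), which is why Quality Accuracy@1 is an upper bound on accuracy@1.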