Climate Research Domain BERTs: Pretraining, Adaptation, and Evaluation


Abstract

Motivated by the pressing issue of climate change and the growing volume of climate research data, we pretrain three new language models on climate change research papers published in top-tier journals. CliSciBERT and SciClimateBERT are obtained by adapting existing domain-specific models, while CliReBERT (Climate Research BERT) is pretrained from scratch. Performance is assessed on the climate change NLP benchmark ClimaBench. We evaluate SciBERT, ClimateBERT, BERT, RoBERTa, and DistilRoBERTa, along with our new models (CliReBERT, CliSciBERT, and SciClimateBERT), using five different random seeds on all seven ClimaBench datasets. CliReBERT achieves the highest overall performance with a macro-averaged F1 score of 65.45% and outperforms all other models on three of the seven tasks. Additionally, CliReBERT demonstrates the most stable fine-tuning behavior, yielding the lowest average standard deviation across seeds (0.0118). In 5-fold stratified cross-validation on the SciDCC dataset, CliReBERT achieved the highest overall macro-averaged F1 score (53.75%), slightly outperforming RoBERTa and DistilRoBERTa, while the domain-adapted models underperformed their base counterparts. The superior performance of CliReBERT is accompanied by the lowest tokenizer fertility, suggesting that its vocabulary is well suited to domain-specific terminology.
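Tokenizer fertility, mentioned above, is commonly defined as the average number of subword tokens produced per word: a value close to 1 means the vocabulary covers most domain terms whole, while higher values indicate frequent splitting. The sketch below illustrates the metric under that assumption; the `chunk_tokenize` function is a hypothetical stand-in for a real subword tokenizer such as WordPiece, not the tokenizer used in the paper.

```python
def tokenizer_fertility(texts, tokenize):
    """Fertility = total subword tokens / total whitespace-separated words.

    Lower fertility suggests the tokenizer's vocabulary matches the
    domain, since fewer words are fragmented into multiple pieces.
    """
    total_tokens = 0
    total_words = 0
    for text in texts:
        words = text.split()
        total_words += len(words)
        total_tokens += sum(len(tokenize(w)) for w in words)
    return total_tokens / total_words


# Hypothetical toy tokenizer: splits each word into 4-character chunks,
# standing in for a trained subword tokenizer (e.g. WordPiece or BPE).
def chunk_tokenize(word):
    return [word[i:i + 4] for i in range(0, len(word), 4)]


corpus = ["anthropogenic forcing alters precipitation"]
# 4 words produce 4 + 2 + 2 + 4 = 12 chunks, so fertility is 3.0
print(tokenizer_fertility(corpus, chunk_tokenize))  # → 3.0
```

With a real tokenizer (e.g. one loaded via Hugging Face's `AutoTokenizer`), the same function applies by passing its `tokenize` method; comparing fertility across tokenizers on a held-out domain corpus gives the kind of vocabulary-fit signal the abstract refers to.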
