Utilizing LLMs and ML Algorithms in Disaster-Related Social Media Content

Vasileios Linardos
Maria Drakaki
Panagiotis Tzionas

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

In this research, we explore the use of Large Language Models (LLMs) and clustering techniques to automate the structuring and labeling of disaster-related social media con-tent. With a gathered dataset comprising millions of tweets related to various disasters, our approach aims to transform unstructured and unlabeled data into a structured and labeled format that can be readily used for training machine learning algorithms and en-hancing disaster response efforts. We leverage LLMs to preprocess and understand the semantic content of the tweets, applying several semantic properties to the data, followed by the application of clustering techniques to identify emerging themes and patterns that may not be captured by predefined categories and are surfaced through topic extraction of the clusters. We proceed with manual labeling and evaluation of 10,000 examples to evaluate the LLMs' ability to understand tweet features. Our methodology is applied to re-al-world data for disaster events, with results directly applicable to actual crisis situations.

Version published to 10.20944/preprints202505.1798.v1
May 23, 2025

Exploration of Large Language Models forGeotagging of Social Media Posts

This article has 2 authors:
1. Riwaz Udas
2. Richard Sinnott
This article has no evaluationsLatest version Feb 3, 2026
A Hybrid Ensemble Framework for Interpretable Topic and Sentiment Analysis of Social Media Content

This article has 2 authors:
1. Dhimesh Parmar
2. Paresh Tanna
This article has no evaluationsLatest version Jan 30, 2026
Text as Data for Crisis-Early Warning: A Comparative Assessment of NLP Methods for Conflict Prediction

This article has 1 author:
1. Julian Walterskirchen
This article has no evaluationsLatest version Dec 23, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Exploration of Large Language Models forGeotagging of Social Media Posts

A Hybrid Ensemble Framework for Interpretable Topic and Sentiment Analysis of Social Media Content

Text as Data for Crisis-Early Warning: A Comparative Assessment of NLP Methods for Conflict Prediction