Leveraging Social Media for Public Health: NLP Implementations for Blood Donation Data Analysis in Japan

Roberto Espinoza
Kazumasa Kishimoto
Chang Liu
Luciano H.O. Santos
Yukiko Mori
Goshiro Yamamoto
Tomohiro Kuroda

Abstract

Background: Blood donation is crucial for healthcare systems, yet maintaining an adequate supply is a persistent challenge. Traditional methods for understanding public sentiment and donor behavior are often limited. Social media, particularly Twitter, offers a promising alternative for real-time insights. This study explores the viability of using Twitter data to analyze blood donation sentiment in Japan, considering the evolving perspectives of younger generations.

Methods: We replicated the results of a previous study using the Tohoku BERT model, then tested a refined Blood Donation Tweets for User Classification (BDT-UC) dataset and a further customized version of the model to improve classification. We also compared several topic modeling methods, including Latent Dirichlet Allocation (LDA), Non-Negative Matrix Factorization (NMF), and BERT-based models, each under two different preprocessing techniques. Finally, we integrated the classification into the topic modeling analysis for a final evaluation.

Results: Although the refined dataset yielded lower overall classification performance, with precision falling slightly from 78.4% in the best evaluated model to 75.8% in the refined model, it improved the implementation results, ensuring more balanced labeling across the data. For topic modeling, BERT-based topic models, particularly those preprocessed with the MeCab library, achieved higher coherence and diversity scores than traditional methods. Additionally, there were significant differences when the dataset was processed by user category: coherence and diversity increased for the undetermined category, but coherence was notably lower for the other categories.

Conclusion: This study underscores the significance of initial classification and preprocessing for effective topic modeling, which affects the viability of extracting insights from Japanese social media data. The developed methodologies could support more effective analysis of blood donation groups and better-targeted donation campaigns.
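
The abstract reports topic diversity scores without stating how they were computed. As an illustration only, one common definition of topic diversity is the fraction of unique words among the top-k words of all topics. The minimal sketch below assumes that definition; the function name `topic_diversity`, the `top_k` parameter, and the toy topics are hypothetical and are not taken from the study.

```python
from typing import Sequence


def topic_diversity(topics: Sequence[Sequence[str]], top_k: int = 10) -> float:
    """Return the share of unique words among the top_k words of every topic.

    Assumed definition (not confirmed by the abstract): 1.0 means no word is
    shared between topics, values near 0 mean the topics largely repeat.
    """
    # Collect the first top_k words of each topic into one flat list.
    top_words = [word for topic in topics for word in topic[:top_k]]
    if not top_words:
        raise ValueError("no topic words supplied")
    return len(set(top_words)) / len(top_words)


# Hypothetical toy topics (romanized placeholders, not data from the study).
topics = [
    ["kenketsu", "yoyaku", "room", "kyou"],
    ["kenketsu", "kyouryoku", "onegai", "fusoku"],
]
print(topic_diversity(topics, top_k=4))
```

Here the two toy topics share one of their eight top words, so the score falls below 1.0. A score like this says nothing about whether the topics are interpretable, which is why it is usually read alongside a coherence measure.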

Version published to 10.21203/rs.3.rs-5000403/v1 on Research Square
Sep 30, 2024

Related articles

A Large Language Model-based Approach for Analyzing Covariates of Health Equity in Registered Research Projects

This article has 2 authors:
1. Navapat Nananukul
2. Mayank Kejriwal
This article has no evaluations
Latest version Sep 26, 2024

Multilingual User Perceptions Analysis from Twitter using Zero Shot Learning for Border Control Technologies

This article has 4 authors:
1. Sarang Shaikh
2. Sule Yildirim Yayilgan
3. Erjon Zoto
4. Mohamed Abomhara
This article has no evaluations
Latest version Oct 24, 2024

Impact of Telemedicine through Social Media: A Study of Topics in User Comments on Twitter

This article has 3 authors:
1. Mario Sierra Martín
2. Fang-Wei Chen
3. Pilar Alarcón Urbistondo
This article has no evaluations
Latest version Oct 14, 2024
