Building an Analytical Framework for Tobacco-Related Misinformation on Social Media: An Exploratory Analysis with Generative AI Assistance

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Background

The propagation of tobacco-related misinformation significantly impacts public health, particularly affecting people with less access to reliable information sources (such as those with lower education), who may also su affer disproportionate tobacco-related morbidity and mortality. This study analyzed a dataset from Twitter to identify the characteristics of tobacco-related misinformation, with the goal of creating a framework for its identification, categorization, and validation.

Methods

A collection of 3.4 million tweets related to tobacco and nicotine was refined to 842,754 after removing irrelevant and duplicate posts. LDA topic modeling identified six unique topics, from which two randomly selected samples of tweets were drawn to perform qualitative analysis and AI-assisted analysis to identify categories of tobacco misinformation.

Results

The identified tobacco-related misinformation was categorized by three dimensions (1) content, including safety and health effects, cessation, substance, and policy; (2) type of falsehood, which included fabrication and unsubstantiated claims, misrepresentations, and distortions; and (3) source, ranging from individuals and retail stores to advocacy groups and influencers.

A notable finding was the prevalence of policy-related discussions of tobacco misinformation on Twitter (X), highlighting this often-overlooked domain. The controversy over vaping has amplified pro-vaping voices on social media, with content frequently misinterpreting scientific findings, policies, and expert opinions, reflecting more nuanced and difficult to recognize falsehood in the misleading content.

Conclusion

This study offers a comprehensive framework for analyzing tobacco-related misinformation on social media, emphasizing key issues in policy debates and the presence of conspiracy narratives. This framework can inform the design of interventions for less informed populations and enhance data annotation for machine learning tasks.

Article activity feed