Leveraging Data Mining to Extract Accidental Drug Overdose Death Patterns: 2012-2014 US Dataset as Case Study

Noor UL Amin


Abstract

This study examined drug-related accidental deaths in the United States, as recorded in the Accidental_Drug_Related_Deaths.csv dataset, to identify patterns, trends, and risk factors and to assess how applicable secondary data can be to public health planning. The dataset contains 11,981 records with 48 single-value fields covering demographic information, the location of each event, and the substances involved. Preprocessing included replacing missing values, standardizing field values, reducing the data for analysis while retaining the ability to examine the original structure, and transforming or restructuring fields for meaningful analysis. The study then applied data mining techniques, including association rule mining, classification, clustering, and outlier detection. The analysis identified high-risk demographic groups, the drug combinations most often found in overdoses, spatial hotspots for overdoses, and a few outliers. The study also presents visualizations and interpretations of the data and assesses ethical considerations around privacy, data exploitation or misappropriation, and bias. It concludes that data mining is an effective analysis strategy for helping public health, policy development, and emergency management organizations anticipate and mitigate the incidence and severity of drug overdoses.
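To make the association rule mining step concrete, the following is a minimal sketch, not the author's implementation. It assumes each record has already been reduced to the set of substances flagged as present. The toy records, the `pair_rules` helper, and the `min_support` threshold are all hypothetical and invented for illustration; none of the values come from the dataset.

```python
from collections import Counter
from itertools import combinations

# Toy records invented purely for illustration (NOT taken from the dataset).
# Each set lists the substances flagged as present in one hypothetical record.
records = [
    {"Heroin", "Fentanyl"},
    {"Heroin", "Cocaine"},
    {"Heroin", "Fentanyl", "Cocaine"},
    {"Fentanyl"},
    {"Benzodiazepine", "Heroin"},
]


def pair_rules(records, min_support=0.2):
    """Return (antecedent, consequent, support, confidence, lift) for item pairs.

    support    = share of records containing both items
    confidence = share of records with the antecedent that also have the consequent
    lift       = confidence divided by the overall share of the consequent
    """
    n = len(records)
    item_counts = Counter(item for r in records for item in r)
    pair_counts = Counter(
        pair for r in records for pair in combinations(sorted(r), 2)
    )

    rules = []
    for (a, b), count in pair_counts.items():
        support = count / n
        if support < min_support:
            continue  # skip pairs that co-occur too rarely
        # Each frequent pair yields two directed rules: a -> b and b -> a.
        for x, y in ((a, b), (b, a)):
            confidence = count / item_counts[x]
            lift = confidence / (item_counts[y] / n)
            rules.append((x, y, support, confidence, lift))

    # Highest-confidence rules first; ties broken alphabetically.
    return sorted(rules, key=lambda r: (-r[3], r[0], r[1]))


for x, y, support, confidence, lift in pair_rules(records):
    print(f"{x} -> {y}: support={support:.2f}, "
          f"confidence={confidence:.2f}, lift={lift:.2f}")
```

A lift above 1 suggests that two substances appear together more often than their individual frequencies would predict. This sketch only covers pairs; a full Apriori or FP-Growth implementation would also consider larger itemsets.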

Version published to 10.20944/preprints202509.0260.v1
Sep 3, 2025

Related articles

Data Mining of Public Databases to Identify TCM Syndrome Patterns in Gout: A Retrospective Study

This article has 7 authors:
1. Guihua Yue
2. Chengxiang Guo
3. Dongming Zhang
4. Tong Mo
5. Yihan Liu
6. Xiaohua Yang
7. Yuran Feng
This article has no evaluations. Latest version: Feb 2, 2026

Machine Learning Analysis of COVID19 Transmission Dynamics Demographic Risk and Contact Tracing Outcomes in Nigeria

This article has 7 authors:
1. Bolanle Adefowoke Ojokoh
2. Oluwafemi A. Sarumi
3. Sadura Priscilla Akinrinwa
4. Abimbola H. Afolayan
5. Tobore V. Igbe
6. Abiola Ezekiel Taiwo
7. Uchechukwu M. Chukwuocha
This article has no evaluations. Latest version: Dec 12, 2025

Comparing Algorithm Effectiveness in Health Data Analysis

This article has 1 author:
1. Abdulmalik Hazaa Alshammari
This article has no evaluations. Latest version: Jan 22, 2026
