A chemical space model for the exploration of eco-toxicological data

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

With over 350000 chemicals and mixtures currently registered for production and use worldwide, up to 90% of authorized chemicals lack adequate toxicity data, leaving the majority of chemicals poorly characterized. International agencies urge scientists to develop screening methods to explore, identify, and predict chemical hazards, supporting the prioritization of chemical risk assessment. Here, Tree Manifold Approximation and Projection (TMAP) were applied, with the aim of reducing the dimensionality of large toxicological dataset, providing the foundations to data imputation methods allowing to get an understanding of chemical modes of action. Specifically, TMAP was implemented using MHFP6 fingerprints and the NORMAN SusDat database, containing over 100000 compounds. Physicochemical properties and CTD toxicogenomic data were retrieved and a graph-based spatial imputation function was generated to obtain insights into the potential ecotoxicity mechanisms of data poor chemicals. The relevance and meaningfulness of TMAP chemical space was explored for Daphnia magna , Pimephales promelas and Algae . Chemical classes known to be structurally similar were found to be grouped together in the TMAP chemical space, while heterogeneous classes were found to be sparse. Data imputation allowed for the identification of known and potential chemical mechanisms of action. Indeed, acethylcolinesterase and transthyretin were confirmed as major mechanisms of action of organothiophosphate and brominated flame retardant toxicity in Daphnia magna and Pimephales promelas , respectively. Overall, transdisciplinary toxicological databases combined with TMAP, stand out as a powerful, fast and scalable method to explore large datasets, allowing for meaningful and interpretable associations between chemical structures and chemical hazard identification.

Graphical Abstract

Highlights

  • Transdisciplinary toxicological datasets are unique resources for hazard identification.

  • TMAP is a powerful tool for the visualization of very large ecotoxicological datasets.

  • Graph-based imputation helps to identify targets for additional toxicological evaluation.

  • Article activity feed