Functional Effect Predictions For Ion Channel Missense Variants Using a Protein Language Model

Abstract

Channelopathies represent a group of diseases often caused by missense variants in ion channels affecting the functioning of tissues like the nervous system, heart, and muscle. The gold standard for functionally characterizing a variant is to measure the electrophysiological changes in channel properties using cell-based heterologous expression systems. As this method is time-consuming and generally unavailable, clinical practice often relies on in silico models to predict the functional consequences of ion channel variants. We constructed a Missense ION (MissION) channel variant classifier based on a protein language model and trained on 3176 gain- or loss-of-function variants, the largest set collected to date, in order to predict the functional effects of variants across a broad range of ion channels. MissION achieves a significant increase in predictive performance (Area Under the Receiver Operating Characteristic Curve (ROC-AUC): 0.925, compared to 0.897 for the current leading model). Moreover, the model generalizes well to ion channel genes for which little or no electrophysiological recordings are available. MissION provides functional predictions for over 600,000 ion channel variants, made available through an online interface at www.synaptica.nl/variant-interpreter that allows variant interpretation for a wide range of channelopathies.

This Zenodo record is a permanently preserved version of a PREreview. You can view the complete PREreview at https://prereview.org/reviews/18842040.

This review is the result of a virtual, collaborative Live Review organized and hosted by the PREreview team as part of an ongoing collaboration with the Rare As One Network on December 3, 2025. The discussion was joined by 4 people: 2 facilitators from the PREreview Team, 1 members of the Chan-Zuckerberg Initiative Rare As One Project team, and 1 Live Review participants from the Rare As One Network. We thank all participants who contributed to the discussion and made it possible for us to provide feedback on this preprint.

This manuscript describes the development and validation of a novel machine learning classifier (MissION) designed to predict whether missense variants in ion channel genes result in gain-of-function (GOF) or loss-of-function (LOF) effects. Recognizing the clinical importance of accurately distinguishing GOF from LOF variants for diagnosis, treatment selection, and eligibility for clinical trials, the authors assembled a large dataset of 3,176 ion channel missense variants derived from published literature and standardized sources. They trained a protein language model–based classifier and evaluated performance through cross-validation, benchmarking against existing tools, and assessment of generalizability. The model demonstrated strong predictive accuracy and outperformed existing approaches, particularly for ion channel genes lacking electrophysiological recordings, thereby enabling functional inference where experimental data are limited. The inclusion of diverse channel types, including potassium channels, and the integration of both molecular and clinical context enhanced predictive power.

Overall, the research approach is sound and aligned with the study's goal of improving variant interpretation to support positive patient outcomes. However, as with many machine learning studies relying on curated literature data, performance depends on the accuracy and consistency of prior experimental annotations, which may not always reflect a perfect ground truth. The authors appropriately acknowledge key limitations, including imperfect ground truth labels, variants with mixed or context-dependent effects, and gene-level variability that may influence model performance. Data appear to be accessible (e.g., through the Synaptica Variant Interpreter website), supporting transparency, although full reproducibility would depend on the availability of detailed implementation materials. The conclusions are well supported by the data presented, and the figures and tables effectively communicate model performance and comparative analyses.

We thank the authors of the preprint for sharing their research openly which allowed us to learn about the work and share our feedback openly.

Competing interests

The authors declare that they have no competing interests.

Use of Artificial Intelligence (AI)

The authors declare that they did not use generative AI to come up with new ideas for their review.

Read the original source

Functional Effect Predictions For Ion Channel Missense Variants Using a Protein Language Model

This article has been Reviewed by the following groups

Discuss this preprint

Listed in

Abstract

Article activity feed

Competing interests

Use of Artificial Intelligence (AI)

Protein Language Models Rescue Variant Pathogenicity Prediction in Intrinsically Disordered Regions Through Synergistic Integration with Structure-Based Methods

Integrative Transcriptomic and Machine Learning Analysis Reveals a Robust Gene Signature in Parkinson’s Disease

Exploring genetic variability in FOXE1 through in silico approaches: new insights on UTR variants

This article has been Reviewed by the following groups

Discuss this preprint

Listed in

Abstract

Article activity feed

Competing interests

Use of Artificial Intelligence (AI)

Related articles

Protein Language Models Rescue Variant Pathogenicity Prediction in Intrinsically Disordered Regions Through Synergistic Integration with Structure-Based Methods

Integrative Transcriptomic and Machine Learning Analysis Reveals a Robust Gene Signature in Parkinson’s Disease

Exploring genetic variability in FOXE1 through in silico approaches: new insights on UTR variants