Real-time detection of spoken speech from unlabeled ECoG signals: A pilot study with an ALS participant

Miguel Angrick
Shiyu Luo
Qinwan Rabbani
Shreya Joshi
Daniel N. Candrea
Griffin W. Milsap
Chad R. Gordon
Kathryn Rosenblatt
Lora Clawson
Nicholas Maragakis
Francesco V. Tenore
Matthew S. Fifer
Nick F. Ramsey
Nathan E. Crone

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Objective . Brain-Computer Interfaces (BCIs) hold significant promise for restoring communication in individuals with partial or complete loss of the ability to speak due to paralysis from amyotrophic lateral sclerosis (ALS), brainstem stroke, and other neurological disorders. Many of the approaches to speech decoding reported in the BCI literature have required time-aligned target representations to allow successful training – a major challenge when translating such approaches to people who have already lost their voice. Approach . In this pilot study, we made a first step toward scenarios in which no ground truth is available. We utilized a graph-based clustering approach to identify temporal segments of speech production from electrocorticographic (ECoG) signals alone. We then used the estimated speech segments to train a voice activity detection (VAD) model using only ECoG signals. We evaluated our approach using held-out open-loop recordings of a single dysarthric clinical trial participant living with ALS, and we compared the resulting performance to previous solutions trained with ground truth acoustic voice recordings. Main results . Our approach achieves a median error rate of around 0.5 seconds with respect to the actual spoken speech. Embedded into a real-time BCI, our approach is capable of providing VAD results with a latency of only 10 ms. Significance . To the best of our knowledge, our results show for the first time that speech activity can be predicted purely from unlabeled ECoG signals, a crucial step toward individuals who cannot provide this information anymore due to their neurological condition, such as patients with locked-in syndrome. Clinical Trial Information . ClinicalTrials.gov, registration number NCT03567213 .

Version published to 10.1101/2024.09.18.24313755 on medRxiv
Sep 22, 2024

A Novel Machine Learning Model for Non-Invasive EEG-Based Inner-Speech Translation in ALS

This article has 1 author:
1. Alex Steiner
This article has no evaluationsLatest version Jan 2, 2026
Remote Optical Decoding of Inner Speech in Broca’s Area via AI-based Speckle Pattern Analysis

This article has 7 authors:
1. Natalya Segal
2. Moshe Bar
3. Daniel Rubinstein
4. Sergey Agdarov
5. Yafim Beiderman
6. Yevgeny Beiderman
7. Zeev Zalevsky
This article has no evaluationsLatest version Jan 30, 2026
Pilot Study of Voice Biomarkers: Exploring Healthy Controls in a Non-Clinical Setting

This article has 7 authors:
1. Tara Chatty
2. Shreshtha Das
3. Corinthian Ewesuedo
4. Ezimma Onwuka
5. Waleed Shirwa
6. Paul C. Bryson
7. Colin K. Drummond
This article has no evaluationsLatest version Dec 14, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Novel Machine Learning Model for Non-Invasive EEG-Based Inner-Speech Translation in ALS

Remote Optical Decoding of Inner Speech in Broca’s Area via AI-based Speckle Pattern Analysis

Pilot Study of Voice Biomarkers: Exploring Healthy Controls in a Non-Clinical Setting