Performance and limitations of smartwatch estimation of VO2max during free-living activity in non-athletes
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Background. Maximal oxygen consumption (VO 2 max) is the gold standard for assessing cardiorespiratory fitness (CRF), and it is a strong predictor of clinical outcomes, but it is measured during cardiopulmonary exercise testing (CPET), which is expensive and time-consuming. Smartwatches provide a great opportunity to assess CRF at scale. Although smartwatches have been previously validated in young and physically active individuals, their accuracy in sedentary individuals remains undetermined and this limits the use smartwatch VO 2 max in large clinical or epidemiological studies. This study aims to determine the accuracy of smartwatch VO 2 max in a large group of individuals with varying underlying CRF. Methods. N=203 adults (160 (79%) female; median (interquartile range; IQR) 55 (32,61) years old) from the CONVALESCENCE case-control study of long COVID, performed a maximal CPET on a semi-recumbent ergometer and wore a consumer-grade smartwatch for three months. Smartwatch-estimated VO 2 max was compared to reference VO 2 max measured by CPET and predicted VO 2 max calculated using standard non-exercise equations. Agreement was assessed using Bland-Altman analysis (Bias [Limits of Agreement; LoA]), error (ΔVO 2 max), absolute error (|ΔVO 2 max|), and Pearson’s correlation coefficient (r). Results. CPET-measured VO 2 max was 22.2 (17.5, 27.3) ml/kg/min. Smartwatch-estimated VO 2 max correlated moderately with CPET-measured VO 2 max (r=0.65) and strongly with VO 2 max predicted using non-exercise equations (r=0.92). Bland-Altman plots showed a large positive bias and wide LoA (14.5 [2.2, 26.8 ml/kg/min]). Inaccuracy of smartwatch-estimated VO 2 max (ΔVO 2 max) linearly increased with increasing mismatch between expected and measured CRF (correlation with %pVO 2 max r=-0.81). The absolute error was larger in those with lower-than-expected CRF (%pVO 2 max<100%) and in those engaging in light intensity activities, while no differences were observed when comparing participants with or without long COVID or symptoms of fatigue. Multiple linear regression analysis of the absolute error identified the mismatch between expected and measured CRF (%pVO 2 max) as the main contributor to smartwatch inaccuracy. Conclusions. In non-athletes, smartwatch-derived VO 2 max correlated with reference values but did not provide additional accuracy to VO 2 max prediction based on sex, age, weight and height. Inaccuracy significantly increased in those with lower-than-expected CRF. These findings highlight current limitations in the use of smartwatch VO 2 max for clinical applications and large epidemiological studies. Trial registration number: Z6364106/2021/06/04.