COVID-19 trajectories among 57 million adults in England: a cohort study using electronic health records

Johan H Thygesen
Christopher Tomlinson
Sam Hollings
Mehrdad A Mizani
Alex Handy
Ashley Akbari
Amitava Banerjee
Jennifer Cooper
Alvina G Lai
Kezhi Li
Bilal A Mateen
Naveed Sattar
Reecha Sofat
Ana Torralbo
Honghan Wu
Angela Wood
Jonathan A C Sterne
Christina Pagel
William N Whiteley
Cathie Sudlow
Harry Hemingway
Spiros Denaxas
Hoda Abbasizanjani
Nida Ahmed
Badar Ahmed
Ashley Akbari
Abdul Qadr Akinoso-Imran
Elias Allara
Freya Allery
Emanuele Di Angelantonio
Mark Ashworth
Vandana Ayyar-Gupta
Sonya Babu-Narayan
Seb Bacon
Steve Ball
Ami Banerjee
Mark Barber
Jessica Barrett
Marion Bennie
Colin Berry
Jennifer Beveridge
Ewan Birney
Lana Bojanić
Thomas Bolton
Anna Bone
Jon Boyle
Tasanee Braithwaite
Ben Bray
Norman Briffa
David Brind
Katherine Brown
Maya Buch
Dexter Canoy
Massimo Caputo
Raymond Carragher
Alan Carson
Genevieve Cezard
Jen-Yu Amy Chang
Kate Cheema
Richard Chin
Yogini Chudasama
Jennifer Cooper
Emma Copland
Rebecca Crallan
Rachel Cripps
David Cromwell
Vasa Curcin
Gwenetta Curry
Caroline Dale
John Danesh
Jayati Das-Munshi
Ashkan Dashtban
Alun Davies
Joanna Davies
Gareth Davies
Neil Davies
Joshua Day
Antonella Delmestri
Spiros Denaxas
Rachel Denholm
John Dennis
Alastair Denniston
Salil Deo
Baljean Dhillon
Annemarie Docherty
Tim Dong
Abdel Douiri
Johnny Downs
Alexandru Dregan
Elizabeth A Ellins
Martha Elwenspoek
Fabian Falck
Florian Falter
Yat Yi Fan
Joseph Firth
Lorna Fraser
Rocco Friebel
Amir Gavrieli
Moritz Gerstung
Ruth Gilbert
Clare Gillies
Myer Glickman
Ben Goldacre
Raph Goldacre
Felix Greaves
Mark Green
Luca Grieco
Rowena Griffiths
Deepti Gurdasani
Julian Halcox
Nick Hall
Tuankasfee Hama
Alex Handy
Anna Hansell
Pia Hardelid
Flavien Hardy
Daniel Harris
Camille Harrison
Katie Harron
Abdelaali Hassaine
Lamiece Hassan
Russell Healey
Harry Hemingway
Angela Henderson
Naomi Herz
Johannes Heyl
Mira Hidajat
Irene Higginson
Rosie Hinchliffe
Julia Hippisley-Cox
Frederick Ho
Mevhibe Hocaoglu
Sam Hollings
Elsie Horne
David Hughes
Ben Humberstone
Mike Inouye
Samantha Ip
Nazrul Islam
Caroline Jackson
David Jenkins
Xiyun Jiang
Shane Johnson
Umesh Kadam
Costas Kallis
Zainab Karim
Jake Kasan
Michalis Katsoulis
Kim Kavanagh
Frank Kee
Spencer Keene
Seamus Kent
Sara Khalid
Anthony Khawaja
Kamlesh Khunti
Richard Killick
Deborah Kinnear
Rochelle Knight
Ruwanthi Kolamunnage-Dona
Evan Kontopantelis
Amanj Kurdi
Ben Lacey
Alvina Lai
Andrew Lambarth
Milad Nazarzadeh Larzjan
Deborah Lawler
Thomas Lawrence
Claire Lawson
Qiuju Li
Ken Li
Miguel Bernabeu Llinares
Paula Lorgelly
Deborah Lowe
Jane Lyons
Ronan Lyons
Pedro Machado
Mary Joan Macleod
John Macleod
Evaleen Malgapo
Mamas Mamas
Mohammad Mamouei
Sinduja Manohar
Rutendo Mapeta
Javiera Leniz Martelli
David Moreno Martos
Bilal Mateen
Aoife McCarthy
Craig Melville
Rebecca Milton
Mehrdad Mizani
Marta Pineda Moncusi
Daniel Morales
Ify Mordi
Lynn Morrice
Carole Morris
Eva Morris
Yi Mu
Tanja Mueller
Lars Murdock
Vahé Nafilyan
George Nicholson
Elena Nikiphorou
John Nolan
Tom Norris
Ruth Norris
Laura North
Teri-Louise North
Dan O'Connell
Dominic Oliver
Adejoke Oluyase
Abraham Olvera-Barrios
Efosa Omigie
Sarah Onida
Sandosh Padmanabhan
Tom Palmer
Laura Pasea
Riyaz Patel
Rupert Payne
Jill Pell
Carmen Petitjean
Arun Pherwani
Owen Pickrell
Livia Pierotti
Munir Pirmohamed
Rouven Priedon
Dani Prieto-Alhambra
Alastair Proudfoot
Terry Quinn
Jennifer Quint
Elena Raffetti
Kazem Rahimi
Shishir Rao
Cameron Razieh
Brian Roberts
Caroline Rogers
Jennifer Rossdale
Safa Salim
Nilesh Samani
Naveed Sattar
Christian Schnier
Roy Schwartz
David Selby
Olena Seminog
Sharmin Shabnam
Ajay Shah
Jon Shelton
James Sheppard
Shubhra Sinha
Mirek Skrypak
Martina Slapkova
Katherine Sleeman
Craig Smith
Reecha Sofat
Filip Sosenko
Matthew Sperrin
Sarah Steeg
Jonathan Sterne
Serban Stoica
Maria Sudell
Cathie Sudlow
Luanluan Sun
Arun Karthikeyan Suseeladevi
Michael Sweeting
Matt Sydes
Rohan Takhar
Howard Tang
Johan Thygesen
George Tilston
Claire Tochel
Clea du Toit
Christopher Tomlinson
Renin Toms
Fatemeh Torabi
Ana Torralbo
Julia Townson
Adnan Tufail
Tapiwa Tungamirai
Susheel Varma
Sebastian Vollmer
Venexia Walker
Tianxiao Wang
Huan Wang
Alasdair Warwick
Ruth Watkinson
Harry Watson
William Whiteley
Hannah Whittaker
Harry Wilde
Tim Wilkinson
Gareth Williams
Michelle Williams
Richard Williams
Eloise Withnell
Charles Wolfe
Angela Wood
Lucy Wright
Honghan Wu
Jinge Wu
Jianhua Wu
Tom Yates
Francesco Zaccardi
Haoting Zhang
Huayu Zhang
Luisa Zuccolo

This article has been Reviewed by the following groups

Read the full article

Listed in

Evaluated articles (ScreenIT)

Abstract

No abstract available

Version published to 10.1016/s2589-7500(22)00091-7
Jul 1, 2022

SciScore for 10.1101/2021.11.08.21265312: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

Ethics	not detected.
Sex as a biological variable	not detected.
Randomization	not detected.
Blinding	not detected.
Power Analysis	not detected.
Cell Line Authentication	Authentication: We assessed 270 previously described comorbidities14, across 16 clinical specialities / organ systems, using validated CALIBER phenotypes and data records from 1st of January 1996 until 31st December 2019 from primary care, hospitalisation and procedure data14 (Supplementary Figure 3).

Table 2: Resources

Software and Algorithms
Sentences	Resources
A Reporting of studies Conducted using Observational Routinely-collected Data (RECORD) statement can be found in the supplement.	RECORD suggested: (RECORD, RRID:SCR_009097)
Data cleaning, …

SciScore for 10.1101/2021.11.08.21265312: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

Ethics	not detected.
Sex as a biological variable	not detected.
Randomization	not detected.
Blinding	not detected.
Power Analysis	not detected.
Cell Line Authentication	Authentication: We assessed 270 previously described comorbidities14, across 16 clinical specialities / organ systems, using validated CALIBER phenotypes and data records from 1st of January 1996 until 31st December 2019 from primary care, hospitalisation and procedure data14 (Supplementary Figure 3).

Table 2: Resources

Software and Algorithms
Sentences	Resources
A Reporting of studies Conducted using Observational Routinely-collected Data (RECORD) statement can be found in the supplement.	RECORD suggested: (RECORD, RRID:SCR_009097)
Data cleaning, exploratory analysis, phenotype creation and cohort assembly was performed using Python (3.7) and Spark SQL (2.4.5) on Databricks Runtime 6.4 for Machine Learning.	Python suggested: (IPython, RRID:SCR_001658)
Analysis was performed in RStudio (Professional) Version 1.3.1093.1	RStudio suggested: (RStudio, RRID:SCR_000432)
Figures were constructed using ggplot2 (3.3.3), VennDiagram (1.6.20), igraph (1.2.6), survival (3.2.7) and survminer (0.4.8) packages.	ggplot2 suggested: (ggplot2, RRID:SCR_014601) VennDiagram suggested: (VennDiagram, RRID:SCR_002414)

Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).

Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:

Strengths and limitations: A key strength of this work using national-scale data is that by definition it is representative of the general population across all age groups, ethnicities, deprivation levels and demographic characteristics. To our knowledge, this is the largest population-wide research study of COVID-19 phenotypes which includes: a) multiple healthcare settings through data linkage at a population level, b) detailed identification of specific ventilatory treatments, c) classification of COVID-19 related deaths, and d) exploration of transitions between COVID-19 events. Using multiple EHR sources spanning different healthcare settings, maximised infection ascertainment and reduced the effects of variable testing and data recording patterns (especially during the first wave). As the focus of this work was to create COVID-19 related phenotypes, and describe the characteristics of individuals experiencing them, we have not conducted multivariable regression analyses to control for confounders. The findings presented are therefore not associative statements and should not be interpreted as causal relationships. However by sharing reproducible phenotype definitions we hope to facilitate further work to address the questions raised in this and other COVID-19 studies exploring national level data, as exemplified by recent research17–19. Whilst our definitions of the pandemic waves differ from others, we believe using non-contiguous dates enabled a balanced comparison ac...

Results from TrialIdentifier: No clinical trial numbers were referenced.

Results from Barzooka: We did not find any issues relating to the usage of bar graphs.

Results from JetFighter: We did not find any issues relating to colormaps.

Results from rtransparent:

Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
No funding statement was detected.
No protocol registration statement was detected.

Results from scite Reference Check: We found no unreliable references.

Read the original source

Version published to 10.1101/2021.11.08.21265312v1 on medRxiv
Nov 9, 2021

Characteristics and Early Diagnosis of Motor Neuron Disease (MND) in 67 million individuals in England: a comparative study on phenotyping models derived by AI, Knowledge Graphs and the MND Association

This article has 19 authors:
1. Yusuf Abdulle
2. Jinge Wu
3. Sanjay Budhdeo
4. Yunsoo Kim
5. Jiashu Shen
6. Emily Sun
7. Waqar Ali
8. Chengliang Dai
9. Phil Scordis
10. Arijit Patra
11. Ahmad Al Khleifat
12. Ammar Al-Chalabi
13. Alfredo Iacoangeli
14. Huanyu Zhang
15. Paul Taylor
16. Sarah Wild
17. Zina Ibrahim
18. Richard Dobson
19. Honghan Wu
This article has no evaluationsLatest version Jul 2, 2025
A Statewise Analysis of the Socioeconomic and Health Impacts of the COVID-19 Pandemic in India: Lessons for Future Health System Preparedness

This article has 9 authors:
1. Geetha R. Menon
2. U Venkatesh
3. Jeetendra Yadav
4. Krushna Chandra Sahoo
5. Tanu Anand
6. Ashoo Grover
7. Saurabh Sharma
8. Sandhya Singh
9. Firoz Khan
This article has no evaluationsLatest version Jul 2, 2025
Time trends in new diagnoses of 19 long-term conditions: a population-level cohort study in England using OpenSAFELY

This article has 24 authors:
1. Mark D Russell
2. Andrea Schaffer
3. Katie Bechman
4. Mark Gibson
5. Jon Massey
6. Rose Higgins
7. Brian MacKenna
8. Peter Inglesby
9. Seb Bacon
10. Amir Mehrkar
11. Ben Goldacre
12. Edward Alveyn
13. Victoria Allen
14. Zijing Yang
15. Samir Patel
16. Maryam A Adas
17. Gurjinder Sandhu
18. Elizabeth Price
19. Rouvick M Gama
20. Kate Bramham
21. Matthew Hotopf
22. Sam Norton
23. Andrew P Cope
24. James B Galloway
This article has no evaluationsLatest version Jul 11, 2025

This article has been Reviewed by the following groups

Listed in

Abstract

Article activity feed

Related articles

Characteristics and Early Diagnosis of Motor Neuron Disease (MND) in 67 million individuals in England: a comparative study on phenotyping models derived by AI, Knowledge Graphs and the MND Association

A Statewise Analysis of the Socioeconomic and Health Impacts of the COVID-19 Pandemic in India: Lessons for Future Health System Preparedness

Time trends in new diagnoses of 19 long-term conditions: a population-level cohort study in England using OpenSAFELY