Prevalence and risk factors for long COVID among adults in Scotland using electronic health records: a national, retrospective, observational cohort study

Karen Jeffrey, Lana Woolford, Rishma Maini, Siddharth Basetti, Ashleigh Batchelor, David Weatherill, Chris White, Vicky Hammersley, Tristan Millington, Calum Macdonald, Jennifer K. Quint, Robin Kerr, Steven Kerr, Syed Ahmar Shah, Igor Rudan, Adeniyi Francis Fagbamigbe, Colin R. Simpson, Srinivasa Vittal Katikireddi, Chris Robertson, Lewis RitchieAziz Sheikh, Luke Daines* (Corresponding Author)

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Summary Background Long COVID is a debilitating multisystem condition. The objective of this study was to estimate the prevalence of long COVID in the adult population of Scotland, and to identify risk factors associated with its development. Methods In this national, retrospective, observational cohort study, we analysed electronic health records (EHRs) for all adults (≥18 years) registered with a general medical practice and resident in Scotland between March 1, 2020, and October 26, 2022 (98–99% of the population). We linked data from primary care, secondary care, laboratory testing and prescribing. Four outcome measures were used to identify long COVID: clinical codes, free text in primary care records, free text on sick notes, and a novel operational definition. The operational definition was developed using Poisson regression to identify clinical encounters indicative of long COVID from a sample of negative and positive COVID-19 cases matched on time-varying propensity to test positive for SARS-CoV-2. Possible risk factors for long COVID were identified by stratifying descriptive statistics by long COVID status. Findings Of 4,676,390 participants, 81,219 (1.7%) were identified as having long COVID. Clinical codes identified the fewest cases (n = 1,092, 0.02%), followed by free text (n = 8,368, 0.2%), sick notes (n = 14,469, 0.3%), and the operational definition (n = 64,193, 1.4%). There was limited overlap in cases identified by the measures; however, temporal trends and patient characteristics were consistent across measures. Compared with the general population, a higher proportion of people with long COVID were female (65.1% versus 50.4%), aged 38–67 (63.7% versus 48.9%), overweight or obese (45.7% versus 29.4%), had one or more comorbidities (52.7% versus 36.0%), were immunosuppressed (6.9% versus 3.2%), shielding (7.9% versus 3.4%), or hospitalised within 28 days of testing positive (8.8% versus 3.3%%), and had tested positive before Omicron became the dominant variant (44.9% versus 35.9%). The operational definition identified long COVID cases with combinations of clinical encounters (from four symptoms, six investigation types, and seven management strategies) recorded in EHRs within 4–26 weeks of a positive SARS-CoV-2 test. These combinations were significantly (p 
Original languageEnglish
Article number102590
Number of pages13
JournalEClinicalMedicine
Volume71
Early online date11 Apr 2024
DOIs
Publication statusE-pub ahead of print - 11 Apr 2024

Bibliographical note

Acknowledgements
This work was supported by the Chief Scientist Office, grant number COV/LTE/20/15. EAVE II is supported by a grant (MC_PC_19075) from the Medical Research Council; and a grant (MC_PC_19004) from BREATHE–The Health Data Research Hub for Respiratory Health, funded through the UK Research and Innovation Industrial Strategy Challenge Fund. LD was supported by a post-doctoral clinical fellowship from the Asthma UK Centre for Applied Research. SVK acknowledges funding from a NRS Senior Clinical Fellowship (SCAF/15/02), the Medical Research Council (MC_UU_00022/2) and the Scottish Government Chief Scientist Office (SPHSU17). The authors would like to acknowledge the support of Dave Kelly and Lamorna Brown of Albasoft Ltd., and Sharon Kennedy, Mike Birnie, Safraj Shahul Hameed and Elliott Hall of Public Health Scotland for their involvement in obtaining approvals, provisioning, and linking data and the use of the secure analytical platform within the National Safe Haven.
Funding Chief Scientist Office (Scotland), Medical Research Council, and BREATHE.

Data Availability Statement

All code, metadata and documentation for this project is publicly available at https://github.com/EAVE-II/Long-COVID. Most of the data used in this study are highly sensitive and will not be made available publicly.

Keywords

  • Long COVID
  • Population surveillance
  • Primary health care
  • Clinical coding
  • Matched-pair analysis

Fingerprint

Dive into the research topics of 'Prevalence and risk factors for long COVID among adults in Scotland using electronic health records: a national, retrospective, observational cohort study'. Together they form a unique fingerprint.

Cite this