Automatic generation of textual summaries from neonatal intensive care data

Francois Portet, Ehud Reiter, Albert Gatt, Jim Hunter, Somayajulu Sripada, Yvonne Freer, Cindy Sykes

Research output: Contribution to journalArticlepeer-review

188 Citations (Scopus)


Effective presentation of data for decision support is a major issue when large volumes of data are generated as happens in the Intensive Care Unit (ICU). Although the most common approach is to present the data graphically, it has been shown that textual summarisation can lead to improved decision making. As part of the BabyTalk project, we present a prototype, called BT-45, which generates textual summaries of about 45 minutes of continuous physiological signals and discrete events (e.g.: equipment settings and drug administration). Its architecture brings together techniques from the different areas of signal processing, medical reasoning, knowledge engineering, and natural language generation. A clinical off-ward experiment in a Neonatal ICU (NICU) showed that human expert textual descriptions of NICU data lead to better decision making than classical graphical visualisation, whereas texts generated by BT-45 lead to similar quality decision-making as visualisations. Textual analysis showed that BT-45 texts were inferior to human expert texts in a number of ways, including not reporting temporal information as well and not producing good narratives. Despite these deficiencies, our work shows that it is possible for computer systems to generate effective textual Summaries of complex continuous and discrete temporal clinical data. (c) 2008 Elsevier B.V. All rights reserved.

Original languageEnglish
Pages (from-to)789-816
Number of pages28
JournalArtificial Intelligence
Issue number7-8
Early online date25 Dec 2008
Publication statusPublished - May 2009

Bibliographical note

A paid open access option is available for this journal.
Voluntary deposit by author of pre-print allowed on Institutions open scholarly website and pre-print servers
Voluntary deposit by author of authors post-print allowed on institutions open scholarly website including Institutional Repository
Deposit due to Funding Body, Institutional and Governmental mandate only allowed where separate agreement between repository and publisher exists
Set statement to accompany deposit
Published source must be acknowledged
Must link to journal home page or articles' DOI
Publisher's version/PDF cannot be used
Articles in some journals can be made Open Access on payment of additional charge
NIH Authors articles will be submitted to PMC after 12 months
Authors who are required to deposit in subject repositories may also use Sponsorship Option
Pre-print can not be deposited for The Lancet


  • Natural language generation
  • Intelligent data analysis
  • Intensive care unit
  • Decision support systems
  • Oriented clinical-data
  • Temporal-abstraction
  • Weather forecasts
  • Time
  • Information
  • Intelligent
  • System
  • Exploration
  • Models


Dive into the research topics of 'Automatic generation of textual summaries from neonatal intensive care data'. Together they form a unique fingerprint.

Cite this