An Architecture for Data-to-Text Systems

Ehud Baruch Reiter

An Architecture for Data-to-Text Systems

Ehud Baruch Reiter

Computing Science

Research output: Chapter in Book/Report/Conference proceeding › Published conference contribution

145 Citations (Scopus)

Abstract

I present an architecture for data-to-text systems, that is NLG systems which produce texts from non-linguistic input data; this essentially extends the architecture of Reiter and Dale (2000) to systems whose input is raw data instead of AI knowledge bases. This architecture is being used in the BabyTalk project, and is based on experiences in several projects at Aberdeen; it also seems to be compatible with many data-to-text systems developed elsewhere. It consists of four stages which are organised in a pipeline: Signal Analysis, Data Interpretation, Document Planning, and Microplanning and Realisation.

Original language	English
Title of host publication	Proceedings of the Eleventh European Workshop on Natural Language Generation (ENLG 07)
Editors	Stephan Busemann
Place of Publication	Stroudsburg
Publisher	Association for Computational Linguistics
Pages	97-104
Number of pages	8
Publication status	Published - 2007

Access to Document

http://aclweb.org/anthology-new/W/W07/W07-2315.pdf

Mainstream communication of big data using natural language generation (NLG)
Ehud Reiter (Coordinator) & Gowri Sripada (Coordinator)
Impact: Economic and/or Commercial

Cite this

@inproceedings{cbb4303f654243a586c03eb15e65ab62,

title = "An Architecture for Data-to-Text Systems",

abstract = "I present an architecture for data-to-text systems, that is NLG systems which produce texts from non-linguistic input data; this essentially extends the architecture of Reiter and Dale (2000) to systems whose input is raw data instead of AI knowledge bases. This architecture is being used in the BabyTalk project, and is based on experiences in several projects at Aberdeen; it also seems to be compatible with many data-to-text systems developed elsewhere. It consists of four stages which are organised in a pipeline: Signal Analysis, Data Interpretation, Document Planning, and Microplanning and Realisation.",

author = "Reiter, {Ehud Baruch}",

year = "2007",

language = "English",

pages = "97--104",

editor = "Stephan Busemann",

booktitle = "Proceedings of the Eleventh European Workshop on Natural Language Generation (ENLG 07)",

publisher = "Association for Computational Linguistics",

}

TY - GEN

T1 - An Architecture for Data-to-Text Systems

AU - Reiter, Ehud Baruch

PY - 2007

Y1 - 2007

N2 - I present an architecture for data-to-text systems, that is NLG systems which produce texts from non-linguistic input data; this essentially extends the architecture of Reiter and Dale (2000) to systems whose input is raw data instead of AI knowledge bases. This architecture is being used in the BabyTalk project, and is based on experiences in several projects at Aberdeen; it also seems to be compatible with many data-to-text systems developed elsewhere. It consists of four stages which are organised in a pipeline: Signal Analysis, Data Interpretation, Document Planning, and Microplanning and Realisation.

AB - I present an architecture for data-to-text systems, that is NLG systems which produce texts from non-linguistic input data; this essentially extends the architecture of Reiter and Dale (2000) to systems whose input is raw data instead of AI knowledge bases. This architecture is being used in the BabyTalk project, and is based on experiences in several projects at Aberdeen; it also seems to be compatible with many data-to-text systems developed elsewhere. It consists of four stages which are organised in a pipeline: Signal Analysis, Data Interpretation, Document Planning, and Microplanning and Realisation.

M3 - Published conference contribution

SP - 97

EP - 104

BT - Proceedings of the Eleventh European Workshop on Natural Language Generation (ENLG 07)

A2 - Busemann, Stephan

PB - Association for Computational Linguistics

CY - Stroudsburg

ER -

An Architecture for Data-to-Text Systems

Abstract

Access to Document

Fingerprint

Impacts

Mainstream communication of big data using natural language generation (NLG)

Cite this