Should corpora texts be gold standards for NLG?

Ehud Reiter; Somayajulu Sripada

Should corpora texts be gold standards for NLG?

Ehud Reiter, Somayajulu Sripada

Computing Science

Research output: Chapter in Book/Report/Conference proceeding › Published conference contribution

33 Citations (Scopus)

Abstract

There is increasing interest in using corpora in NLG, perhaps because of the success of corpus-based techniques in other areas of speech and language processing. Many uses of corpora in NLG implicitly assume that the human-authored texts in a corpora are a 'gold standard', in other words that the NLG system should produce texts similar to the corpora texts. However, our experience with several corpora raises questions about this assumption, because human authors make mistakes and because different people write differently.

Original language	English
Title of host publication	Proceedings of the International Natural Language Generation Conference, INLG 2002
Publication status	Published - 2002

Bibliographical note

Publisher Copyright:
© International Natural Language Generation Conference, INLG 2002

Cite this

@inproceedings{5b7a160e26924b409b2a71edb7b16eaf,

title = "Should corpora texts be gold standards for NLG?",

abstract = "There is increasing interest in using corpora in NLG, perhaps because of the success of corpus-based techniques in other areas of speech and language processing. Many uses of corpora in NLG implicitly assume that the human-authored texts in a corpora are a 'gold standard', in other words that the NLG system should produce texts similar to the corpora texts. However, our experience with several corpora raises questions about this assumption, because human authors make mistakes and because different people write differently.",

author = "Ehud Reiter and Somayajulu Sripada",

note = "Publisher Copyright: {\textcopyright} International Natural Language Generation Conference, INLG 2002",

year = "2002",

language = "English",

booktitle = "Proceedings of the International Natural Language Generation Conference, INLG 2002",