Are Experts Needed? On Human Evaluation of Counselling Reflection Generation

Zixiu Wu, Simone Balloccu, Ehud Reiter, Rim Helaoui, Diego Reforgiato Recupero, Daniele Riboni

Research output: Chapter in Book/Report/Conference proceeding › Published conference contribution


Abstract

Reflection is a crucial counselling skill where the therapist conveys to the client their interpretation of what the client said. Language models have recently been used to generate reflections automatically, but human evaluation is challenging, particularly due to the cost of hiring experts. Laypeople-based evaluation is less expensive and easier to scale, but its quality is unknown for reflections. Therefore, we explore whether laypeople can be an alternative to experts in evaluating a fundamental quality aspect: coherence and context-consistency. We do so by asking a group of laypeople and a group of experts to annotate both synthetic reflections and human reflections from actual therapists. We find that both laypeople and experts are reliable annotators and that they have moderate-to-strong inter-group correlation, which shows that laypeople can be trusted for such evaluations. We also discover that GPT-3 mostly produces coherent and consistent reflections, and we explore changes in evaluation results when the source of synthetic reflections changes to GPT-3 from the less powerful GPT-2.
Original language: English
Title of host publication: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Editors: Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Place of Publication: Toronto, Canada
Publisher: Association for Computational Linguistics
Pages: 6906-6930
Number of pages: 25
Publication status: Published - 2023
Event: The 61st Annual Meeting of the Association for Computational Linguistics - Toronto, Canada
Duration: 9 Jul 2023 to 14 Jul 2023
Conference number: 61
https://2023.aclweb.org/

Conference

Conference: The 61st Annual Meeting of the Association for Computational Linguistics
Country/Territory: Canada
City: Toronto
Period: 9/07/23 to 14/07/23
