Abstract
This work explores an important dimension of variation in the language used by dialogue participants: their age. While previous work showed differences at various linguistic levels between age groups when experimenting with written discourse data (e.g., blog posts), previous work on dialogue has largely been limited to acoustic information related to voice and prosody. Detecting fine-grained linguistic properties of human dialogues is of crucial importance for developing AI-based conversational systems which are able to adapt to their human interlocutors. We therefore investigate whether, and to what extent, current text-based NLP models can detect such linguistic differences, and what the features driving their predictions are. We show that models achieve a fairly good performance on age-group prediction, though the task appears to be more challenging compared to discourse. Through in-depth analysis of the best models’ errors and the most predictive cues, we show that, in dialogue, differences among age groups mostly concern stylistic and lexical choices. We believe these findings can inform future work on developing controlled generation models for adaptive conversational systems.
Original language | English |
---|---|
Title of host publication | Proceedings of the Eighth Italian Conference on Computational Linguistics CliC-it 2021 |
Editors | Elisabetta Fersini, Marco Passarotti, Viviana Patti |
Place of Publication | Torino |
Publisher | Accademia University Press |
Pages | 184-190 |
Number of pages | 8 |
ISBN (Electronic) | 9791280136947 |
DOIs | |
Publication status | Published - 20 Oct 2022 |
Event | CLiC-it 2021 Italian Conference on Computational Linguistics 2021 - Milan, Italy Duration: 29 Jun 2021 → 1 Jul 2021 https://clic2021.disco.unimib.it/ |
Publication series
Name | Collana dell'Associazione Italiana di Linguistica Computazionale |
---|---|
Publisher | Accademia University Press |
ISSN (Electronic) | 2531-4548 |
Conference
Conference | CLiC-it 2021 Italian Conference on Computational Linguistics 2021 |
---|---|
Abbreviated title | CLiC-it 2021 |
Country/Territory | Italy |
City | Milan |
Period | 29/06/21 → 1/07/21 |
Internet address |
Bibliographical note
This work received funding from the University of Amsterdam’s Research Priority Area Human(e) AI and from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 819455).Code and data available at: https://github.com/lennertjansen/detecting-age-in-dialogue
Fingerprint
Dive into the research topics of 'Detecting Age-Related Linguistic Patterns in Dialogue: Toward Adaptive Conversational Systems'. Together they form a unique fingerprint.Datasets
-
Data and code from: Detecting Age-Related Linguistic Patterns in Dialogue: Toward Adaptive Conversational Systems
Sinclair, A. (Creator), University of Aberdeen, Jun 2022
DOI: 10.20392/efb3885c-9833-4b4d-bbfb-af2d6e9c11ed, https://github.com/lennertjansen/detecting-age-in-dialogue
Dataset