Show simple item record

dc.contributor.authorVázquez Risco, Alain
dc.contributor.authorRamírez, Angela María
dc.contributor.authorPullabhotla, Neha
dc.contributor.authorQiang, Nan
dc.contributor.authorZhang, Haoran
dc.contributor.authorWalker, Marilyn
dc.contributor.authorTorres Barañano, María Inés ORCID
dc.date.accessioned2025-03-19T13:02:18Z
dc.date.available2025-03-19T13:02:18Z
dc.date.issued2024-09
dc.identifier.citation25th Annual Meeting of the Special Interest Group on Discourse and Dialogue. Proceedings : 78-91 (2024)es_ES
dc.identifier.urihttp://hdl.handle.net/10810/73031
dc.description.abstractOpen domain spoken dialogue systems need to controllably generate many different dialogue acts (DAs) to allow Natural Language Generation (NLG) to create interesting and engaging conversational interactions with users. We aim to create an NLG engine that can produce a variety of DAs that make substantive knowledge-grounded contributions to a conversation. Training such an NLG typically requires dialogue corpora that are labelled for DAs, which are expensive to produce and vulnerable to quality issues. Here, we present a prompt-based learning approach to transfer DAs from one domain, video games, to 7 new domains. For each novel domain, we first crawl WikiData to create Meaning Representations that systematically vary both the number of attributes and hops on the WikiData Knowledge Graph. The proposed method involves a self-training step to create prompt examples for each domain followed by an overgeneration and ranking step. The result is a novel, high-quality dataset, Wiki-Dialogue, of 71K knowledge-grounded utterances, covering 9 DAs and the Art, Movies, Music, Sports, TV, Animal, and Boardgames domains, whose combined DA and semantic accuracy is 89%. We assess the corpus quality using both automatic and human evaluations and find it high. The corpus is found to be safe, lexically rich, and large in vocabulary, when compared to similar datasets.es_ES
dc.language.isoenges_ES
dc.publisherAssociation for Computational Linguisticses_ES
dc.rightsinfo:eu-repo/semantics/openAccesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.subjectnatural language generationes_ES
dc.subjectdialogue systemses_ES
dc.subjectdatasetes_ES
dc.subjectmeaning representationes_ES
dc.titleKnowledge-grounded dialogue act transfer using prompt-based learning for controllable open-domain NLGes_ES
dc.typeinfo:eu-repo/semantics/conferenceObjectes_ES
dc.rights.holderMaterials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License.es_ES
dc.relation.publisherversionhttps://aclanthology.org/2024.sigdial-1.7/es_ES
dc.identifier.doi10.18653/v1/2024.sigdial-1.7
dc.departamentoesElectricidad y electrónicaes_ES
dc.departamentoeuElektrizitatea eta elektronikaes_ES


Files in this item

No Thumbnail [100%x80]

This item appears in the following Collection(s)

Show simple item record

Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License.
Except where otherwise noted, this item's license is described as Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License.