Towards a top-down approach for an automatic discourse analysis for Basque: Segmentation and Central Unit detection tool

Atutxa Salazar, Aitziber; Bengoetxea Kortazar, Kepa Xabier; Díaz de Ilarraza Sánchez, Arantza; Iruskieta Quintian, Mikel

dc.contributor.author	Atutxa Salazar, Aitziber
dc.contributor.author	Bengoetxea Kortazar, Kepa Xabier
dc.contributor.author	Díaz de Ilarraza Sánchez, Arantza
dc.contributor.author	Iruskieta Quintian, Mikel
dc.date.accessioned	2020-01-10T11:49:10Z
dc.date.available	2020-01-10T11:49:10Z
dc.date.issued	2019-09-04
dc.identifier.citation	Plos One 14(9) : (2019) // Article ID e0221639	es_ES
dc.identifier.issn	1932-6203
dc.identifier.uri	http://hdl.handle.net/10810/37574
dc.description.abstract	Lately, discourse structure has received considerable attention due to the benefits its application offers in several NLP tasks such as opinion mining, summarization, question answering, text simplification, among others. When automatically analyzing texts, discourse parsers typically perform two different tasks: i) identification of basic discourse units (text segmentation) ii) linking discourse units by means of discourse relations, building structures such as trees or graphs. The resulting discourse structures are, in general terms, accurate at intra-sentence discourse-level relations, however they fail to capture the correct inter-sentence relations. Detecting the main discourse unit (the Central Unit) is helpful for discourse analyzers (and also for manual annotation) in improving their results in rhetorical labeling. Bearing this in mind, we set out to build the first two steps of a discourse parser following a top-down strategy: i) to find discourse units, ii) to detect the Central Unit. The final step, i.e. assigning rhetorical relations, remains to be worked on in the immediate future. In accordance with this strategy, our paper presents a tool consisting of a discourse segmenter and an automatic Central Unit detector.	es_ES
dc.description.sponsorship	This study was carried out within the framework of the following projects: IXA Group: natural language processing IT1343-19 (Basque Government), DL4NLP KK-2019/00045 (Basque Government), PROSA-MED TIN2016-77820-C3-1-R (MINECO) and DeepReading: RTI2018-096846-B-C21 (MCIU/AEI/FEDER, UE).	es_ES
dc.language.iso	eng	es_ES
dc.publisher	Public Library Science	es_ES
dc.relation	info:eu-repo/grantAgreement/MINECO/PROSA-MED TIN2016-77820-C3-1-R	es_ES
dc.rights	info:eu-repo/semantics/openAccess	es_ES
dc.rights.uri	http://creativecommons.org/licenses/by/3.0/es/	*
dc.subject	Brazilian Portuguese	es_ES
dc.title	Towards a top-down approach for an automatic discourse analysis for Basque: Segmentation and Central Unit detection tool	es_ES
dc.type	info:eu-repo/semantics/article	es_ES
dc.rights.holder	This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Attribution 4.0 International (CC BY 4.0)	es_ES
dc.rights.holder	Atribución 3.0 España	*
dc.relation.publisherversion	https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0221639	es_ES
dc.identifier.doi	10.1371/journal.pone.0221639
dc.departamentoes	Didáctica de la Lengua y la Literatura	es_ES
dc.departamentoes	Lenguajes y sistemas informáticos	es_ES
dc.departamentoeu	Hizkuntza eta sistema informatikoak	es_ES
dc.departamentoeu	Hizkuntzaren eta literaturaren didaktika	es_ES

Files in this item

Name:: journal.pone.0221639.pdf
Size:: 1.879Mb
Format:: PDF
Description:: Artículo principal

View/Open

Name:: license_rdf
Size:: 914bytes
Format:: application/rdf+xml

View/Open

This item appears in the following Collection(s)

Artículos

Show simple item record

This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Attribution 4.0 International (CC BY 4.0)

Except where otherwise noted, this item's license is described as This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Attribution 4.0 International (CC BY 4.0)