Measuring Language Distance of Isolated European Languages
dc.contributor.author | Gamallo, Pablo | |
dc.contributor.author | Pichel Campos, José Ramom | |
dc.contributor.author | Alegría Loinaz, Iñaki | |
dc.date.accessioned | 2020-04-30T18:33:06Z | |
dc.date.available | 2020-04-30T18:33:06Z | |
dc.date.issued | 2020-03-27 | |
dc.identifier.citation | Information 11(4) : (2020) // Article ID 181 | es_ES |
dc.identifier.issn | 2078-2489 | |
dc.identifier.uri | http://hdl.handle.net/10810/42972 | |
dc.description.abstract | Phylogenetics is a sub-field of historical linguistics whose aim is to classify a group of languages by considering their distances within a rooted tree that stands for their historical evolution. A few European languages do not belong to the Indo-European family or are otherwise isolated in the European rooted tree. Although it is not possible to establish phylogenetic links using basic strategies, it is possible to calculate the distances between these isolated languages and the rest using simple corpus-based techniques and natural language processing methods. The objective of this article is to select some isolated languages and measure the distance between them and from the other European languages, so as to shed light on the linguistic distances and proximities of these controversial languages without considering phylogenetic issues. The experiments were carried out with 40 European languages including six languages that are isolated in their corresponding families: Albanian, Armenian, Basque, Georgian, Greek, and Hungarian. | es_ES |
dc.description.sponsorship | This work received financial support from DOMINO project (PGC2018-102041-B-I00, MCIU/AEI/FEDER, UE), eRisk project (RTI2018-093336-B-C21), the Consellería de Cultura, Educación e Ordenación Universitaria (accreditation 2016-2019, ED431G/08, Consolidation and structuring of Groups with Growth Potential: ED431B 2017/39), and the European Regional Development Fund (ERDF). | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | MDPI | es_ES |
dc.relation | info:eu-repo/grantAgreement/MCIU/RTI2018-093336-B-C21 | es_ES |
dc.relation | info:eu-repo/grantAgreement/MCIU/PGC2018-102041-B-I00 | es_ES |
dc.rights | info:eu-repo/semantics/openAccess | es_ES |
dc.rights.uri | http://creativecommons.org/licenses/by/3.0/es/ | |
dc.subject | language distance | es_ES |
dc.subject | phylogenetics | es_ES |
dc.subject | perplexity | es_ES |
dc.subject | clustering | es_ES |
dc.subject | kullback leibler divergence | es_ES |
dc.title | Measuring Language Distance of Isolated European Languages | es_ES |
dc.type | info:eu-repo/semantics/article | es_ES |
dc.date.updated | 2020-04-28T13:42:56Z | |
dc.rights.holder | © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). | es_ES |
dc.relation.publisherversion | https://www.mdpi.com/2078-2489/11/4/181 | es_ES |
dc.identifier.doi | 10.3390/info11040181 | |
dc.departamentoes | Arquitectura y Tecnología de Computadores | |
dc.departamentoeu | Konputagailuen Arkitektura eta Teknologia |
Files in this item
This item appears in the following Collection(s)
Except where otherwise noted, this item's license is described as © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).