Show simple item record

dc.contributor.authorArtetxe Zurutuza, Mikel
dc.contributor.authorLabaka Intxauspe, Gorka ORCID
dc.contributor.authorAgirre Bengoa, Eneko ORCID
dc.date.accessioned2024-10-17T14:04:29Z
dc.date.available2024-10-17T14:04:29Z
dc.date.issued2017
dc.identifier.citationProceedings of the 55th Annual Meeting of the Association for Computational Linguistics 1 : 451-462 (2017)es_ES
dc.identifier.urihttp://hdl.handle.net/10810/69993
dc.description.abstractMost methods to learn bilingual word embeddings rely on large parallel corpora, which is difficult to obtain for most language pairs. This has motivated an active research line to relax this requirement, with methods that use document-aligned corpora or bilingual dictionaries of a few thousand words instead. In this work, we further reduce the need of bilingual resources using a very simple self-learning approach that can be combined with any dictionary-based mapping technique. Our method exploits the structural similarity of embedding spaces, and works with as little bilingual evidence as a 25 word dictionary or even an automatically generated list of numerals, obtaining results comparable to those of systems that use richer resources.es_ES
dc.description.sponsorshipThis research was partially supported by a Google Faculty Award, the Spanish MINECO (TUNER TIN2015-65308-C5-1-R, MUSTER PCIN-2015-226 and TADEEP TIN2015-70214-P, cofunded by EU FEDER), the Basque Government (MODELA KK-2016/00082) and the UPV/EHU (excellence research group). Mikel Artetxe enjoys a doctoral grant from the Spanish MECD.es_ES
dc.language.isoenges_ES
dc.publisherACLes_ES
dc.rightsinfo:eu-repo/semantics/openAccesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/es/*
dc.titleLearning bilingual word embeddings with (almost) no bilingual dataes_ES
dc.typeinfo:eu-repo/semantics/conferenceObjectes_ES
dc.rights.holder(c) 2017 The authors under the Creative Commons Attribution 4.0 International (CC BY 4.0)es_ES
dc.relation.publisherversionhttps://doi.org/10.18653/v1/P17-1042es_ES
dc.identifier.doi10.18653/v1/P17-1042
dc.departamentoesLenguajes y sistemas informáticoses_ES
dc.departamentoeuHizkuntza eta sistema informatikoakes_ES


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

(c) 2017 The authors under the Creative Commons Attribution 4.0 International (CC BY 4.0)
Except where otherwise noted, this item's license is described as (c) 2017 The authors under the Creative Commons Attribution 4.0 International (CC BY 4.0)