Show simple item record

dc.contributor.advisorLabaka Intxauspe, Gorka ORCID
dc.contributor.advisorGojenola Galletebeitia, Koldobika
dc.contributor.authorHedberg, Linda Emilia
dc.date.accessioned2019-07-18T09:39:03Z
dc.date.available2019-07-18T09:39:03Z
dc.date.issued2019-07-17
dc.date.submitted2019-06-27
dc.identifier.urihttp://hdl.handle.net/10810/34706
dc.description.abstract[EU] Proiektu honetan punta-puntako itzulpen automatiko neuronalaren domeinu egokitzapena aztertu da. Horretarako, suediera-gaztelera bikoterako lau itzultzaile automatiko garatu ditugu: domeinu orokorreko bi sistema (bat itzulpen norabide bakoitzerako) eta domeinura egokitutako beste bi sistema (bat itzulpen norabide bakoitzerako). Gure domeinura egokitutako sistemak Ingeniaritza Zibilerako egokitu ditugu. Domeinu barruko sistemak sortzeko, domeinuz kanpoko datu paraleloez gain, domeinu barruko datu elebakarrak bildu eta erabili ditugu. Testu elebakar horiek automatikoki itzuliz, domeinuko bi corpus paralelo sintetiko sortu ditugu. Guk sortutako lau sistemak neurri automatikoak erabiliz ebaluatu eta Google Translatek sortzen dituen itzulpenekin konparatu ditugu. Gazteleratik suedierara itzultzen duten bi sistemak, gainera, eskuz ebaluatu ditugu post-edizio ataza bat erabiliz, sistemaren produktibitatea neurtuz. Emaitza automatikoek gure lau sistemek antzeko emaitzak sortzen dituztela erakusten dute, guztiak Google Translate nabarmen gainditzen dutelarik. Produktibitate testak, ordea, domeinu-testu zehatzak post-editatzerakoan domeinura egokitutako itzulpen sistemak onuragarriak direla erakusten du.es_ES
dc.description.abstract[EN] This project explores domain adaptation of state-of-the-art neural machine translation. We develop a total of four self-attention-based NMT systems for Swedish-Spanish: two general out-domain systems (one in each translation direction) and two in-domain systems (also one in each translation direction). We adapt our in-domain systems for the Civil Engineering domain. Apart from out-domain parallel data, we collect and use in-domain monolingual data that we translate creating two synthetic parallel in-domain corpora with the original sentences as target language. All four systems are tested and compared with automatic evaluation metrics. The Spanish-to-Swedish systems are also evaluated manually by a post-editing task measuring the systems productivity. Results show that our four systems perform similarly, all of them outperforming significantly Google Translate. The productivity test indicates, however, that post-editing domain-specific text benefits from domain-adapted MT.es_ES
dc.language.isoenges_ES
dc.rightsinfo:eu-repo/semantics/openAccesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/es/
dc.subjectitzulpen automatikoaes_ES
dc.subjectdomeinu egokitzeaes_ES
dc.subjectitzulpenaes_ES
dc.subjectproduktibitatearen ebaluazioaes_ES
dc.subjectmachine translationes_ES
dc.subjectdomain adaptationes_ES
dc.subjectback-translationes_ES
dc.subjectproductivity evaluationes_ES
dc.titleSpanish-Swedish neural machine translation for the civil engineering domaines_ES
dc.typeinfo:eu-repo/semantics/masterThesises_ES
dc.rights.holderAtribución-NoComercial-CompartirIgual 3.0 España
dc.departamentoesLenguajes y sistemas informáticoses_ES
dc.departamentoeuHizkuntza eta sistema informatikoakes_ES


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Atribución-NoComercial-CompartirIgual 3.0 España
Except where otherwise noted, this item's license is described as Atribución-NoComercial-CompartirIgual 3.0 España