Search

Now showing items 1-10 of 49

Multilingual word embeddings and their utility in cross-lingual learning

Kulmizev, Artur (2018-10-15)

Word embeddings - dense vector representations of a word’s distributional semantics - are an indespensable component of contemporary natural language processing (NLP). Bilingual embeddings, in particular, have attracted ...

Automating the anonymisation of textual corpora

García Sardiña, Laura (2018-11-04)

[EU] Gaur egun, testu berriak etengabe sortzen doaz sare sozialetako mezu, osasun-txosten, dokumentu o zial eta halakoen ondorioz. Hala ere, testuok informazio pertsonala baldin badute, ezin dira ikerkuntzarako edota ...

Noisy speech recognition using Kaldi and neural architectures

González Docasal, Ander (2018-02)

[EN]Noisy Speech Recognition using Kaldi and Neural Architectures ABSTRACT The goal of an Automatic Speech Recognition (ASR) system is to transform a set of acoustic features into a sequence of words. It mainly consists ...

to post-edit or to translate ... That is the question: a case study of a recommender system for Quality Estimation of Machine Translation based on linguistic features

De Gibert Bonet, Ona (2018)

[EN]The implementation of a machine translation system into production is not enough to warrant its efficient use. There exists the need to know when it is profitable to use machine translation as opposed to translating ...

Exploring metrics for post-editing effort: and their ability to detect errors in machine translated output

Cumbreño Díez, Cristina (2019-07-17)

As more companies integrate machine translation (MT) systems into their translation workflows, it becomes increasingly relevant to accurately measure post-editing (PE) effort. In this paper we explore how different types ...

End to end approach for i2b2 2012 challenge based on Cross-lingual models

Santamaría, Edgar Andrés (2020-11-26)

BACKGROUND - We propose a Cross-lingual approach to i2b2 2012 challenge for Clinical Records focused on the temporal relations in clinical narratives. Corpus of discharge summaries annotated with temporal information was ...

Neural natural language generation with unstructured contextual information

Gete Ugarte, Harritxu (2018-11-04)

[EU] Lan honetan, hizkuntza naturalaren sorrera automatikoan informazio ez-egituratuaren esplotazioak izan dezakeen eragina aztertzen da. Bere helburu nagusia, sistema batek aurrez ikusi gabeko informazioa erabiliz testu ...

Unsupervised methods to predict example difficulty in word sense annotation

Aceta Moreno, Cristina (2018-06)

[EU]Hitzen Adiera Desanbiguazioa (HAD) Hizkuntzaren Prozesamenduko (HP) erronkarik handienetakoa da. Frogatu denez, HAD sistema ahalik eta arrakastatsuenak entrenatzeko, oso garrantzitsua da entrenatze-datuetatik adibide ...

Distributional semantics and machine learning for statistical machine translation

Artetxe Zurutuza, Mikel (2016-05-24)

[EU]Lan honetan semantika distribuzionalaren eta ikasketa automatikoaren erabilera aztertzen dugu itzulpen automatiko estatistikoa hobetzeko. Bide horretan, erregresio logistikoan oinarritutako ikasketa automatikoko eredu ...

Ebaluatoia: crowd evaluation of English-Basque machine translation

Aranberri Monasterio, Nora (2016-04-08)

[EU]Lan honetan Ebaluatoia aurkezten da, eskala handiko ingelesa-euskara itzulpen automatikoko ebaluazio kanpaina, komunitate-elkarlanean oinarritua. Bost sistemaren itzulpen kalitatea konparatzea izan da kanpainaren ...

1
2
3
4
. . .
5