Search
Now showing items 1-10 of 49
Multilingual word embeddings and their utility in cross-lingual learning
(2018-10-15)
Word embeddings - dense vector representations of a word’s distributional semantics - are an indespensable component of contemporary natural language processing (NLP). Bilingual embeddings, in particular, have attracted ...
Automating the anonymisation of textual corpora
(2018-11-04)
[EU] Gaur egun, testu berriak etengabe sortzen doaz sare sozialetako mezu, osasun-txosten,
dokumentu o zial eta halakoen ondorioz. Hala ere, testuok informazio pertsonala baldin
badute, ezin dira ikerkuntzarako edota ...
Noisy speech recognition using Kaldi and neural architectures
(2018-02)
[EN]Noisy Speech Recognition using Kaldi and Neural Architectures ABSTRACT The goal of an Automatic Speech Recognition (ASR) system is to transform a set of acoustic features into a sequence of words. It mainly consists ...
to post-edit or to translate ... That is the question: a case study of a recommender system for Quality Estimation of Machine Translation based on linguistic features
(2018)
[EN]The implementation of a machine translation system into production is not enough to warrant its efficient use. There exists the need to know when it is profitable to use machine translation as opposed to translating ...
Exploring metrics for post-editing effort: and their ability to detect errors in machine translated output
(2019-07-17)
As more companies integrate machine translation (MT) systems into their translation workflows, it becomes increasingly relevant to accurately measure post-editing (PE) effort. In this paper we explore how different types ...
End to end approach for i2b2 2012 challenge based on Cross-lingual models
(2020-11-26)
BACKGROUND - We propose a Cross-lingual approach to i2b2 2012 challenge for Clinical
Records focused on the temporal relations in clinical narratives. Corpus of discharge
summaries annotated with temporal information was ...
Neural natural language generation with unstructured contextual information
(2018-11-04)
[EU] Lan honetan, hizkuntza naturalaren sorrera automatikoan informazio ez-egituratuaren esplotazioak izan dezakeen eragina aztertzen da. Bere helburu nagusia, sistema batek aurrez ikusi gabeko informazioa erabiliz testu ...
Unsupervised methods to predict example difficulty in word sense annotation
(2018-06)
[EU]Hitzen Adiera Desanbiguazioa (HAD) Hizkuntzaren Prozesamenduko (HP) erronkarik handienetakoa da. Frogatu denez, HAD sistema ahalik eta arrakastatsuenak entrenatzeko, oso garrantzitsua da entrenatze-datuetatik adibide ...
Distributional semantics and machine learning for statistical machine translation
(2016-05-24)
[EU]Lan honetan semantika distribuzionalaren eta ikasketa automatikoaren erabilera aztertzen
dugu itzulpen automatiko estatistikoa hobetzeko. Bide horretan, erregresio logistikoan
oinarritutako ikasketa automatikoko eredu ...
Ebaluatoia: crowd evaluation of English-Basque machine translation
(2016-04-08)
[EU]Lan honetan Ebaluatoia aurkezten da, eskala handiko ingelesa-euskara itzulpen automatikoko ebaluazio kanpaina, komunitate-elkarlanean oinarritua. Bost sistemaren itzulpen kalitatea konparatzea izan da kanpainaren ...