Showing items 1-10 of 61
Multilingual word embeddings and their utility in cross-lingual learning
(2018-10-15)
Word embeddings - dense vector representations of a word’s distributional semantics - are an indispensable component of contemporary natural language processing (NLP). Bilingual embeddings, in particular, have attracted ...
Automating the anonymisation of textual corpora
(2018-11-04)
[EU] Nowadays, new texts are constantly being created as a result of social media messages, health reports,
official documents and the like. However, if these texts contain personal information,
they cannot, for research or ...
Noisy speech recognition using Kaldi and neural architectures
(2018-02)
[EN] The goal of an Automatic Speech Recognition (ASR) system is to transform a set of acoustic features into a sequence of words. It mainly consists ...
To post-edit or to translate ... That is the question: a case study of a recommender system for Quality Estimation of Machine Translation based on linguistic features
(2018)
[EN] Putting a machine translation system into production is not enough to guarantee its efficient use. There is a need to know when it is profitable to use machine translation as opposed to translating ...
Gazteak eta euskara sare sozialetan. Zer, nori, nork: euskarazko txio formal eta informalak sailkatuz eta konparatuz
(2018-11-09)
[EU] The constant development of new technologies has changed the ways in which people communicate with one another. As a result, it is increasingly common to use social networks in everyday life, communication without any kind of limits ...
Exploring metrics for post-editing effort: and their ability to detect errors in machine translated output
(2019-07-17)
As more companies integrate machine translation (MT) systems into their translation workflows, it becomes increasingly relevant to accurately measure post-editing (PE) effort. In this paper we explore how different types ...
Tex2kor: sekuentziatik sekuentziarako euskararako korreferentzia-ebazpena
(2020-02-25)
[EU] Coreference resolution is the task of identifying that two mentions in a text refer to the same real-world entity. In this work, a new approach to treating coreference resolution as a sequence-to-sequence task ...
BertsoBot: lehen urratsak
(2012)
[EU] Using language processing techniques, first steps have been taken in automatic poetry generation. To achieve this, searches based on corpus processing have been used, both plain searches and also ...
Coping with Data Scarcity: First Steps towards Word Expansion for a Chatbot in the Urban transportation Domain
(2020-11-26)
In some areas of Natural Language Processing (NLP), words have traditionally been used
in the development of expansion techniques, for example in Information Retrieval (IR) or
Question Answering (QA) systems. Master's thesis ...
End to end approach for i2b2 2012 challenge based on Cross-lingual models
(2020-11-26)
BACKGROUND - We propose a cross-lingual approach to the i2b2 2012 challenge for clinical
records, focused on the temporal relations in clinical narratives. A corpus of discharge
summaries annotated with temporal information was ...