Buscar

Mostrando ítems 1-10 de 61

Multilingual word embeddings and their utility in cross-lingual learning

Kulmizev, Artur (2018-10-15)

Word embeddings - dense vector representations of a word’s distributional semantics - are an indespensable component of contemporary natural language processing (NLP). Bilingual embeddings, in particular, have attracted ...

Automating the anonymisation of textual corpora

García Sardiña, Laura (2018-11-04)

[EU] Gaur egun, testu berriak etengabe sortzen doaz sare sozialetako mezu, osasun-txosten, dokumentu o zial eta halakoen ondorioz. Hala ere, testuok informazio pertsonala baldin badute, ezin dira ikerkuntzarako edota ...

Noisy speech recognition using Kaldi and neural architectures

González Docasal, Ander (2018-02)

[EN]Noisy Speech Recognition using Kaldi and Neural Architectures ABSTRACT The goal of an Automatic Speech Recognition (ASR) system is to transform a set of acoustic features into a sequence of words. It mainly consists ...

to post-edit or to translate ... That is the question: a case study of a recommender system for Quality Estimation of Machine Translation based on linguistic features

De Gibert Bonet, Ona (2018)

[EN]The implementation of a machine translation system into production is not enough to warrant its efficient use. There exists the need to know when it is profitable to use machine translation as opposed to translating ...

Gazteak eta euskara sare sozialetan. Zer, nori, nork: euskarazko txio formal eta informalak sailkatuz eta konparatuz

Fernández de Landa, Joseba (2018-11-09)

[EU]Teknologia berrien etengabeko garapenak aldaketak eragin ditu gizakion arteko komunikazio moduetan. Honela, geroz eta ohikoagoa da sare sozialak eguneroko bizitzan erabiltzea, inolako mugarik gabeko komunikazioa ...

Exploring metrics for post-editing effort: and their ability to detect errors in machine translated output

Cumbreño Díez, Cristina (2019-07-17)

As more companies integrate machine translation (MT) systems into their translation workflows, it becomes increasingly relevant to accurately measure post-editing (PE) effort. In this paper we explore how different types ...

Tex2kor: sekuentziatik sekuentziarako euskararako korreferentzia-ebazpena

Urbizu Garmendia, Gorka (2020-02-25)

[EU]Korreferentzia-ebazpena testuko bi aipamenek mundu errealeko entitate bera erreferentziatzen dutela identi katzeari deritzo. Lan honetan, korreferentzia-ebazpena sekuentziatik sekuentziara lantzeko hurbilpen berri bat ...

BertsoBot: lehen urratsak

Agirrezabal Zabaleta, Manex (2012)

[EU]Hizkuntzaren prozesamenduko teknikak erabilita, poesia-sorkuntza automatikoan lehen urratsak eman dira. Hau erdiesteko, corpusen prozesamenduan oinarritutako bilaketak erabili dira, bai bilaketa arruntak eta baita ...

Coping with Data Scarcity: First Steps towards Word Expansion for a Chatbot in the Urban transportation Domain

García Montero, Eneritz (2020-11-26)

Hizkuntzaren Prozesamenduan (HP) zenbait arlotan hitzak erabili izan dira tradizionalki zabaltze-tekniken garapenean, hala nola Informazioaren Berreskurapenean (IB) edota Galdera-Erantzun (GE) sistemetan. Master tesi ...

End to end approach for i2b2 2012 challenge based on Cross-lingual models

Santamaría, Edgar Andrés (2020-11-26)

BACKGROUND - We propose a Cross-lingual approach to i2b2 2012 challenge for Clinical Records focused on the temporal relations in clinical narratives. Corpus of discharge summaries annotated with temporal information was ...

1
2
3
4
. . .
7