Word n-gram attention models for sentence similarity and inference
Date
2019-04-22
Expert Systems with Applications 132 : 1-11 (2019)
Abstract
Semantic Textual Similarity and Natural Language Inference are two popular natural language understanding tasks used to benchmark sentence representation models in which two sentences are paired. In such tasks sentences are represented as bags of words, sequences, trees, or convolutions, but the attention model is based on word pairs. In this article we introduce the use of word n-grams in the attention model. Our results on five datasets show an error reduction of up to 41% with respect to the word-based attention model. The improvements are especially relevant in low-data regimes and, in the case of natural language inference, on the recently released hard subsets of Natural Language Inference datasets.
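The core idea of moving attention from word pairs to n-gram pairs can be sketched as follows. This is a minimal illustration, not the paper's exact model: it assumes n-gram vectors are built by averaging word embeddings over a sliding window and that attention weights come from softmax-normalized dot products; the paper's actual composition and scoring functions may differ.

```python
import numpy as np

def ngram_vectors(word_vecs, n):
    """Build n-gram representations by averaging word embeddings
    over a sliding window of size n (an assumed composition)."""
    return np.array([word_vecs[i:i + n].mean(axis=0)
                     for i in range(len(word_vecs) - n + 1)])

def ngram_attention(sent_a, sent_b, n=2):
    """Dot-product attention between the n-grams of two sentences,
    softmax-normalized so each n-gram of A distributes weight 1
    over the n-grams of B."""
    a = ngram_vectors(sent_a, n)
    b = ngram_vectors(sent_b, n)
    scores = a @ b.T  # similarity of every n-gram pair
    e = np.exp(scores - scores.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

# Toy usage: a 5-word and a 6-word sentence with 8-dim embeddings
rng = np.random.default_rng(0)
att = ngram_attention(rng.normal(size=(5, 8)), rng.normal(size=(6, 8)), n=2)
print(att.shape)  # (4, 5): 4 bigrams in A attend over 5 bigrams in B
```

Setting n=1 recovers the standard word-pair attention that the paper uses as its baseline.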