On demand translation for querying incompletely aligned datasets

Torre-Bastida, Ana I.; Bermúdez de Andrés, Jesús; Illarramendi Echave, María Aránzazu; González, Marta

View/Open

TR 02-2014.pdf (504.0Kb)

Date

2014-12-01

Author

Torre-Bastida, Ana I.

Bermúdez de Andrés, Jesús

Illarramendi Echave, María Aránzazu

González, Marta

Metadata

Show full item record

Estadisticas en RECOLECTA
(LA Referencia)

URI

http://hdl.handle.net/10810/13802

Abstract

More and more users aim at taking advantage of the existing Linked Open Data environment to formulate a query over a dataset and to then try to process the same query over different datasets, one after another, in order to obtain a broader set of answers. However, the heterogeneity of vocabularies used in the datasets on the one side, and the fact that the number of alignments among those datasets is scarce on the other, makes that querying task difficult for them. Considering this scenario we present in this paper a proposal that allows on demand translations of queries formulated over an original dataset, into queries expressed using the vocabulary of a targeted dataset. Our approach relieves users from knowing the vocabulary used in the targeted datasets and even more it considers situations where alignments do not exist or they are not suitable for the formulated query. Therefore, in order to favour the possibility of getting answers, sometimes there is no guarantee of obtaining a semantically equivalent translation. The core component of our proposal is a query rewriting model that considers a set of transformation rules devised from a pragmatic point of view. The feasibility of our scheme has been validated with queries defined in well known benchmarks and SPARQL endpoint logs, as the obtained results confirm.

Collections

Informes técnicos y Documentos de trabajo