Vis enkel innførsel

dc.contributor.authorStuhr, Magnuseng
dc.date.accessioned2012-08-03T06:55:04Z
dc.date.available2012-08-03T06:55:04Z
dc.date.issued2012-05-31eng
dc.date.submitted2012-05-31eng
dc.identifier.urihttp://hdl.handle.net/1956/5893
dc.description.abstractThe Resource Description Framework (RDF) is the W3C recommended standard for data on the semantic web, while the SPARQL Protocol and RDF Query Language (SPARQL) is the query language that retrieves RDF triples by subject, predicate, or object. RDF data often contain valuable information that can only be queried through filter functions. The SPARQL query language for RDF can include filter clauses in order to define specific data criteria, such as full-text searches, numerical filtering, and constraints and relationships between data resources. However, the downside of executing SPARQL filter queries is the frequently slow query execution times. Due to the fact that SPARQL filter queries can retrieve information that non-filter SPARQL queries cannot, decreasing the query execution time of SPARQL filter queries will greatly enhance the efficiency of the SPARQL query language. This thesis presents a SPARQL filter query processing engine for conventional triplestores called FILT (Filtering Indexed Lucene Triples), which is built on top of the Apache Lucene framework for storing and retrieving indexed documents. The objective of FILT was to decrease the query execution time of SPARQL filter queries. This was evaluated by performing a benchmark test of FILT compared to the Joseki triplestore, focusing on two different use-cases; SPARQL regular expression filtering in medical data, and SPARQL numerical/logical filtering of geo-coordinates in geographical locations.en_US
dc.format.extent1677841 byteseng
dc.format.mimetypeapplication/pdfeng
dc.language.isoengeng
dc.publisherThe University of Bergeneng
dc.subjectRDF full-text searcheng
dc.subjectSPARQL filter querieseng
dc.subjectSPARQL regex filter querieseng
dc.subjectSPARQL numerical filter querieseng
dc.subjectRDF data indexingeng
dc.subjectLuceneeng
dc.titleFILT - Filtering Indexed Lucene Tripleseng
dc.typeMaster thesisen_US
dc.description.localcodeINFO390
dc.description.localcodeMASV-INFO
dc.subject.nus735115eng
dc.subject.nsiVDP::Social science: 200::Media science and journalism: 310eng
fs.subjectcodeINFO390


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel