SOFOS: Demonstrating the Challenges of Materialized View Selection on Knowledge Graphs

Georgia Troullinou, Haridimos Kondylakis, Matteo Lissandrini, Davide Mottin

SIGMOD 2021

TL;DR

Perform analytical (e.g., aggregate) queries over large knowledge graph (RDF) data. We present a system, SOFOS that

  • Addresses the problem of providing fast query answering for analytical queries on KGs
  • Provides a generic solution to be deployed on any RDF triple store with SPARQL query processing
  • Highlights possible limitations of six alternative approaches.

Abstract

Analytical queries over RDF data are becoming prominent as a result of the proliferation of knowledge graphs. Yet, RDF databases are not optimized to perform such queries efficiently, leading to long processing times. A well known technique to improve the performance of analytical queries is to exploit materialized views.Although popular in relational databases, view materialization for RDF and SPARQL has not yet transitioned into practice, due to the non-trivial application to the RDF graph model. Motivated by a lack of understanding of the impact of view materialization alternatives for RDF data, we demonstrate Sofos, a system that implements and compares several cost models for view materialization. Sofos is, to the best of our knowledge, the first attempt to adapt cost models, initially studied in relational data, to the generic RDF setting, and to propose new ones, analyzing their pitfalls and merits. Sofos takes an RDF dataset and an analytical query for some facet in the data, and compares and evaluates alternative cost models, displaying statistics and insights about time, memory consumption, and query characteristics.