OBDA Sub-Problems

In this post, we explore the objects involved in query answering in an OBDA system. We have introduced what we call an RDF integration system, denoted \(\langle \onto, \rules, \mappings, \extensions \rangle\), where \(\onto\) is an ontology, \(\rules\) is a set of reasoning rules, \(\mappings\) is a set of mappings and \(\extensions\) is a set of RDF extensions.

Mappings

To define an OBDA system, it is convenient to define each mapping as a GLAV mapping of the form:

\[m: q_{1}(\bar x) \leadsto q_{2}(\bar x)\]

In the following, we will see how an intermediate view \(V_{m}\), which defines a new extension set, lets us split the mapping \(m\) into one GAV mapping and one LAV mapping as follows:

a GAV mapping between the source query and the view, \(q_{1}(\bar x) \leadsto V_{m}(\bar x)\);
a LAV mapping between the view and the query on the global schema, \(V_{m}(\bar x) \leadsto q_{2}(\bar x)\).
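The split can be sketched in a few lines of Python. This is only an illustration under assumptions: the `Mapping` class, the `split_glav` function, the tuple encoding of atoms and the `:hasTopic` property are hypothetical names introduced here, not part of any OBDA library.

```python
from dataclasses import dataclass
from typing import Tuple

# An atom is a predicate name with its arguments, e.g. ("Topic", ("doc", "topic")).
Atom = Tuple[str, Tuple[str, ...]]


@dataclass(frozen=True)
class Mapping:
    body: Tuple[Atom, ...]        # q1 for a GLAV mapping
    head: Tuple[Atom, ...]        # q2 for a GLAV mapping
    answer_vars: Tuple[str, ...]  # the variables shared by body and head


def split_glav(name: str, m: Mapping) -> Tuple[Mapping, Mapping]:
    """Split m: q1(x) ~> q2(x) into q1(x) ~> V_m(x) and V_m(x) ~> q2(x)."""
    view_atom: Atom = (f"V_{name}", m.answer_vars)
    gav = Mapping(body=m.body, head=(view_atom,), answer_vars=m.answer_vars)
    lav = Mapping(body=(view_atom,), head=m.head, answer_vars=m.answer_vars)
    return gav, lav


# Hypothetical GLAV mapping: Topic(doc, topic) ~> (doc, :hasTopic, topic)
m = Mapping(
    body=(("Topic", ("doc", "topic")),),
    head=(("triple", ("doc", ":hasTopic", "topic")),),
    answer_vars=("doc", "topic"),
)
gav, lav = split_glav("m", m)
```

The view atom is the only thing the two halves share: it is the head of the GAV mapping and the body of the LAV mapping.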

In this section, we redefine the previous GLAV mappings as GLAV mappings that can span heterogeneous sources, using the GAV-LAV split. Then, we see how to minimize a set of mappings using this new definition.

Definitions of View based GLAV Mappings

Traditionally, in a GLAV mapping \(m\), \(q_{1}\) is a query over a single data source. Consider the following two sources:

\(\mathrm{Collection}(user, doc)\) containing the collection of documents of each user;
\(\mathrm{Topic}(doc, topic)\) containing the topic of each document.

With such mappings, it is not possible to expose the topics of each user's documents without revealing the users' collections. The mapping we would need is the following: \[\mathrm{Collection}(user, doc) \wedge \mathrm{Topic}(doc, topic) \leadsto \triple{user}{\irin{haveDocumentOn}}{topic}\]

Note that the body of this mapping joins two different sources, which the GLAV mapping definition does not support.

A view-based GLAV mappings collection is a pair of mapping sets \(\mappings = (\mappings_{V}, \mappings_{G})\), where \(\mappings_{V}\) is a set of GAV mappings whose bodies are conjunctive queries, each over a single source, called the view definitions of \(\mappings\), and \(\mappings_{G}\) is a set of GLAV mappings whose bodies are conjunctive queries over the predicates appearing in the heads of the mappings of \(\mappings_{V}\), called the GLAV mappings set of \(\mappings\).

We can define a view-based GLAV mappings collection \(\mappings = (\mappings_{V}, \mappings_{G})\) that expresses the previous mapping example. We start with \(\mappings_V\), the view definitions, which contain the following two mappings:

for a view of the \(\mathrm{Collection}\) relation \[\mathrm{Collection}(user, doc) \leadsto V_{C}(user, doc)\]
for a view of the \(\mathrm{Topic}\) relation \[\mathrm{Topic}(doc, topic) \leadsto V_{T}(doc, topic)\]

Then, we define \(\mappings_{G}\), the GLAV mappings set, which contains a single mapping: \[V_{C}(user, doc) \wedge V_{T}(doc, topic) \leadsto \triple{user}{\irin{haveDocumentOn}}{topic}\]
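To make the example concrete, here is a minimal Python sketch that materializes the two views and applies the GLAV mapping over them. The data (`alice`, `d1`, ...) and the function name `apply_glav` are made up for illustration, and the views are simply copies of the source relations, as in the two view definitions above.

```python
# Toy source extensions (made-up data).
collection = {("alice", "d1"), ("bob", "d2")}            # Collection(user, doc)
topic = {("d1", "databases"), ("d1", "semantic web"),    # Topic(doc, topic)
         ("d2", "logic")}

# M_V: each view is defined over exactly one source.
v_c = set(collection)   # Collection(user, doc) ~> V_C(user, doc)
v_t = set(topic)        # Topic(doc, topic)     ~> V_T(doc, topic)


def apply_glav(v_c, v_t):
    """V_C(user, doc) AND V_T(doc, topic) ~> (user, :haveDocumentOn, topic)."""
    return {
        (user, ":haveDocumentOn", t)
        for (user, doc) in v_c
        for (doc2, t) in v_t
        if doc == doc2          # join on the shared variable doc
    }


triples = apply_glav(v_c, v_t)
```

The join happens over the views, not inside a single source query, and the `doc` variable is projected away, so the resulting triples do not expose which documents belong to which user.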

Minimization of View based GLAV Mappings

Inspired by dipintoOptimizingQueryRewriting2013, we can take advantage of the view-based GLAV mapping representation to optimize the mapping set by minimizing it.
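As one possible illustration of what minimization could mean here, the sketch below shares a single view between source queries that are identical up to variable renaming. This is an assumption made for the example, not the procedure of the cited paper: the check is purely syntactic (atoms must appear in the same order), all arguments are treated as variables, every variable is assumed to be exposed by the view, and the names `canonical` and `share_views` are hypothetical.

```python
from typing import Dict, List, Tuple

Atom = Tuple[str, Tuple[str, ...]]
Body = Tuple[Atom, ...]


def canonical(body: Body) -> Body:
    """Rename variables by order of first occurrence (purely syntactic)."""
    renaming: Dict[str, str] = {}
    result = []
    for pred, args in body:
        new_args = []
        for a in args:
            if a not in renaming:
                renaming[a] = f"x{len(renaming)}"
            new_args.append(renaming[a])
        result.append((pred, tuple(new_args)))
    return tuple(result)


def share_views(bodies: List[Body]) -> Tuple[Dict[Body, str], List[str]]:
    """Assign one view name per distinct canonical body."""
    views: Dict[Body, str] = {}
    assignment: List[str] = []
    for body in bodies:
        key = canonical(body)
        if key not in views:
            views[key] = f"V{len(views)}"
        assignment.append(views[key])
    return views, assignment


bodies = [
    (("Collection", ("user", "doc")),),
    (("Collection", ("u", "d")),),       # same query, different variable names
    (("Topic", ("doc", "topic")),),
]
views, assignment = share_views(bodies)
```

In this toy run the first two bodies end up with the same view, so only two view definitions are needed for three source queries. A real minimization would rely on query containment or equivalence rather than this syntactic comparison.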

Mapping Creating Schema

pintoMappingDataHigherOrder
dipintoAcquiringOntologyAxioms2019a: in this article, the authors present mapping-based knowledge bases, which extend the OBDA formalisation by allowing the TBox (DL-Lite_R) to be induced from the data sources through GAV mappings.
giacomoHigherOrderDescriptionLogics: describes the higher-order DL language.

Mappings Generation

Look at the problem of generating mappings from spreadsheets (a question that came up during Duc's defense).
Look at the thesis of Ugo Camignani, who investigates the problem of repairing mappings and rewriting mappings so that they respect some privacy constraints.

Query

Higher Order Query

Non Positive Query

cimaQueriesInequalitiesDLLite

Regular Path Query

OPTIONAL in SPARQL

SPARQL queries built from BGPs and the OPTIONAL operator are not positive queries, in the sense that the negation operator is needed to express them. However, OPTIONAL SPARQL queries are monotonic, which under the open-world assumption (OWA) means that the more positive atoms the KB contains, the more answers the query has, and that the number of its answers is independent of the number of negative atoms in the KB.
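The role of negation can be seen in a small Python sketch that evaluates a query of the form `{ required OPTIONAL { opt } }` as a left outer join over a set of triples. This is a simplified illustration, assuming one triple pattern on each side; the functions `match_pattern` and `optional` and the properties `:owns` and `:topic` are hypothetical names, not the API of a SPARQL engine.

```python
from typing import Dict, List, Set, Tuple

Triple = Tuple[str, str, str]
Binding = Dict[str, str]


def match_pattern(pattern: Triple, graph: Set[Triple], binding: Binding) -> List[Binding]:
    """Extend `binding` in every way that makes `pattern` match a triple."""
    results = []
    for triple in graph:
        new = dict(binding)
        ok = True
        for term, value in zip(pattern, triple):
            if term.startswith("?"):
                # Variable: bind it, or check it agrees with its current value.
                if new.setdefault(term, value) != value:
                    ok = False
                    break
            elif term != value:
                # Constant: must be equal to the triple's term.
                ok = False
                break
        if ok:
            results.append(new)
    return results


def optional(required: Triple, opt: Triple, graph: Set[Triple]) -> List[Binding]:
    """Evaluate { required OPTIONAL { opt } } as a left outer join."""
    answers = []
    for b in match_pattern(required, graph, {}):
        extended = match_pattern(opt, graph, b)
        # The binding b is kept alone only when NO extension exists:
        # this "no match" test is where negation comes in.
        answers.extend(extended if extended else [b])
    return answers


required = ("?u", ":owns", "?d")
opt = ("?d", ":topic", "?t")

graph = {("alice", ":owns", "d1")}
before = optional(required, opt, graph)

graph.add(("d1", ":topic", "logic"))
after = optional(required, opt, graph)
```

In this run, adding a triple does not remove the answer for `alice`; it extends it with a binding for `?t`, so the number of answers does not decrease.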

Update Query

Reasoning

Equality Dependency

Mediation

Optimization

Join ordering