Development/Tutorials/Metadata/Nepomuk/ChatLogger: Difference between revisions

From KDE TechBase
(Filled up the Use Case)
No edit summary
Line 46: Line 46:


=== Getting Started ===
=== Getting Started ===
Link the basic Nepomuk testing Template and explain parts of it
 
The simplest way to get started would be to use an existing Nepomuk template. There exists a test template over [https://projects.kde.org/projects/kde/kdeexamples/repository/revisions/master/show/nepomuk/test here].
 
That template contains more headers than are actually required. Don't use it in the real world!


=== Creating a Resource ===
=== Creating a Resource ===


In Nepomuk everything is stored as a Resource.  
In Nepomuk, a <i>Resource</i> is one of the fundamental building blocks.  A resource is basically a unique URI ( Uniform Resource Identifier ) which identifies the resource, and it contains many properties.
 
All of this is stored in the Nepomuk Repository in statements of the form - <code>Subject Predicate Object</code>


==== Nepomuk::Resource ====
==== Nepomuk::Resource ====


The most common way to create or manipulate a resource is using the Nepomuk::Resource class.  
The most common way to create or manipulate a resource is using the Nepomuk::Resource class -
 
<code>
Nepomuk::Resource res;
res.addType( Nepomuk::Vocabulary::NMO::IMMessage() );
</code>
 
This would initially create a empty resource, and then add the type 'IMMessage' to it. In programming languages, you need to mention the type before creation of an object. Here a resource needs to have its type set as a property.
 
Internally it would use the <i>rdf:type</i> property to store the type. One could also add the type nmo:Message, but since <i>nmo:IMMessage</i> is a subclass of <i>nmo:Message</i>, it's automatically implied.
 
Nepomuk automatically assigns a unique uri to each Nepomuk Resource. It is in form of '''nepomuk:/res/32-bit-uuid'''
 
An example - <br>
date1 '''Bob:''' Hi!<br>
date2 '''Mary:''' Good Evening Bob.<br>
 
Here we have 2 messages. If we had to store this information in Nepomuk, we would need 2 distinct resources. One for each message.
 
<code>
Nepomuk::Resource mes1;
mes1.addType( Nepomuk::Vocabulary::NMO::IMMessage() );


Show code
mes1.setProperty( Nepomuk::Vocabulary::NMO::plainTextMessageContent(), "Hi!" );
mes1.setProperty( Nepomuk::Vocabulary::NMO::isRead(), false );
mes1.setProperty( Nepomuk::Vocabulary::NMO::receivedDate(), QDateTime( date1 ) );


The next step would obviously be of adding the required properties
Nepomuk::Resource mes2;
mes2.addType( Nepomuk::Vocabulary::NMO::IMMessage() );


more code
mes2.setProperty( Nepomuk::Vocabulary::NMO::plainTextMessageContent(), "Good Evening Bob." );
mes2.setProperty( Nepomuk::Vocabulary::NMO::isRead(), false );
mes2.setProperty( Nepomuk::Vocabulary::NMO::receivedDate(), QDateTime( date2 ) );


If we look at the ontology specification - The nmo:toMessage property has a nco:Contact in it's range
</code>
    < exact line >


So we need to creates a separate Resource for each Contact. Suppose Bob and Mary are chatting.
If we look at the ontology specification - The nmo:toMessage and nmo:fromMessage property has a nco:ContactMedium in its range


Bob: Hi!
<code>
Mary: Good Evening Bob.
    nmo:messageFrom
          a      rdf:Property ;
          rdfs:comment "The sender of the message" ;
          rdfs:domain nmo:Message ;
          rdfs:label "from" ;
          rdfs:range nco:ContactMedium .
</code>


Here we would need 2 resources of the type nco:Contact and 2 resources of type nmo:IMMessage ie one for each message.
So, this means that we need some kind of nco:ContactMedium for Bob and Mary. At this point you should probably try to read through the Nepomuk Contact Ontology.


For a quick summary - nco:Medium is a generic base class for
            nco:ContactMedium
        ____/      |    \__________
        /          |                \
    nco:IMAccount  nco:PhoneNumber nco:EmailAddress
Since we are dealing with IM chat logging, we would need an nco:IMAccount for both Bob and Mary.
< code to create a Contact >
< code to create a Contact >


Line 79: Line 124:
==== The Resource Generator ====
==== The Resource Generator ====


In our case ( chat logger ) we know exactly what kind of resources we want - nmo:IMMessage. So it would be a lot easier for us to generate C++ classes for them, so we don't have to use the cumbersome addProperty( uri, value ) method.
Most of the given code is horribly verbose and long. Plus, it's not really the C++ way of doing things. So it would be a lot easier for us to generate C++ classes for the ontologies that we use, so we don't have to use the cumbersome addProperty( uri, value ) method.
 
This can be done by adding the following code to the CMakeLists.txt
 
<code>
nepomuk_add_ontology_classes(
  SRCS
  ONTOLOGIES
  ${SHAREDDESKTOPONTOLOGIES_ROOT_DIR}/nie/nmo.trig
  )
 
nepomuk_add_ontology_classes(
  SRCS
  ONTOLOGIES
  ${SHAREDDESKTOPONTOLOGIES_ROOT_DIR}/nie/nco.trig
  )
</code>
 
Both NMO and NCO are part of the NIE Ontologies. They are generally installed in the /usr/share/ontology/ folder by the shared-desktop-ontologies project.
 
This code calls the CMake macro nepomuk_add_ontology_classes, which essentially creates C++ classes for each class in the specified ontology.
 
On running this, there should be header and source files for each class in the ontology.


Show the CMake Code


Show the same code but using the Resource Generator
Show the same code but using the Resource Generator

Revision as of 13:11, 9 February 2011


Development/Tutorials/Metadata/Nepomuk/ResourceGenerator


Creating a ChatLogger using Nepomuk
Tutorial Series   Nepomuk
Previous  
What's Next  
Further Reading  


Rough Basics

In Nepomuk, ontologies are very important. For a semi-decent analogy - An ontology is like a class specification. You get to define exactly which properties, sub-classes and super-classes a class has. Every object of that class, in Nepomuk terms, is called a Resource.

In the linked data world, properties are also a special kind of class. Unlike conventional programming languages where a property is limited to that class and its descendants. In linked data, each property has a domain and range, indicating which classes it can be mapped to. Properties can also be derived from to have sub-properties.

Use Case

The first thing one generally does when storing some information in a database is to decide what data should be stored. In the case of a chat logger, what we need is -

  • From
  • To
  • Time stamp
  • Message Content
  • Status of the message

Additionally, we would require the database to be persistent and query-able. Nepomuk provides that.

Finding the correct Ontology

The first thing one generally needs to know when working on some Nepomuk related project, is how is the data going to be stored? To be specific - which ontology would be required?

Ontologies already exist for common use cases like Email, Messaging, Notes, etc. In the case of a chat logging system we would want to take a look at NMO - The Nepomuk Messaging Ontology. ( Add a link )

The easiest way to know the exact contents of the ontology is to read its trig file - Link. On inspection we realize that there exists several classes -

        nmo:Message
        /      \ 
       /        \
 nmo:Email   nmo:IMMessage

Here nmo:IMMessage is a sub class of nmo:Message. On further inspection of the properties of nmo:Message, we notice it has properties like nmo:messageTo, nmo:messageFrom, nmo:isRead, nmo:receivedDate, and nmo:plainTextMessageContent

These properties depict exactly what we need in the case of the chat logger. So we know exactly how we're going to store the data.

Implementation

Getting Started

The simplest way to get started would be to use an existing Nepomuk template. There exists a test template over here.

That template contains more headers than are actually required. Don't use it in the real world!

Creating a Resource

In Nepomuk, a Resource is one of the fundamental building blocks. A resource is basically a unique URI ( Uniform Resource Identifier ) which identifies the resource, and it contains many properties.

All of this is stored in the Nepomuk Repository in statements of the form - Subject Predicate Object

Nepomuk::Resource

The most common way to create or manipulate a resource is using the Nepomuk::Resource class -

Nepomuk::Resource res; res.addType( Nepomuk::Vocabulary::NMO::IMMessage() );

This would initially create a empty resource, and then add the type 'IMMessage' to it. In programming languages, you need to mention the type before creation of an object. Here a resource needs to have its type set as a property.

Internally it would use the rdf:type property to store the type. One could also add the type nmo:Message, but since nmo:IMMessage is a subclass of nmo:Message, it's automatically implied.

Nepomuk automatically assigns a unique uri to each Nepomuk Resource. It is in form of nepomuk:/res/32-bit-uuid

An example -
date1 Bob: Hi!
date2 Mary: Good Evening Bob.

Here we have 2 messages. If we had to store this information in Nepomuk, we would need 2 distinct resources. One for each message.

Nepomuk::Resource mes1; mes1.addType( Nepomuk::Vocabulary::NMO::IMMessage() );

mes1.setProperty( Nepomuk::Vocabulary::NMO::plainTextMessageContent(), "Hi!" ); mes1.setProperty( Nepomuk::Vocabulary::NMO::isRead(), false ); mes1.setProperty( Nepomuk::Vocabulary::NMO::receivedDate(), QDateTime( date1 ) );

Nepomuk::Resource mes2; mes2.addType( Nepomuk::Vocabulary::NMO::IMMessage() );

mes2.setProperty( Nepomuk::Vocabulary::NMO::plainTextMessageContent(), "Good Evening Bob." ); mes2.setProperty( Nepomuk::Vocabulary::NMO::isRead(), false ); mes2.setProperty( Nepomuk::Vocabulary::NMO::receivedDate(), QDateTime( date2 ) );

If we look at the ontology specification - The nmo:toMessage and nmo:fromMessage property has a nco:ContactMedium in its range

   nmo:messageFrom
         a       rdf:Property ;
         rdfs:comment "The sender of the message" ;
         rdfs:domain nmo:Message ;
         rdfs:label "from" ;
         rdfs:range nco:ContactMedium .

So, this means that we need some kind of nco:ContactMedium for Bob and Mary. At this point you should probably try to read through the Nepomuk Contact Ontology.

For a quick summary - nco:Medium is a generic base class for

            nco:ContactMedium
        ____/      |     \__________
       /           |                \
   nco:IMAccount  nco:PhoneNumber nco:EmailAddress 

Since we are dealing with IM chat logging, we would need an nco:IMAccount for both Bob and Mary.

< code to create a Contact >

< add contact info >


The Resource Generator

Most of the given code is horribly verbose and long. Plus, it's not really the C++ way of doing things. So it would be a lot easier for us to generate C++ classes for the ontologies that we use, so we don't have to use the cumbersome addProperty( uri, value ) method.

This can be done by adding the following code to the CMakeLists.txt

nepomuk_add_ontology_classes(

 SRCS
 ONTOLOGIES
 ${SHAREDDESKTOPONTOLOGIES_ROOT_DIR}/nie/nmo.trig
 )

nepomuk_add_ontology_classes(

 SRCS
 ONTOLOGIES
 ${SHAREDDESKTOPONTOLOGIES_ROOT_DIR}/nie/nco.trig
 )

Both NMO and NCO are part of the NIE Ontologies. They are generally installed in the /usr/share/ontology/ folder by the shared-desktop-ontologies project.

This code calls the CMake macro nepomuk_add_ontology_classes, which essentially creates C++ classes for each class in the specified ontology.

On running this, there should be header and source files for each class in the ontology.


Show the same code but using the Resource Generator

Checking if the data exists

After compiling and running our short program - We would want to check if the data actually exists in the Nepomuk Repository. The simplest way to do that would be using a sparql query

select ?r ?p ?o where { ?r a nmo:IMMessage . ?r ?p ?o. }

The query can be executed using either 'nepomukcmd' or by executing it in the nepomukshell.