Projects/Nepomuk: Difference between revisions

From KDE TechBase
(Removed page from translation)
(112 intermediate revisions by 13 users not shown)
Line 1: Line 1:
[[Image:Nepomuk_logo_big.png|center|300px]]
[[Image:Nepomuk_logo_big.png|center|300px]]


== About Nepomuk ==
== About Nepomuk ==


This page is dedicated to Nepomuk development ideas, progress, experiments, and is a general starting point for new developers.
'''Nepomuk''' serves as a cross application semantic storage backend. It aims at collecting data from various sources - file indexing, the web, applications, etc, and linking them all together to form a cohesive map of data.


For general information about the Nepomuk project see the [http://nepomuk.kde.org/ dedicated Nepomuk homepage].
This page is dedicated to third party documentation for '''Nepomuk'''. To know more about '''Nepomuk''' from a user's point of view, head over to the [http://userbase.kde.org/Special:myLanguage/Nepomuk Nepomuk page on UserBase]. Or to know more about the Nepomuk community and getting involved in '''Nepomuk''', head over to the [http://community.kde.org/Projects/Nepomuk Nepomuk Community Page].


== Documentation ==


== Developer Coordination ==
Any new project is intimidating and jumping right into the [http://api.kde.org/4.x-api/kdelibs-apidocs/nepomuk-core/html/index.html API Documentation] can be scary. So, we have prepared some articles which explain the different aspects of '''Nepomuk''' and even touch on some advanced features.


The Nepomuk project is maintained by [mailto:trueg@kde.org Sebastian Trueg] of Mandriva.
The documentation of any project is always in progress as the code base is always evolving. If you feel that the documentation is lacking in some regard, please come talk to us. We'd love to hear your feedback, and the documentation might just get improved in the process.
 
 
== Documentation ==


The following links provide good reads for getting used to the Nepomuk system and its APIs.
'''Nepomuk Mailing List: ''' nepomuk@kde.org <br/>
'''IRC Channel:''' #nepomuk-kde on freenode


* [[Development/Tutorials/Metadata/Nepomuk|Development Tutorials]]
=== Introductory Material ===
* [http://api.kde.org/4.x-api/kdelibs-apidocs/nepomuk/html/index.html Nepomuk API Documentation]
* [http://soprano.sourceforge.net/apidox/trunk/index.html Soprano (RDF storage) API]


If you're just getting started with '''Nepomuk''' and want to know a quick way to fetch some data.


== ToDo ==
* [[Special:myLanguage/Projects/Nepomuk/QuickStart| Quick Start]]
* [[Special:myLanguage/Projects/Nepomuk/OntologyBasics| Basic Ontology concepts]]
* [[Special:myLanguage/Projects/Nepomuk/Uris| Questions about URIs]]


Nepomuk is a rather young project with a notorious shortage in developers. There are many tasks and subprojects to get ones hands dirty on. Unlike other projects like Plasma, however, developing for Nepomuk is not easy. One has to read up on a lot of things and fight some day-to-day annoyances. But: helping with the development will improve the situation in any case.
=== Managing Data ===


If you are interested in working on a task in this list, please contact [mailto:trueg@kde.org Sebastian Trueg].
This section includes more in-depth articles on how manage the data in '''Nepomuk'''. As a starting point you should probably open up the [http://api.kde.org/4.x-api/kdelibs-apidocs/nepomuk-core/html/index.html Nepomuk API Documentation]. It is generally more up to date than the articles mentioned below.


=== Low level Nepomuk Development Tasks ===
* [[Special:myLanguage/Projects/Nepomuk/Resources| Using Resources]]
* [[Special:myLanguage/Projects/Nepomuk/ResourceWatcher| Monitoring Changes]]
* [[Special:myLanguage/Projects/Nepomuk/BulkChanges| Bulk Changes]]
* [[Special:myLanguage/Projects/Nepomuk/DataFeeders| Data Feeders]]


The low-level development tasks are those that are not directly reflected in the GUI or even in the API used by most developers. However, they are important in terms of performance, scalability, and compatibility.
=== File Indexing ===


==== Soprano Transaction Support ====
With 4.10, the file indexing architecture has substantially changed. We no longer rely on strigi, and have our own plugin based interface.


[http://soprano.sf.net/ Soprano] is the RDF database framework used in Nepomuk. Currently Soprano does not support transactions, i.e. sets of commands that can be rolled back. An [http://websvn.kde.org/branches/soprano/experimental experimental development] branch exists which already contains new API for transaction support (while keeping BC).
* [[Special:myLanguage/Projects/Nepomuk/IndexingPlugin| Writing an Indexing Plugin]]


It still misses an implementation of the transaction support in Soprano backends (Sesame2 and Virtuoso) and in the client/server architecture.
=== Querying ===


=== General Nepomuk ===
As you advance into '''Nepomuk''', you'll want to move beyond just fetching and pushing data and will want to query '''Nepomuk''' for specialized data. One can query '''Nepomuk''' in many different ways, the important part is to optimize your queries and make sure they run well on production systems where the database sizes may way very large.


==== Handling of external storage ====
* [[Special:myLanguage/Projects/Nepomuk/QueryingMethods| Different ways to Query Nepomuk]]
* [[Special:myLanguage/Projects/Nepomuk/QueryLibrary| Nepomuk Query Library]]
* [[Special:myLanguage/Projects/Nepomuk/SparqlQueries| Sparql Queries]]


A typical problem with the way Nepomuk handles files and file metadata are removable storage devices. They can be mounted at different paths on different systems. But still one wants to keep the metadata stored in Nepomuk.
=== Architectural Overview ===
If possible one would even want to be able to search for files saved on an USB stick even if it is not plugged in.


The [http://trueg.wordpress.com/2009/04/15/portable-meta-information-yet-again-only-this-time-there-is-code/ blog entry about removable storage in Nepomuk] already discusses this problem and shows some existing code in KDE's [http://websvn.kde.org/trunk/playground/base/removablestorageservice/ playground] which tries to tackle this problem.
If you're looking to get more involved with '''Nepomuk''' development process, you should probably need to need to figure out our basic architecture and where you can find all the relevant code.


However, one actually needs more. The system would have to be embedded into KIO to make sure the metadata cache on the removable storage device is always up-to-date. Also it is directly related to the problem of relative vs. absolute file URLs.
* [[Special:myLanguage/Projects/Nepomuk/Repositories| Nepomuk Repositories]]
* [[Special:myLanguage/Projects/Nepomuk/ComponentOverview| Nepomuk Architectural Overview]]
* [[Special:myLanguage/Projects/Nepomuk/kioslaves| Nepomuk KIO Slaves]]


==== Relative vs. Absolute File URLs ====
=== Nepomuk Internals ===


Currently Nepomuk uses the absolute file URLs as URI identifiers for the resources representing the files in the Nepomuk RDF store. The file ''~/test.png'' for example has the resource URI ''file:///home/<username>/test.png''. This is nice in many situations since one can simply use the file URL to query file metadata but on the other hand we need to change a lot of triples whenever the file is moved (not to mention the removable storage problem above).
When you decide to dig even deeper.


Thus, the idea is to use random URI identifiers for new file resources and store the file path relative to the mount point. This would solve the above problem with removable devices and make updates after file moves simpler (only update the path).
* [[Special:myLanguage/Projects/Nepomuk/GraphConcepts| Graph handling]]
* [[Special:myLanguage/Projects/Nepomuk/VirtuosoInternal| Virtuoso Internals]]
* [[Special:myLanguage/Projects/Nepomuk/OntologyExtention| Extending the Ontologies]]


This problem should probably be tackled by introducing a class Nepomuk::File as a subclass to ''[http://api.kde.org/4.x-api/kdelibs-apidocs/nepomuk/html/classNepomuk_1_1Resource.html Nepomuk::Resource]'' which handles all these special file stuff like making sure we have a correct nao:filePath property and so on (currently all that is done with an ''if'' clause in ''Nepomuk::Resource''.
=== Miscellaneous ===


== Ideas ==
* [[Special:myLanguage/Projects/Nepomuk/Nepomuk2Port| Porting to Nepomuk2]]
* [[Special:myLanguage/Projects/Nepomuk/ManagingNepomukProcesses| Managing Nepomuk Processes]]
* [[Special:myLanguage/Projects/Nepomuk/TestEnvironment| Nepomuk Test Environment]]
* [[Special:myLanguage/Development/Tutorials/Metadata/Nepomuk/TipsAndTricks| Nepomuk Tips and Tricks]]
* [[Special:myLanguage/Projects/Nepomuk/NepomukShow| Debugging Nepomuk Data]]


There are many ideas on how to improve the Nepomuk system or on how to use it. This is the place to list them all.
==== Outdated links ====


Feel free to add your own ideas. Please leave your name in case someone wants to contact you for details or a discussion of the idea.
The following links provide good reads for getting used to the '''Nepomuk''' system and its APIs. <br\>
They are slightly outdated, but still has some useful material.
* [[Special:myLanguage/Development/Tutorials/Metadata/Nepomuk|Development Tutorials]]
* [[Special:myLanguage/Projects/Nepomuk/Ideas|Random Ideas]]
* [[Special:myLanguage/Projects/Nepomuk/Qualified_Relations_Idea| Qualified Relations Idea]]
* [[Special:myLanguage/Projects/Nepomuk/ScenarioExamples| Scenario Examples]]


=== FIXME: add ideas ===
[[Category:Documentation]]

Revision as of 12:34, 9 February 2018

About Nepomuk

Nepomuk serves as a cross application semantic storage backend. It aims at collecting data from various sources - file indexing, the web, applications, etc, and linking them all together to form a cohesive map of data.

This page is dedicated to third party documentation for Nepomuk. To know more about Nepomuk from a user's point of view, head over to the Nepomuk page on UserBase. Or to know more about the Nepomuk community and getting involved in Nepomuk, head over to the Nepomuk Community Page.

Documentation

Any new project is intimidating and jumping right into the API Documentation can be scary. So, we have prepared some articles which explain the different aspects of Nepomuk and even touch on some advanced features.

The documentation of any project is always in progress as the code base is always evolving. If you feel that the documentation is lacking in some regard, please come talk to us. We'd love to hear your feedback, and the documentation might just get improved in the process.

Nepomuk Mailing List: [email protected]
IRC Channel: #nepomuk-kde on freenode

Introductory Material

If you're just getting started with Nepomuk and want to know a quick way to fetch some data.

Managing Data

This section includes more in-depth articles on how manage the data in Nepomuk. As a starting point you should probably open up the Nepomuk API Documentation. It is generally more up to date than the articles mentioned below.

File Indexing

With 4.10, the file indexing architecture has substantially changed. We no longer rely on strigi, and have our own plugin based interface.

Querying

As you advance into Nepomuk, you'll want to move beyond just fetching and pushing data and will want to query Nepomuk for specialized data. One can query Nepomuk in many different ways, the important part is to optimize your queries and make sure they run well on production systems where the database sizes may way very large.

Architectural Overview

If you're looking to get more involved with Nepomuk development process, you should probably need to need to figure out our basic architecture and where you can find all the relevant code.

Nepomuk Internals

When you decide to dig even deeper.

Miscellaneous

Outdated links

The following links provide good reads for getting used to the Nepomuk system and its APIs. <br\> They are slightly outdated, but still has some useful material.