<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://ontologforum.com/index.php?action=history&amp;feed=atom&amp;title=OntologySummit2014_Hackathon_ReferenceDataForSPARQLPeformanceBenchmarking</id>
	<title>OntologySummit2014 Hackathon ReferenceDataForSPARQLPeformanceBenchmarking - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://ontologforum.com/index.php?action=history&amp;feed=atom&amp;title=OntologySummit2014_Hackathon_ReferenceDataForSPARQLPeformanceBenchmarking"/>
	<link rel="alternate" type="text/html" href="https://ontologforum.com/index.php?title=OntologySummit2014_Hackathon_ReferenceDataForSPARQLPeformanceBenchmarking&amp;action=history"/>
	<updated>2026-06-28T08:55:47Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.39.0</generator>
	<entry>
		<id>https://ontologforum.com/index.php?title=OntologySummit2014_Hackathon_ReferenceDataForSPARQLPeformanceBenchmarking&amp;diff=2758&amp;oldid=prev</id>
		<title>imported&gt;Admin: Last updated at: 2014-03-19 04:23:08 By user: VictorChernov</title>
		<link rel="alternate" type="text/html" href="https://ontologforum.com/index.php?title=OntologySummit2014_Hackathon_ReferenceDataForSPARQLPeformanceBenchmarking&amp;diff=2758&amp;oldid=prev"/>
		<updated>2014-04-28T02:59:11Z</updated>

		<summary type="html">&lt;p&gt;Last updated at: 2014-03-19 04:23:08 By user: VictorChernov&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;= [[OntologySummit2014_Hackathon]] - Project:  =&lt;br /&gt;
&lt;br /&gt;
== Optimized SPARQL performance management via native API  ==&lt;br /&gt;
&lt;br /&gt;
Project roster page: [[OntologySummit2014_Hackathon_ReferenceDataForSPARQLPeformanceBenchmarking]] (this page). &lt;br /&gt;
&lt;br /&gt;
Team lead: [[VictorChernov]] (MSK, UTC+4) vchernov at nitrosbase.com &lt;br /&gt;
&lt;br /&gt;
Event starts 29th of March 2014 14:00 MSK / 10:00 UTC / 03:00 PST all over the world via mikogo.com (the session # will come later) &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
The Goals of the project are &lt;br /&gt;
&lt;br /&gt;
Studying the kinds of queries revealing the advantages of one or another RDF database. The goals imply: &lt;br /&gt;
&lt;br /&gt;
* Selection of a SPARQL subset from SP2Bench &lt;br /&gt;
* Formin a dataset and loading it to all triple-stores. &lt;br /&gt;
* Implementing measurement aids, testing &lt;br /&gt;
* Accurate time measurement, getting min, max, average and median times. &lt;br /&gt;
* Reflection on the results, advantages and disadvantages of the triplestores on each selected query. &lt;br /&gt;
&lt;br /&gt;
The following triplestores will be compared: &lt;br /&gt;
&lt;br /&gt;
* [http://virtuoso.openlinksw.com/ Virtuoso] &lt;br /&gt;
* [http://stardog.com/ Stardog] &lt;br /&gt;
* [http://nitrosbase.com/ [[NitrosBase]]] &lt;br /&gt;
&lt;br /&gt;
The triplestores have the following important advantages: &lt;br /&gt;
&lt;br /&gt;
* Very high  performance on demonstrated on sp2bench benchmark &lt;br /&gt;
* Linux and Windows versions &lt;br /&gt;
* Native API for fast query processing &lt;br /&gt;
&lt;br /&gt;
It is important to use native API for fast query execution. All 3 tools provide native API: &lt;br /&gt;
&lt;br /&gt;
; Virtuoso : Jena, Sesame and Virtuoso ODBC RDF Extensions for SPASQL &lt;br /&gt;
; Stardog : the core SNARL (Stardog Native API for the RDF Language) classes and interfaces &lt;br /&gt;
; [[NitrosBase]] : C++ and .NET native API &lt;br /&gt;
&lt;br /&gt;
We suppose writing additional codes needed for accurate testing: &lt;br /&gt;
&lt;br /&gt;
* Accurate time measurement; &lt;br /&gt;
* Functions for getting min, max, average and median times; &lt;br /&gt;
* Functions for getting time of scanning through the whole query result; &lt;br /&gt;
* Functions for getting time of retrieving first several records (for example, the first page of web grid); &lt;br /&gt;
* Etc. &lt;br /&gt;
&lt;br /&gt;
The following steps are needed for loading test dataset: &lt;br /&gt;
&lt;br /&gt;
* Selecting a data subset from sp2bench benchmark &lt;br /&gt;
* Measuring data loading time &lt;br /&gt;
&lt;br /&gt;
'''''Note:'''''&lt;br /&gt;
''Data are considered as loaded as soon as the system is ready to perform a simplest search query. This is done to eliminate background processes (eg. indexing).'' &lt;br /&gt;
&lt;br /&gt;
We are going to explore the query execution performance by the databases under consideration (Virtuoso, Stardog, [[NitrosBase]]). &lt;br /&gt;
&lt;br /&gt;
The queries should be fairly simple and cover the different techniques, for example: &lt;br /&gt;
&lt;br /&gt;
* search the small range of values &lt;br /&gt;
* search the big range of values &lt;br /&gt;
* Sorting &lt;br /&gt;
* Aggregation &lt;br /&gt;
* Several different join queries &lt;br /&gt;
* Retrieving part of result &lt;br /&gt;
* Retrieving whole result &lt;br /&gt;
* etc. &lt;br /&gt;
&lt;br /&gt;
'''''Note:'''''&lt;br /&gt;
''During testing each database may allocate a lot of resources, that can affect the performance of other databases. Thats why each test should be stared from system reboot.''&lt;/div&gt;</summary>
		<author><name>imported&gt;Admin</name></author>
	</entry>
</feed>