forked from Merck/Halyard

Halyard-star is an extremely horizontally scalable Triplestore with support for Named Graphs, designed for integration of extremely large Semantic Data Models, and for storage and SPARQL 1.1 querying of the whole Linked Data universe snapshots.



Halyard-star


Fork of Halyard with support for RDF-star, XPath 3 functions, SPIN functions (and many other improvements/fixes).

NB: this fork is not compatible with the original.

Halyard is an extremely horizontally scalable triple store with support for named graphs, designed for integration of extremely large semantic data models and for storage and SPARQL 1.1 querying of complete Linked Data universe snapshots. The Halyard implementation is based on the Eclipse RDF4J framework and the Apache HBase database, and is written entirely in Java.

Documentation: https://pulquero.github.io/Halyard

Get started

Download and unzip the latest halyard-sdk-<version>.zip bundle to an Apache Hadoop cluster node with a configured Apache HBase client.

Halyard is expected to run on an Apache Hadoop cluster node with a configured Apache HBase client. Apache Hadoop and Apache HBase components are not bundled with Halyard. The runtime requirements are:

  • Apache Hadoop version 3.3 or higher
  • Apache HBase version 2.5 or higher
  • Java 11 Runtime

(For convenience, here is a Java 11 build of HBase 2.5 against Hadoop 3.3).
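
The minimum versions listed above can be checked before installing. The following is a minimal sketch, not part of Halyard: `version_ge` is a hypothetical helper that compares dotted version numbers component by component and ignores non-numeric suffixes.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with Halyard): succeeds when version $1 >= version $2.
# Versions are compared numerically, one dot-separated component at a time.
version_ge() {
    vg_a=$1
    vg_b=$2
    while [ -n "$vg_a" ] || [ -n "$vg_b" ]; do
        # Take the leading component of each version.
        vg_x=${vg_a%%.*}
        vg_y=${vg_b%%.*}
        # Drop any non-numeric suffix (e.g. "5-SNAPSHOT" becomes "5"); a missing component counts as 0.
        vg_x=${vg_x%%[!0-9]*}
        vg_y=${vg_y%%[!0-9]*}
        vg_x=${vg_x:-0}
        vg_y=${vg_y:-0}
        if [ "$vg_x" -gt "$vg_y" ]; then return 0; fi
        if [ "$vg_x" -lt "$vg_y" ]; then return 1; fi
        # Components are equal: move on to the next one, if any.
        case $vg_a in *.*) vg_a=${vg_a#*.} ;; *) vg_a= ;; esac
        case $vg_b in *.*) vg_b=${vg_b#*.} ;; *) vg_b= ;; esac
    done
    return 0
}
```

For example, `version_ge 3.3.6 3.3` succeeds and `version_ge 2.4.17 2.5` fails. How the installed version string is obtained is left to the caller; one option, assuming the first line of `hadoop version` has the form `Hadoop <version>`, is `version_ge "$(hadoop version | awk 'NR==1 {print $2}')" 3.3`.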

To run the webapps on Tomcat, create bin/setenv.sh containing the following line, adjusting the HBase and Hadoop paths to match your installation:

    export CLASSPATH="$CATALINA_HOME/lib/*:/mnt/hbase-2.5.5/conf:/mnt/hbase-2.5.5/lib/shaded-clients/*:/mnt/hbase-2.5.5/lib/client-facing-thirdparty/*:/mnt/hadoop-3.3.6/etc/hadoop:/mnt/hadoop-3.3.6/share/hadoop/client/*"
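
If the HBase and Hadoop install locations differ between nodes, the same classpath can be assembled from variables instead of being hard-coded. The sketch below is one possible bin/setenv.sh. The function name `halyard_classpath` and the use of `HBASE_HOME` and `HADOOP_HOME` are assumptions for illustration; the directory layout and the fallback paths are the ones from the line above.

```shell
#!/bin/sh
# Hypothetical helper: prints the Tomcat classpath for the Halyard webapps.
# $1 = Tomcat home, $2 = HBase home, $3 = Hadoop home
halyard_classpath() {
    printf '%s' "$1/lib/*:$2/conf:$2/lib/shaded-clients/*:$2/lib/client-facing-thirdparty/*:$3/etc/hadoop:$3/share/hadoop/client/*"
}

# Tomcat normally sets CATALINA_HOME before it sources setenv.sh.
# HBASE_HOME and HADOOP_HOME fall back to the example paths when unset.
CLASSPATH="$(halyard_classpath "${CATALINA_HOME:-}" "${HBASE_HOME:-/mnt/hbase-2.5.5}" "${HADOOP_HOME:-/mnt/hadoop-3.3.6}")"
export CLASSPATH
```

The wildcards stay unexpanded because they are inside double quotes; the JVM expands `dir/*` classpath entries itself.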

See Documentation for usage examples, architecture information, and more.

Repository contents

  • common - a library for direct mapping between an RDF data model and Apache HBase
  • strategy - a generic parallel asynchronous implementation of RDF4J Evaluation Strategy
  • sail - an implementation of the RDF4J Storage and Inference Layer on top of Apache HBase
  • tools - a set of command line and Apache Hadoop MapReduce tools for loading, updating, querying, and exporting the data with maximum performance
  • sdk - a distributable bundle of Eclipse RDF4J and Halyard for command line use on an Apache Hadoop cluster with a configured HBase client
  • webapps - a re-distribution of Eclipse RDF4J Web Applications (RDF4J-Server and RDF4J-Workbench), patched and enhanced to include Halyard as another RDF repository option
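
Once the webapps are deployed, a Halyard repository can in principle be queried like any other RDF4J Server repository over the SPARQL protocol. The following is a minimal sketch under stated assumptions: the server webapp is deployed under the standard `/rdf4j-server` context path, `curl` is installed, and the function names `repo_endpoint` and `sparql_select` are hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: prints the SPARQL endpoint URL of a repository on an RDF4J Server.
# Assumes the server webapp is deployed under the /rdf4j-server context path.
# $1 = server base URL (e.g. http://localhost:8080), $2 = repository ID
repo_endpoint() {
    printf '%s/rdf4j-server/repositories/%s' "${1%/}" "$2"
}

# Hypothetical helper: sends a SPARQL query with HTTP POST and prints JSON results.
# $1 = endpoint URL, $2 = SPARQL query text
sparql_select() {
    curl --silent --fail \
         --header 'Accept: application/sparql-results+json' \
         --data-urlencode "query=$2" \
         "$1"
}
```

A call might look like `sparql_select "$(repo_endpoint http://localhost:8080 my-repo)" 'SELECT * WHERE { ?s ?p ?o } LIMIT 10'`, where `my-repo` stands in for the ID of a repository created through RDF4J-Workbench.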

Guides

JMX Console

Elasticsearch Export

Reference

Supported SPARQL functions
