
Possible memory leak in SPARQL endpoint? #123

Open
cKlee opened this issue Apr 18, 2016 · 1 comment

cKlee commented Apr 18, 2016

Hi,
I'm using the solrdf-1.0 branch with JRE 1.7.0 and successfully loaded approx. 125,000,000 documents into the store (and optimized it). Solr queries are fine, but with simple SPARQL queries I always get a "java.lang.OutOfMemoryError: Java heap space" exception (even with LIMIT 1).

I'm running a 4-CPU, 64-bit Debian machine with 16 GB of RAM. I'm not very familiar with Maven, but I tried to give SolRDF a bit more RAM with

$ mvn -DargLine="-Xmx8g" cargo:run

... but I got the same exception.
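
I'm not sure that flag even reaches the container JVM, though: as far as I can tell, argLine is only read by the Surefire test plugin, while the Cargo Maven plugin takes the container's JVM arguments from the cargo.jvmargs property. If that reading of the Cargo docs is right, something like this would be the way to raise the heap:

$ mvn -Dcargo.jvmargs="-Xmx8g" cargo:run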

agazzarini (Member) commented

Hi @cKlee,
unfortunately this is part of #96, which is a huge piece of work that is still pending. The current implementation of SolRDF uses the general-purpose SPARQL Algebra implementation bundled with Jena, which is fine for small or in-memory datasets.

More sophisticated evaluation logic is needed here, in order to take advantage of the underlying inverted index and, most importantly, to avoid what you're seeing.

Specifically, even small queries containing joins or a count() over simple graph patterns end up scanning the entire index; I guess that's the underlying reason for your out-of-memory issue.
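
To make this concrete, here is a minimal sketch (not SolRDF's actual code, and using current org.apache.jena package names; the solrdf-1.0 branch may still rely on the older com.hp.hpl.jena ones) of how ARQ evaluates a query over any custom Graph. Every triple pattern is turned into a Graph.find(...) call, and joins and aggregations are computed in memory on top of the returned iterators, so a LIMIT does not bound how much of the index gets pulled:

import org.apache.jena.graph.Triple;
import org.apache.jena.graph.impl.GraphBase;
import org.apache.jena.query.QueryExecution;
import org.apache.jena.query.QueryExecutionFactory;
import org.apache.jena.query.ResultSet;
import org.apache.jena.rdf.model.Model;
import org.apache.jena.rdf.model.ModelFactory;
import org.apache.jena.util.iterator.ExtendedIterator;
import org.apache.jena.util.iterator.NullIterator;

public class ArqScanSketch {

    // Hypothetical index-backed graph: ARQ only ever sees this one entry
    // point, one triple pattern at a time.
    static class IndexBackedGraph extends GraphBase {
        @Override
        protected ExtendedIterator<Triple> graphBaseFind(final Triple pattern) {
            // A real backend would translate the pattern into a Solr query
            // and stream the matches back. ARQ may consume this iterator
            // completely (e.g. to build one side of a join, or to evaluate
            // COUNT(*)) before any LIMIT in the enclosing query applies.
            System.out.println("find() called for pattern: " + pattern);
            return NullIterator.instance(); // empty graph, just for the sketch
        }
    }

    public static void main(final String[] args) {
        final Model model = ModelFactory.createModelForGraph(new IndexBackedGraph());

        // For a single pattern the LIMIT is cheap, but add a join or an
        // aggregate and the underlying iterators are drained entirely.
        final String query = "SELECT * WHERE { ?s ?p ?o } LIMIT 1";

        try (QueryExecution execution = QueryExecutionFactory.create(query, model)) {
            final ResultSet results = execution.execSelect();
            while (results.hasNext()) {
                System.out.println(results.next());
            }
        }
    }
}

With a single pattern the iterator can stop early, but something like SELECT (COUNT(*) AS ?c) WHERE { ?s ?p ?o } has to drain the whole iterator, i.e. all ~125,000,000 entries, which would match the heap exhaustion you're seeing.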

The bad news is that I don't have a precise idea of when this work will be done, as it is complicated and is taking me a lot of time.

AG
