Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ElasticSearch or OpenSearch #5540

Open
krabina opened this issue Oct 11, 2023 · 6 comments
Open

ElasticSearch or OpenSearch #5540

krabina opened this issue Oct 11, 2023 · 6 comments
Labels

Comments

@krabina
Copy link
Contributor

krabina commented Oct 11, 2023

@Designburo raised this topic about SMW here https://github.com/orgs/Open-CSP/discussions/5

So the general question is: should SMW stay with ElasticSearch (which since 7.11 has a changed their license model and is currently at version 8.10) or move to Amazon OpenSearch (which is a fork of Elasticsearch 7.10.2.)

For those who are new to the topic, there are some articles for further reading:

My personal opinion is that it should stay with whatever decision the Wikimedia Foundation takes with CirrusSearch, but this seems not to be decided yet:

@krabina
Copy link
Contributor Author

krabina commented Oct 11, 2023

This will only become valid once we resolve the problem that currently SMW is not compatible with ElasticSearch >7, see #5328

@krabina
Copy link
Contributor Author

krabina commented Oct 11, 2023

see also #5451

@Designburo
Copy link

Designburo commented Oct 11, 2023

The problem we face with customers is that Elastic Search 7.11 is already end-of-life and they do not accept that.
Elastic Search itself is a very proven solution and I would favour that. For users that do not use Cirrus Search, it would be fine to use ES 8.x. It is in a way all depending on what the Wikimedia Foundation will decide. However we cannot wait that long any more.

@marijnvanwezel
Copy link
Contributor

We should also consider supporting both ElasticSearch and OpenSearch. I have already started some work on this (https://github.com/WikibaseSolutions/SemanticMediaWiki/tree/elastic-factory-abstraction).

The fact that still no choice has been made indicates that there are good reasons to choose either (e.g. you dislike Amazon, have infrastructure requirements or have a support pact with ElasticSearch and therefore choose ElasticSearch, or you want to use free software and choose OpenSearch).

@krabina
Copy link
Contributor Author

krabina commented Oct 31, 2023

I totally agree @marijnvanwezel. Thank you for your PR an thank you @JeroenDeDauw for reviewing it!
It is a great idea to support both.

Btw: a question. I think it is a pity that users have to decide between ElasticStore and an RDF store. Do you think, conceptually it would be possible to have the SMWStore to use ElasticStore (which does not replace the SQLStore) or alternatively use ElasticStore AND an RDFStore? This way SMW could provide a SPARQL endpoint and at the same time have the better search experience with Elastic.

@marijnvanwezel
Copy link
Contributor

Hi @krabina, I do not have any experience with how the RDFStore works, but conceptually I think it should be possible. The ElasticStore already uses replication and lives alongside the SQLStore. I don't see why that would not be possible if it were an RDFStore instead.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

4 participants
@krabina @Designburo @marijnvanwezel and others