Document the workflow for customizing a local QLever instance based on Wikidata dump #1569

Open
Daniel-Mietchen opened this issue Oct 20, 2024 · 4 comments


@Daniel-Mietchen

At a hackathon this weekend, @WolfgangFahl has set up a local QLever instance based on the current Wikidata dump. It works well for our main purpose (testing Wikidata-related queries) but we have not yet figured out how to customize our instance.

Landing page Freiburg:
Screenshot from 2024-10-20 17-08-57

Landing page Aachen:
Screenshot from 2024-10-20 17-09-20

No major differences except for the Format/Reset button. We would also like to customize the footer.

Index Information Freiburg:
Screenshot from 2024-10-20 17-10-42

Index Information Aachen:
Screenshot from 2024-10-20 17-11-33

Here, we are clearly lacking information but haven't figured out how to get the relevant information to display there.

Backend Information Freiburg:
Screenshot from 2024-10-20 17-12-50

Backend Information Aachen:
Screenshot from 2024-10-20 17-13-13

Again, not much of a difference there, except for the ask command.

Thanks for any pointers.

@hannahbast
Member

hannahbast commented Oct 20, 2024

@Daniel-Mietchen and @WolfgangFahl: Thanks for the feedback. Some comments/questions:

  1. Did you use the qlever CLI (aka the qlever script) with the pre-configured Qleverfile for Wikidata? If yes, everything should have just worked out of the box. Please let us know if you encountered any problems. If you did, I would like to find out whether you did more work than necessary (people sometimes do that) or whether there are bugs on our end.

  2. The functionality for the "Format" button is in qlever-ui#103 ("Add Format button for auto-formatting of query"), which is not merged yet because of some nitpicks. If it is important for you, you can just merge it yourself for your instance. We will also merge it soon.

  3. Exactly what is it that you want to customize? Are you aware that there is a configuration for each backend, which you can customize by clicking on "Backend information" and then "Edit this backend"? The layout of the page is currently not customizable, but it is very easy to modify the template files, which are in https://github.com/ad-freiburg/qlever-ui/tree/master/backend/templates

  4. You can set the description for the index and the text index in the Qleverfile. The respective variable names are DESCRIPTION and TEXT_DESCRIPTION.

  5. ASK queries are not yet implemented, but will be very soon. There is already a PR that works: #1562 ("Implement ASK queries").
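
For item 4, here is a minimal sketch of setting the two variables with a script. It assumes the Qleverfile is INI-style and that both variables belong in a `[data]` section; the section name, the helper `set_descriptions`, and the sample values are assumptions made for illustration, not details confirmed in this thread. Editing the file by hand works just as well.

```python
import configparser
import io


def set_descriptions(qleverfile_text, description, text_description, section="data"):
    """Return Qleverfile text with DESCRIPTION and TEXT_DESCRIPTION set.

    Assumption: the Qleverfile is INI-style and both variables belong in
    the given section (hypothetical default: "data").
    """
    # interpolation=None leaves ${...} references in other values untouched.
    parser = configparser.ConfigParser(interpolation=None)
    parser.optionxform = str  # keep upper-case variable names as they are
    parser.read_string(qleverfile_text)
    if not parser.has_section(section):
        parser.add_section(section)
    parser.set(section, "DESCRIPTION", description)
    parser.set(section, "TEXT_DESCRIPTION", text_description)
    out = io.StringIO()
    parser.write(out)
    return out.getvalue()


# Hypothetical sample input and values.
example = """[data]
NAME = wikidata
"""
updated = set_descriptions(
    example,
    "Wikidata dump, indexed in Aachen",
    "All literals from the Wikidata dump",
)
print(updated)
```

Note that `configparser` drops comments when it writes the file back, so for a hand-maintained Qleverfile a manual edit may be preferable.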

@WolfgangFahl

WolfgangFahl commented Oct 23, 2024

@hannahbast Thanks for the response. We did indeed create a qlv script as a wrapper to run the qlever script in the background: since it uses a TTY, we cannot use nohup for it and have to run it in a screen session, which makes error handling harder. Please note that the machine will have a running UI Docker process while indexing runs, since we intend to rotate between freshly indexed versions as often as possible, which currently means weekly. So the next test will probably start tomorrow on our alpha disk, while the Oct 16th dump was on delta.

> Did you use the qlever CLI aka qlever script with the pre-configured QLeverfile for Wikidata? If yes, everything should have just worked out of the box.

See ad-freiburg/qlever-control#80 for why it does not. Even after a successful index build, we do not automatically get a new running instance in the style of the current qlv script: https://wiki.bitplan.com/index.php/Wikidata_Import_2024-10-17#Using_qlv_script

The qlv script allows us to rotate disks, and it also works around the TTY problem: we cannot run the qlever control script in the background and instead have to use a screen session, which is much harder to control and debug.

> You can set the description for the index and the text index in the Qleverfile. The respective variable names are DESCRIPTION and TEXT_DESCRIPTION.

Having the information in the log output would already help. IMHO, the source timestamp, the size of the download, and the number of triples are worth keeping around anyway. For example, I still have a hard time comparing triple counts in https://wiki.bitplan.com/index.php/List_of_Imports

@hannahbast
Member

@WolfgangFahl You can solve the TTY problem very easily (for any script) by prepending unbuffer ... to the command. For example, see https://github.com/ad-freiburg/qlever-control/blob/main/.github/workflows/end-to-end-test-ubuntu.yml
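
As a rough sketch of that suggestion, a wrapper could prepend `unbuffer` (which ships with the `expect` package) and redirect output to a log file so that the command can run detached from the terminal. The helper names, the log file name, and the choice of `qlever index` as the wrapped command are illustrative assumptions.

```python
import shutil
import subprocess


def unbuffered(command):
    """Return the command with `unbuffer` prepended (runs it under a pseudo-TTY)."""
    return ["unbuffer", *command]


def run_detached(command, log_path):
    """Start the command in the background, appending stdout and stderr to a log."""
    if shutil.which("unbuffer") is None:
        raise RuntimeError("unbuffer not found; it is part of the 'expect' package")
    with open(log_path, "ab") as log:
        return subprocess.Popen(
            unbuffered(command),
            stdout=log,
            stderr=subprocess.STDOUT,
            stdin=subprocess.DEVNULL,
            start_new_session=True,  # detach from the controlling terminal
        )


# Show the command line that would be executed (nothing is started here).
print(" ".join(unbuffered(["qlever", "index"])))

# Hypothetical usage, replacing the screen session:
# process = run_detached(["qlever", "index"], "qlever-index.log")
# print(f"started with PID {process.pid}")
```

Because the process is started in its own session and its output goes to a file, the exit code and the log can be inspected afterwards without attaching to a screen session.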

Do any of the issues you reported still persist?

@WolfgangFahl

@hannahbast Thanks for the response.

> Do any of the issues you reported still persist?

Yes, and things have recently become worse, since the indexer starts even if the download is incomplete, which has happened twice recently. See https://wiki.bitplan.com/index.php/Wikidata_Import_2024-11-15 and https://wiki.bitplan.com/index.php/Wikidata_Import_2024-10-24

Also, our endpoint https://qlever.wikidata.dbis.rwth-aachen.de/wikidata currently points to the wrong backend, which is awkward since we tried to change it. We do not seem to know how to persist all the necessary settings.
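
One possible guard against the incomplete-download problem mentioned above, sketched under assumptions: before starting the indexer, compare the size of the local file with the `Content-Length` the server reports for the dump. The helper names and the dump URL in the usage comment are assumptions for illustration; this is a workaround in a wrapper script, not something the qlever script is known to do.

```python
import os
import urllib.request


def remote_size(url, timeout=30):
    """Return the Content-Length reported for url by a HEAD request, or None."""
    request = urllib.request.Request(url, method="HEAD")
    with urllib.request.urlopen(request, timeout=timeout) as response:
        length = response.headers.get("Content-Length")
    return int(length) if length is not None else None


def download_is_complete(local_path, expected_size):
    """True only if the file exists and has exactly the expected size."""
    return (
        expected_size is not None
        and os.path.isfile(local_path)
        and os.path.getsize(local_path) == expected_size
    )


# Hypothetical usage before starting the indexer (URL is an assumption):
# url = "https://dumps.wikimedia.org/wikidatawiki/entities/latest-all.ttl.bz2"
# if not download_is_complete("latest-all.ttl.bz2", remote_size(url)):
#     raise SystemExit("download incomplete, not starting the indexer")
```

The check is only meaningful if the remote file has not been replaced by a newer dump between the download and the comparison, so it is best run right after the download finishes.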
