Semantic search storage and routing #495

jgonggrijp · 2021-07-29T16:29:57Z

This branch implements #486 by storing a (relatively) lightweight JSON representation of the query to the backend, giving it a serial number and an optional mnemonic label, and using that serial number for routing so that queries can be returned to and shared with other users. On deep linking, the search form is also restored with the visual representation of the query, so that the user can easily create subtle variations of pre-existing queries. This will always create a new query with a separate serial number; queries cannot be changed after creation.

The list action on the backend API returns only the user's own queries. This is partly a preparation for #163 and partly to protect the intellectual property of other users. It is still possible to see other user's queries by entering the serial numbers one by one, which is necessary in order to enable sharing of particular queries.

I just realized that there is no admin page yet to manage the queries (e.g. to delete queries on user request) and that I need to update the README of the semantic-search directory. I'll add some commits to the branch to address this.

Other than that, there is a subtle cosmetic issue that remains after the bug I fixed in 9c1ba6c. Select2 has the annoying property that if you select an <option>, it will no longer update its contents afterwards. Before 9c1ba6c, this resulted in chosen types and predicates having blank labels when re-visiting the search form through a deep link. I mostly fixed it by adding more event handling steps, but due to our complex data model, the label of an option sometimes updates more than once. As a result, the Reader class may appear labeled as "Person" in this scenario. I consider this subtle and cosmetic enough that we can document it in a low-priority issue (which I will create after submitting this PR).

Most of the code changes are glue code: import lines, event bindings, declarative configuration, short methods that just call a few pre-existing functions, etcetera. The only "exciting magic" is in frontend/src/semantic-search/model.ts, which is responsible for two-way conversion between the frontend's model-/collection-based intermediate representation and the backend's more lightweight JSON-based storage representation, and in frontend/src/utilities/abstractDeepEqual.ts, which I needed in order to test the former. There is not much code in either of these modules, but the logic is recursive, which could make it a bit trickier to wrap your head around. Because of this recursive complexity, and because correct conversion is highly critical, both are heavily tested in adjacent modules.

By means of review, I suggest focusing on trying the changes locally, though a code review is also welcome.

jgonggrijp · 2021-07-30T22:54:12Z

I just realized that there is no admin page yet to manage the queries (e.g. to delete queries on user request) and that I need to update the README of the semantic-search directory. I'll add some commits to the branch to address this.

Done.

JeltevanBoheemen

Very nice work @jgonggrijp.

One question: how is the label used (or planned to be used)? I think a nice addition may be to allow accessing a saved query by label (e.g. /explore/query/find-ipad-readers). This would probably need some validation on the label to not produce broken urls though.
I think a place where the user can see his/her latests query would also work, and display the labels there. This might be part of a personal landing page.
Both definitely fall in the 'nice-to-have' category.

JeltevanBoheemen · 2021-08-02T15:29:53Z

backend/items/admin.py

+    list_filter = ('created', 'creator')
+    show_full_result_count = False
+
+    def view_on_site(self, obj):


I didn't know this method, very useful!

JeltevanBoheemen · 2021-08-02T15:36:03Z

frontend/src/core/model.ts

        // Django requires the trailing slash, so add it.
-        return BackboneModel.prototype.url.apply(this) + '/';
+        return superUrl + '/';


Can't follow what's happening here. When a new model is made it shouldn't append the trailing slash?

Correct. By default, Backbone will generate /path/to/api/endpoint/ for saving new models and /path/to/api/endpoint/id for fetching or updating existing models. In the first case, we don't want to append a second slash. In the second case, we do want to finish with a slash because Django will otherwise interpret it as a frontend route.

JeltevanBoheemen · 2021-08-02T15:37:28Z

frontend/src/explorer/route-actions.ts

@@ -69,3 +77,9 @@ export function itemWithOccurrences(control: Controller, node: Node) {
 export function searchResultsSources(control: Controller, queryParams: any) {
    return control.resetSourceListFromSearchResults(queryParams);
 }
+
+export


export is placed wrong/not consistent here. Minor.

JavaScript/TypeScript allows it and it keeps the lines within 80 columns. It seemed less disruptive than using Egyptian parens for the parameter list. Please feel free to format differently when you see code like this (I've done it in more places).

No problem. My IDE didn't format it as a function, but the code ran without a problem, so its the language server that's wrong :)

jgonggrijp · 2021-08-02T18:22:41Z

Very nice work @jgonggrijp.

Thanks!

I think a nice addition may be to allow accessing a saved query by label (e.g. /explore/query/find-ipad-readers). This would probably need some validation on the label to not produce broken urls though.

Nice, but also a bit redundant with the id-based routes. Also introduces potential for name clashes, while duplicate names should arguably be allowed in this case.

I think a place where the user can see his/her latests query would also work, and display the labels there. This might be part of a personal landing page.

This is exactly the use case that motivated me to add the label field.

Thanks for the review!

jgonggrijp added 30 commits July 1, 2021 17:43

Add backend model for storing semantic queries in JSON form (#486)

4d53ed1

Merge branch 'develop' into feature/sem-search-routing

6205a5f

For future use, also store who created each query and when (#486)

bf3d142

Add a preliminary serializer for the backend SemanticQuery model (#486)

b2173fb

Strip trailing whitespace from backend/items/views.py

64c919d

Add preliminary DRF viewset for SemanticQuery model (#486)

e57b0c1

Register the SemanticQueryViewSet with the api root (#486)

234fe16

Add the abstractDeepEqual utility (#486)

e6d7e94

Add SemanticQuery frontend model (#486)

26fff8e

Apply toJSON recursively in SemanticQuery.toJSON (#486)

2f9f39d

Use .toJSON in abstractDeepEqual as well (#486)

caeeb6d

Make abstractDeepEqual more robust against asymmetries (#486)

8bf711f

Comment the frontend SemanticQuery model and tests (#486)

d708149

Add semantic query collection type for backend URL derivation (#486)

994d67e

Make SemanticQuery the model type of SemanticSearchView (#486)

f3560a8

Add an <input> to the SemanticSearchView for the query label (#486)

2f0ab0a

Reset query id on semantic search form changes (#486)

8a3b403

Consistently mediate all search events through welcome view (#486)

0d0d95a

Put WelcomeView in charge of managing the SemanticSearchView (#486)

1dca5b2

Automatically set the creator on newly posted semantic queries (#486)

831a983

Add global collection of semantic queries (#486)

4f4b8d4

Fix an oversight in Model.prototype.url (#486)

7b8028a

Save semantic query to backend if new (#486)

5536948

Pass semantic query model to SearchResultListView (#486)

23a12da

Include serial in semantic search results route (#486)

4b24776

Enable announceRoute utility to work with numeric ids (#486)

005cce7

Take into account that the semantic query may arrive async (#486)

c99f2ed

Report the correct route for semantic search results (#486)

0ecd3c8

Enable semantic search deep linking (close #486)

1864a43

Only destroy select2 element if initialized (#486 #465 #426)

15d6a58

jgonggrijp added 4 commits July 29, 2021 17:29

Restore selection in FilterInput async (#486)

f314315

Ensure that preselected options appear with a label (#486)

9c1ba6c

Only include semantic query in response on retrieve (#486)

b1b0ce8

Restrict semantic query API listing to user's own (#486 #163)

3756a53

jgonggrijp requested a review from JeltevanBoheemen July 29, 2021 16:29

jgonggrijp mentioned this pull request Jul 30, 2021

Selected option in semSearch dropdown may get stuck on non-preferred label after deep link #496

Open

jgonggrijp added 2 commits July 31, 2021 00:28

Add a luxurious backend admin page for the SemanticQuery model (#486)

bbf1977

Update the semantic search README (#486)

6209c64

JeltevanBoheemen approved these changes Aug 2, 2021

View reviewed changes

jgonggrijp merged commit 77dc765 into develop Aug 2, 2021

jgonggrijp deleted the feature/sem-search-routing branch August 2, 2021 18:23

jgonggrijp added this to the Next release milestone Aug 3, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Semantic search storage and routing #495

Semantic search storage and routing #495

jgonggrijp commented Jul 29, 2021

jgonggrijp commented Jul 30, 2021

JeltevanBoheemen left a comment

JeltevanBoheemen Aug 2, 2021

JeltevanBoheemen Aug 2, 2021

jgonggrijp Aug 2, 2021

JeltevanBoheemen Aug 2, 2021

jgonggrijp Aug 2, 2021

JeltevanBoheemen Aug 3, 2021

jgonggrijp commented Aug 2, 2021

Semantic search storage and routing #495

Semantic search storage and routing #495

Conversation

jgonggrijp commented Jul 29, 2021

jgonggrijp commented Jul 30, 2021

JeltevanBoheemen left a comment

Choose a reason for hiding this comment

JeltevanBoheemen Aug 2, 2021

Choose a reason for hiding this comment

JeltevanBoheemen Aug 2, 2021

Choose a reason for hiding this comment

jgonggrijp Aug 2, 2021

Choose a reason for hiding this comment

JeltevanBoheemen Aug 2, 2021

Choose a reason for hiding this comment

jgonggrijp Aug 2, 2021

Choose a reason for hiding this comment

JeltevanBoheemen Aug 3, 2021

Choose a reason for hiding this comment

jgonggrijp commented Aug 2, 2021