Support bulk-fetch using JOIN #171

andyjefferson · 2017-03-07T13:42:36Z

If we have a JDOQL query like
SELECT FROM Person WHERE this.firstName == :value

then this becomes
SELECT P.* FROM PERSON P WHERE P.FIRST_NAME = ?

If a Person has a Set<Address> field named addresses

then if the addresses field is in the fetch plan we already support a bulk-fetch mode "EXISTS" giving SQL of
SELECT A.* FROM ADDRESS A WHERE EXISTS (SELECT P.ID FROM PERSON P WHERE P.FIRST_NAME = ? AND A.PERSON_ID = P.ID)

We could potentially have a bulk-fetch mode "JOIN" as
SELECT A.* FROM PERSON P, ADDRESS A WHERE A.PERSON_ID = P.ID AND P.FIRST_NAME = ?

The reason this is more complicated than the EXISTS case is that for EXISTS we can use the backing store's getIteratorStatement for the basic statement and then put the original query in an EXISTS clause. Here we need to start from the basic query (with its select clause cleared) and then add the join to the element, while catering for all the different combinations of set/list/collection, whether with embedded elements or not, and whether via FK or JoinTable.
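To make the comparison concrete, here is a minimal sketch that runs the EXISTS statement and the proposed JOIN statement from above against an in-memory SQLite database and groups the returned ADDRESS rows by owner. It is only an illustration, not DataNucleus code: SQLite stands in for the real datastore, the STREET column and the helper names (build_db, bulk_fetch, group_by_owner) are hypothetical, and an ORDER BY is appended to both statements purely so the results can be compared.

```python
import sqlite3
from collections import defaultdict

# Existing "EXISTS" bulk-fetch statement from the issue (ORDER BY added for comparison).
EXISTS_SQL = (
    "SELECT A.* FROM ADDRESS A WHERE EXISTS ("
    "SELECT P.ID FROM PERSON P WHERE P.FIRST_NAME = ? AND A.PERSON_ID = P.ID) "
    "ORDER BY A.ID"
)

# Proposed "JOIN" bulk-fetch statement from the issue (ORDER BY added for comparison).
JOIN_SQL = (
    "SELECT A.* FROM PERSON P, ADDRESS A "
    "WHERE A.PERSON_ID = P.ID AND P.FIRST_NAME = ? "
    "ORDER BY A.ID"
)


def build_db():
    """Create a tiny FK-mapped Person/Address schema (STREET is a made-up column)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE PERSON (ID INTEGER PRIMARY KEY, FIRST_NAME TEXT);
        CREATE TABLE ADDRESS (
            ID INTEGER PRIMARY KEY,
            PERSON_ID INTEGER REFERENCES PERSON(ID),
            STREET TEXT
        );
        INSERT INTO PERSON VALUES (1, 'Ann'), (2, 'Bob'), (3, 'Ann');
        INSERT INTO ADDRESS VALUES
            (10, 1, 'A St'), (11, 1, 'B St'), (12, 2, 'C St'), (13, 3, 'D St');
        """
    )
    return conn


def bulk_fetch(conn, sql, first_name):
    """Run one bulk-fetch statement with the original query's parameter."""
    return conn.execute(sql, (first_name,)).fetchall()


def group_by_owner(rows):
    """Attach fetched address ids to their owning person id."""
    grouped = defaultdict(list)
    for address_id, person_id, _street in rows:
        grouped[person_id].append(address_id)
    return dict(grouped)


if __name__ == "__main__":
    conn = build_db()
    for name, sql in (("EXISTS", EXISTS_SQL), ("JOIN", JOIN_SQL)):
        print(name, group_by_owner(bulk_fetch(conn, sql, "Ann")))
```

With a one-to-many FK mapping like this one, each ADDRESS row has a single owner, so both statements return the same rows; the sketch says nothing about the embedded-element or JoinTable cases.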


Hexiaoqiao · 2017-11-09T06:53:39Z

Any plan to optimize the generated SQL?

andyjefferson · 2017-11-09T07:08:59Z

No. That depends on contributions, this being open source and all, as per the "unresourced" tag on this issue.

shawnweeks · 2018-04-07T02:36:49Z

Another option that is supported on several databases is to use an "IN" clause instead of "EXISTS", which would be implicitly converted to a join without any of the risks associated with inadvertent many-to-many relationships. That would probably be a lot easier to implement than the join, as the SQL is closely related to EXISTS. I normally work on Hadoop projects, but since I'm looking at using some of this I'll start getting familiar with the code base and see if I can help.
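As a rough illustration of the many-to-many risk, the sketch below compares an IN statement with a plain join when the relation goes through a join table. Everything beyond the PERSON and ADDRESS table names is an assumption for the example: the PERSON_ADDRESS join table, the STREET column, the helper names, and SQLite as the database. The IN statement is one possible way to write it, not SQL that DataNucleus generates.

```python
import sqlite3

# One possible "IN" form: each ADDRESS row is returned at most once.
IN_SQL = (
    "SELECT A.* FROM ADDRESS A WHERE A.ID IN ("
    "SELECT PA.ADDRESS_ID FROM PERSON_ADDRESS PA, PERSON P "
    "WHERE PA.PERSON_ID = P.ID AND P.FIRST_NAME = ?) "
    "ORDER BY A.ID"
)

# Plain join through the (hypothetical) join table: an ADDRESS row is
# repeated once per matching owner.
JOIN_SQL = (
    "SELECT A.* FROM PERSON P, PERSON_ADDRESS PA, ADDRESS A "
    "WHERE PA.PERSON_ID = P.ID AND PA.ADDRESS_ID = A.ID AND P.FIRST_NAME = ? "
    "ORDER BY A.ID"
)


def build_join_table_db():
    """Create a schema where address 10 is shared by two people named Ann."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE PERSON (ID INTEGER PRIMARY KEY, FIRST_NAME TEXT);
        CREATE TABLE ADDRESS (ID INTEGER PRIMARY KEY, STREET TEXT);
        CREATE TABLE PERSON_ADDRESS (PERSON_ID INTEGER, ADDRESS_ID INTEGER);
        INSERT INTO PERSON VALUES (1, 'Ann'), (2, 'Bob'), (3, 'Ann');
        INSERT INTO ADDRESS VALUES (10, 'A St'), (11, 'B St');
        INSERT INTO PERSON_ADDRESS VALUES (1, 10), (3, 10), (2, 11);
        """
    )
    return conn


def fetch_address_ids(conn, sql, first_name):
    """Return the ADDRESS.ID of every row the statement produces."""
    return [row[0] for row in conn.execute(sql, (first_name,))]


if __name__ == "__main__":
    conn = build_join_table_db()
    print("IN  :", fetch_address_ids(conn, IN_SQL, "Ann"))
    print("JOIN:", fetch_address_ids(conn, JOIN_SQL, "Ann"))
```

Whether the repeated rows from the join are a problem depends on how the results are mapped back to owners; the sketch only shows that the two statements stop being row-for-row equivalent once an element can have more than one matching owner.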

shawnweeks · 2018-04-08T14:55:58Z

Based on some testing against PostgreSQL 10.3, MariaDB 10.2 and Oracle 12.2, this optimization is already happening with "EXISTS" and "IN". I can post the test scripts for other folks to look at, but assuming you can use a relatively modern release of your database software, you're already getting the benefit of using a join.

andyjefferson · 2018-04-08T17:04:36Z

Thx for your input, interesting to hear.

A comparison of the 3 "bulk"/"batch" options (EXISTS, IN, JOIN) for EclipseLink JPA is present on this link https://java-persistence-performance.blogspot.co.uk/2010/08/batch-fetching-optimizing-object-graph.html

andyjefferson added the enhancement label Mar 7, 2017

andyjefferson mentioned this issue Mar 11, 2017

Bulk Fetch : Low performance SQL are generated for Objects that are having lists of other objects #52

Closed

andyjefferson added the unresourced label Apr 8, 2017
