add concurrency to the discover #29

keyn4 · 2024-03-11T22:25:43Z

Description of change

add concurrency to the discover to make it faster

hsyyid · 2024-03-12T00:34:42Z

tap_salesforce/__init__.py

+            ]
+            for sobject_name in chunk]
+        run_concurrently(discover_stream, chunk_args)
+        start_counter = end_counter

    for sobject_name in sorted(objects_to_discover):


Shouldn't we get rid of the for loop now since the work is being done above?

old loop deleted

butkeraites-hotglue

Excellent work! Left two suggestions feel free to apply them or not 👍

butkeraites-hotglue · 2024-09-16T20:13:03Z

tap_salesforce/__init__.py

+    results = []
+
+    for future in as_completed(all_tasks):
+        (index, result) = future.result()
+        # Insert the result in the right index of the list
+        results.insert(index, result)


Suggested change

results = []

for future in as_completed(all_tasks):

(index, result) = future.result()

# Insert the result in the right index of the list

results.insert(index, result)

results = [None] * len(fn_args_list) # Preallocate list for correct ordering

for future in as_completed(all_tasks):

index, result = future.result()

results[index] = result

It's just a suggestion as it's not necessary to do this memory allocation dynamically.

butkeraites-hotglue · 2024-09-16T20:19:28Z

tap_salesforce/__init__.py

+    objects_list = sorted(objects_to_discover)
+    start_counter = 0
+    concurrency_limit = 25
+
+    while start_counter < len(objects_list):
+        end_counter = start_counter + concurrency_limit
+        if end_counter >= len(objects_list):
+            end_counter = len(objects_list)
+
+        chunk = objects_list[start_counter:end_counter]
+        chunk_args = [
+            [
+                sf,
+                sobject_name,
+                entries,
+                sf_custom_setting_objects,
+                object_to_tag_references,
+            ]
+            for sobject_name in chunk]
+        run_concurrently(discover_stream, chunk_args)
+        start_counter = end_counter


Suggested change

objects_list = sorted(objects_to_discover)

start_counter = 0

concurrency_limit = 25

while start_counter < len(objects_list):

end_counter = start_counter + concurrency_limit

if end_counter >= len(objects_list):

end_counter = len(objects_list)

chunk = objects_list[start_counter:end_counter]

chunk_args = [

[

sf,

sobject_name,

entries,

sf_custom_setting_objects,

object_to_tag_references,

]

for sobject_name in chunk]

run_concurrently(discover_stream, chunk_args)

start_counter = end_counter

objects_list = sorted(objects_to_discover)

concurrency_limit = 25

for start_counter in range(0, len(objects_list), concurrency_limit):

chunk = objects_list[start_counter:start_counter + concurrency_limit]

chunk_args = [

(sf, sobject_name, entries, sf_custom_setting_objects, object_to_tag_references)

for sobject_name in chunk

]

run_concurrently(discover_stream, chunk_args)

Seems a little bit clearer like that

keyn4 added 2 commits March 11, 2024 17:24

add concurrency to the discover

2ba239b

format

a404ed7

hsyyid requested changes Mar 12, 2024

View reviewed changes

delete old loop

edb0013

butkeraites-hotglue approved these changes Sep 16, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add concurrency to the discover #29

add concurrency to the discover #29

keyn4 commented Mar 11, 2024

hsyyid Mar 12, 2024

keyn4 Mar 12, 2024

butkeraites-hotglue left a comment

butkeraites-hotglue Sep 16, 2024

butkeraites-hotglue Sep 16, 2024

add concurrency to the discover #29

Are you sure you want to change the base?

add concurrency to the discover #29

Conversation

keyn4 commented Mar 11, 2024

Description of change

hsyyid Mar 12, 2024

Choose a reason for hiding this comment

keyn4 Mar 12, 2024

Choose a reason for hiding this comment

butkeraites-hotglue left a comment

Choose a reason for hiding this comment

butkeraites-hotglue Sep 16, 2024

Choose a reason for hiding this comment

butkeraites-hotglue Sep 16, 2024

Choose a reason for hiding this comment