Add streaming API #463

Draft · jpsamaroo wants to merge 57 commits into master
Conversation

@jpsamaroo (Member) commented on Dec 21, 2023

Adds a spawn_streaming task queue to transform tasks into continuously-executing equivalents that automatically take from input streams/channels and put their results to an output stream/channel. Useful for processing tons of individual elements of some large (or infinite) collection.
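A minimal usage sketch, assuming the `spawn_streaming`/`finish_stream` names from this PR (the exact surface and termination semantics may still change):

```julia
using Dagger

# Each task in the block becomes a continuously-executing streaming task:
# on every iteration it takes fresh values from its input streams and puts
# its result into its output stream.
Dagger.spawn_streaming() do
    vals = Dagger.@spawn begin
        x = rand()
        # Assumed termination hook: mark x as the final value and close the stream.
        x > 0.99 ? Dagger.finish_stream(x) : x
    end
    # Downstream task: consumes one value from `vals` per iteration.
    Dagger.@spawn println(vals)
end
```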

Todo:

  • Migrate streams on first use
  • Add per-task input buffering to Stream object
  • Add no-allocation ring buffer for process-local put/take to Stream
  • Make buffering amount configurable
  • Add API for constructing streams based on inferred return type, desired buffer size, and source/destination
  • Allow finish_stream(xyz; return=abc) to return custom value (else nothing)
  • Upstream MemPool migration changes (Add DRef migration support JuliaData/MemPool.jl#80)
  • Add docs
  • Add tests
  • (Optional) Adapt ring buffer to support server-local put/take (use mmap?)
  • (Optional) Make value fetching configurable
  • (Optional) Support a waitany-style input stream, taking inputs from multiple tasks
  • (Optional) take! from input streams concurrently, and waitall on them before continuing
  • (Optional) put! into output streams concurrently, and waitall on them before continuing
  • (Optional) Allow using default or previously-cached value if sender not ready
  • (Optional) Allow dropping stream values (after timeout, receiver not ready, over-pressured, etc.)
  • (Optional) Add utility for tracking stream transfer rates (Add streaming throughput monitor #494)
  • (Optional) Add programmable behavior on upstream/downstream Stream closure (how should errors/finishing propagate?)

@JamesWrigley (Collaborator) commented:
Am I correct in thinking that all the necessary items except for tests are complete?

@jpsamaroo (Member, Author) replied:

Generally yes, I think we're pretty close to this being merge-ready. There are some remaining TODOs that I need to finish, but most are reasonably small. I could definitely use help with writing tests - just validating that we can run various kinds of pipelines and that they work across multiple workers would be really useful.
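A rough sketch of the kind of test being asked for, assuming the API above plus `Dagger.scope` for worker placement; the `RemoteChannel` is only there so the test process can observe the streamed values, and clean pipeline termination is elided:

```julia
using Distributed
addprocs(2)                              # two extra workers to exercise cross-worker streaming
@everywhere using Dagger
using Test

@testset "cross-worker streaming pipeline" begin
    results = RemoteChannel(() -> Channel{Float64}(100))
    Dagger.spawn_streaming() do
        # Pin source and sink to different workers to exercise remote transfer.
        vals = Dagger.@spawn scope=Dagger.scope(worker=workers()[1]) rand()
        Dagger.@spawn scope=Dagger.scope(worker=workers()[2]) put!(results, vals)
    end
    # Pull a handful of streamed values back to the test process and sanity-check them.
    taken = [take!(results) for _ in 1:10]
    @test all(v -> 0 <= v < 1, taken)
end
```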

Commits pushed to this PR include:

  • Instead of taking/putting values sequentially (which may block), runs "pull" and "push" tasks for each input and output, respectively. Uses buffers to communicate values between pullers/pushers and the streaming task, instead of only using one buffer per task-to-task connection. (See the sketch after this list.)
  • Switch from RemoteFetcher to RemoteChannelFetcher
  • Pass object rather than type to `stream_{push,pull}_values!`
  • ProcessRingBuffer: Don't exit on graceful interrupt when non-empty
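A simplified sketch of the pull/push structure described in the first commit above, using plain `Channel`s and `Threads.@spawn` as stand-ins for the PR's `Stream`/`ProcessRingBuffer` internals (all names here are illustrative, not the actual implementation):

```julia
# For each input stream a dedicated "pull" task copies values into a local
# buffer, and for each output stream a "push" task drains a local buffer into
# the stream. The streaming task itself only touches the local buffers, so a
# slow remote peer blocks its puller/pusher rather than the computation.
function run_streaming_task(f, input_streams, output_streams; buffer_size=1024)
    in_bufs  = [Channel{Any}(buffer_size) for _ in input_streams]
    out_bufs = [Channel{Any}(buffer_size) for _ in output_streams]

    map(zip(input_streams, in_bufs)) do (stream, buf)
        Threads.@spawn begin
            for v in stream          # pull from the (possibly remote) input
                put!(buf, v)
            end
            close(buf)               # propagate upstream closure to the local buffer
        end
    end

    pushers = map(zip(output_streams, out_bufs)) do (stream, buf)
        Threads.@spawn for v in buf  # push buffered results to the output
            put!(stream, v)
        end
    end

    try
        while true
            args = map(take!, in_bufs)                 # blocks until every input has a value
            result = f(args...)
            foreach(buf -> put!(buf, result), out_bufs)
        end
    catch err
        # take! throws once an input buffer is closed and drained: graceful shutdown.
        err isa InvalidStateException || rethrow()
    finally
        foreach(close, out_bufs)
        foreach(wait, pushers)       # let pushers flush remaining values before returning
    end
end
```

Per the last commit above, the PR's ring buffer additionally stays alive on a graceful interrupt while it still holds values; that behavior is not modeled in this sketch.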