Evaluation parallel bug fix #1406

Conversation


@vishalvanpariya commented Feb 22, 2024

Fixes the evaluation batch parallel processing bug.
Bug: #1358

Additional Information

New backend dependency: nest-asyncio

@vishalvanpariya (Author) commented:

@aakrem @mmabrouk

@aakrem requested a review from aybruhm on February 22, 2024 09:08

@aybruhm (Member) left a comment


Thank you for the PR, @vishalvanpariya!

Unfortunately, it still doesn't solve the issue:

The LLM app invocation occurs for all data points before any evaluation begins. As a result, we must wait for the complete invocation process to finish across all data points before initiating any evaluations. This leads to significant delays in obtaining evaluation results, particularly for large test sets.

What we want to achieve is to start receiving evaluation outputs immediately after the first data point is processed.
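For illustration, a minimal standalone sketch of the behaviour described above, using asyncio.as_completed; the helpers invoke_app and evaluate are hypothetical stand-ins, not Agenta code. Results are consumed as soon as each invocation finishes, so the evaluation of a data point does not have to wait for the whole test set:

import asyncio

async def invoke_app(i: int) -> str:
    # Hypothetical stand-in for one LLM app invocation
    await asyncio.sleep(0.5 * (i % 3))
    return f"app output {i}"

async def evaluate(output: str) -> str:
    # Hypothetical stand-in for running one evaluator on a single output
    await asyncio.sleep(0.1)
    return f"evaluated: {output}"

async def main():
    invocations = [invoke_app(i) for i in range(5)]
    # as_completed yields awaitables in completion order, so evaluation can
    # start on the first finished data point while the others are still running
    for finished in asyncio.as_completed(invocations):
        output = await finished
        print(await evaluate(output))

asyncio.run(main())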

@mmabrouk (Member) commented:

@vishalvanpariya Thanks a lot for the PR!
@aybruhm I think there is some confusion about the issue number. The issue that @aakrem created is for a different feature (the one you cited). This PR, however, fixes another bug: right now we are not calling the LLM apps in batches as we say we do! The calls within each batch are made in sequence rather than in parallel as they should be, which is why evaluation takes an eternity on long test sets.
So to summarize, this does not close #1358; it fixes a bug that currently has no issue filed.
Can you please review the code accordingly, @aybruhm? Thank you!
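To illustrate the difference being described (a standalone sketch with dummy coroutines, not the actual evaluation code): awaiting each call inside the loop serialises the batch, while asyncio.gather keeps all the calls in flight at once.

import asyncio
import time

async def fake_llm_call(i: int) -> str:
    # Dummy stand-in for one LLM app invocation taking ~1 second
    await asyncio.sleep(1)
    return f"output {i}"

async def sequential_batch(n: int) -> list:
    # Current behaviour: each await blocks the next call, so ~n seconds total
    return [await fake_llm_call(i) for i in range(n)]

async def parallel_batch(n: int) -> list:
    # Intended behaviour: all calls run concurrently, so ~1 second total
    return await asyncio.gather(*(fake_llm_call(i) for i in range(n)))

async def main():
    start = time.perf_counter()
    await sequential_batch(5)
    print(f"sequential: {time.perf_counter() - start:.1f}s")  # ~5.0s

    start = time.perf_counter()
    await parallel_batch(5)
    print(f"parallel: {time.perf_counter() - start:.1f}s")  # ~1.0s

asyncio.run(main())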

@mmabrouk (Member) left a comment


Thanks again for the PR @vishalvanpariya!
If I understand correctly, we are adding nest_asyncio to be able to nest the run_batch calls. Is there a way to change the logic so that we avoid the problem without needing nest_asyncio? It seems nest_asyncio.apply() patches asyncio, which I am not very keen on as it might have side effects (what are your thoughts, @aybruhm?)
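For reference, a minimal sketch of what the nest-asyncio dependency is typically used for (the helpers below are hypothetical, not this PR's diff): nest_asyncio.apply() patches asyncio so that a blocking entry point such as asyncio.run can be re-entered from code that is already running inside an event loop.

import asyncio
import nest_asyncio

nest_asyncio.apply()  # patches asyncio to allow nested event loop usage

async def invoke_llm_app(datapoint: str) -> str:
    # Hypothetical stand-in for the HTTP call to the deployed app
    await asyncio.sleep(0.1)
    return f"output for {datapoint}"

def sync_helper(datapoint: str) -> str:
    # Without nest_asyncio, calling asyncio.run from inside a running loop
    # raises "asyncio.run() cannot be called from a running event loop"
    return asyncio.run(invoke_llm_app(datapoint))

async def main():
    print(sync_helper("first datapoint"))

asyncio.run(main())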

@aybruhm (Member) commented Feb 25, 2024


Yes, you're right. I am also concerned about patching the event loop, as it may result in unpredictable behaviour.

@vishalvanpariya here's what I suggest doing:

async def run_batch(start_idx: int):
    # ... (existing code up to the loop)

    tasks = []  # Store the tasks for parallel execution
    for index in range(start_idx, end_idx):
        task = asyncio.create_task(
            run_with_retry(
                uri,
                testset_data[index],
                parameters,
                max_retries,
                retry_delay,
                openapi_parameters,
            )
        )
        tasks.append(task)

    # Gather the results of all tasks
    results = await asyncio.gather(*tasks)
    for result in results:
        list_of_app_outputs.append(result)
        print(f"Adding outputs to batch {start_idx}")
The create_task() function schedules the coroutine to run and returns a task object that we can use to monitor its status, cancel it, or await its completion. gather() then runs these coroutines concurrently and waits for all of them to complete before continuing.
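As a small follow-up on the same suggestion (a sketch under the same assumptions as the snippet above, reusing its names such as run_with_retry and end_idx): gather() also accepts coroutines directly and wraps each one in a task itself, so the explicit create_task() calls can be dropped when the task handles are not needed.

    # Compact variant of the loop above: gather wraps each coroutine in a task
    results = await asyncio.gather(*(
        run_with_retry(
            uri,
            testset_data[index],
            parameters,
            max_retries,
            retry_delay,
            openapi_parameters,
        )
        for index in range(start_idx, end_idx)
    ))
    list_of_app_outputs.extend(results)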

@mmabrouk (Member) commented:

Thanks a lot, @vishalvanpariya! We have fixed the bug in a different PR; it should be live in the next OSS release in an hour or so. Sorry it took so long.

@mmabrouk (Member) commented:

@all-contributors please add @vishalvanpariya for code

@all-contributors bot (Contributor) commented:
@mmabrouk

I've put up a pull request to add @vishalvanpariya! 🎉

@vishalvanpariya (Author) commented:

Thanks @mmabrouk
