Add batching support to map operations with configurable parameters #16

Merged · 1 commit into ucbepic:orban-map-batching on Sep 30, 2024

Conversation

@orban (Contributor) commented Sep 26, 2024

Summary

This pull request introduces batching support to map operations as described in issue #7, with the aim of significantly enhancing performance and reducing costs when processing small documents. Key updates include new batching parameters, implementation of batching logic, configuration enhancements, expanded testing, and documentation improvements. Additionally, Pydantic models have been introduced in schemas.py to simplify and streamline validation logic.


Main Changes

  1. Batching Support in Map Operations

    • New Parameters: Added batch_size and clustering_method to the map operation interface to enable batching functionality.
    • Batching Logic: Implemented logic to group documents based on the specified batch size and clustering method, optimizing the efficiency of LLM calls.
    • LLM Call Handling: Updated LLM calls to handle batched inputs and ensure accurate mapping of outputs back to individual documents.
    • Configuration Updates: Modified the YAML configuration format to support the new batch-related parameters, allowing users to easily configure batching behavior.
  2. Testing Enhancements

    • Comprehensive Unit Tests: Added unit tests covering a range of batch sizes and clustering methods to confirm that functionality and accuracy hold across scenarios (a sketch of the test pattern follows this list).
  3. Documentation Improvements

    • Updated Documentation: Expanded documentation with detailed explanations, practical examples, and best practices for utilizing batching in map operations.
  4. Validation Logic Simplification

    • Introduction of Pydantic Models: Incorporated Pydantic models into schemas.py to simplify validation logic and improve code maintainability.
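
In that spirit, here is a self-contained sketch of the parametrized test pattern described under item 2 (run_map_with_batching is a toy stand-in, not the project's actual API):

import pytest

def run_map_with_batching(docs, batch_size, clustering_method=None):
    # Toy stand-in for the real map operation: process documents in chunks of
    # batch_size and return exactly one output per input document.
    outputs = []
    for i in range(0, len(docs), batch_size):
        for doc in docs[i : i + batch_size]:
            outputs.append({"summary": doc.upper()})
    return outputs

@pytest.mark.parametrize("clustering_method", [None, "random"])
@pytest.mark.parametrize("batch_size", [1, 2, 5])
def test_batching_preserves_one_to_one(batch_size, clustering_method):
    docs = ["a", "b", "c", "d"]
    results = run_map_with_batching(docs, batch_size, clustering_method)
    # Batching must not change the 1:1 input-to-output contract of map.
    assert len(results) == len(docs)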

New Pydantic Models

  • ToolFunction: Defines the structure of a tool function with fields for name, description, and parameters.
  • Tool: Represents a tool with fields for code and function.
  • OutputSchema: Specifies the output schema using a schema field.
  • MapOperationConfig: Configures map operations with optional fields such as drop_keys, prompt, output, model, and tools, including a validator for drop_keys.
  • ParallelMapOperationConfig: Configures parallel map operations with fields for prompts, model, and tools.
  • BatchConfig: Defines batch configurations with batch_size and an optional clustering_method.
  • OperationConfig: A generic operation configuration with fields for name, type, and a union of specific operation configurations (MapOperationConfig, ParallelMapOperationConfig, BatchConfig).
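
A rough sketch of a few of these models, assuming Pydantic v2 (field types, defaults, and the drop_keys check are inferred from the bullet descriptions above, so the actual schemas.py may differ):

from typing import List, Optional
from pydantic import BaseModel, field_validator

class ToolFunction(BaseModel):
    name: str
    description: str
    parameters: dict

class Tool(BaseModel):
    code: str
    function: ToolFunction

class BatchConfig(BaseModel):
    batch_size: int
    clustering_method: Optional[str] = None

class MapOperationConfig(BaseModel):
    drop_keys: Optional[List[str]] = None
    prompt: Optional[str] = None
    output: Optional[dict] = None
    model: Optional[str] = None
    tools: Optional[List[Tool]] = None

    @field_validator("drop_keys")
    @classmethod
    def check_drop_keys(cls, v):
        # Hypothetical check; the PR only states that drop_keys has a validator.
        if v is not None and len(v) == 0:
            raise ValueError("drop_keys must not be empty")
        return v

# Example: validating the batching parameters from the YAML configuration.
BatchConfig(batch_size=8, clustering_method="random")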

These enhancements collectively improve the efficiency and usability of map operations, making it easier to process small documents at scale while maintaining accuracy and performance.

@orban (Contributor, Author) commented Sep 26, 2024

Forgot to mention that testing_basic.py was split into testing_map.py and testing_map_parallel.py to group the tests more logically. Common pytest fixtures have also been moved to conftest.py, following pytest convention.

@shreyashankar (Collaborator) commented:

Wow! This looks so thorough. I will review today 🙏🙌🏽

@shreyashankar (Collaborator) commented:

Any reason for introducing the Flask dependency?

@orban (Contributor, Author) commented Sep 27, 2024

Great question! The Flask dependency was initially introduced as a safeguard against XSS and RCE vulnerabilities by ensuring proper escaping of Jinja2 templates. However, after revisiting the issue, I've found that we can achieve the same level of security without Flask by configuring Jinja2's Environment to enable autoescaping directly.

Based on the documentation here, we can remove the Flask dependency and simply configure Jinja2 with:

from jinja2 import Environment
env = Environment(autoescape=True)

This change would enforce the necessary escaping behavior during template rendering. I’m happy to update the PR to remove the Flask dependency and handle this via Jinja2. Let me know if that works for you!
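
For illustration, the escaping behavior can be checked with a throwaway inline template (not part of the PR itself):

from jinja2 import Environment

env = Environment(autoescape=True)
template = env.from_string("Hello {{ name }}!")
# With autoescaping on, markup in the rendered value is escaped rather than
# emitted verbatim, e.g. "<script>" becomes "&lt;script&gt;".
print(template.render(name="<script>alert('xss')</script>"))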

@shreyashankar (Collaborator) commented:

Awesome, I'll let you update it to replace Flask with Jinja. We're using Jinja elsewhere too, e.g., here, so it will be good to be consistent. Thank you 🙏🏽

@orban (Contributor, Author) commented Sep 28, 2024

Worked out the last few kinks -- make tests_basic is now passing!

I also swapped out the super-unsafe eval code 💀 in favor of ASTEVAL, which runs the validation in a stripped-down environment with limited functionality.
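
For reference, the asteval usage pattern looks roughly like this (variable names are illustrative, not the PR's actual wiring):

from asteval import Interpreter

# asteval evaluates expressions in a sandboxed mini-interpreter: no imports,
# no exec/eval, and only a restricted set of safe builtins.
aeval = Interpreter()
aeval.symtable["output"] = {"summary": "batched result"}
print(aeval("len(output['summary']) > 0"))  # True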

@shreyashankar (Collaborator) commented:

Amazing, will check this out, play around with the new functionality, & merge it this weekend! Thank you for taking the time to do this 😄

@shreyashankar (Collaborator) commented:

I went through the PR and most of the changes look good. I ended up removing the parallel map operation batching, since the code did not seem to be changing the functionality. I also removed the semantic similarity functionality here.

Overall, I think there is a misunderstanding between Map operation and Reduce operations (I should be more clear in the documents). Map operations are 1:1, where the prompt that the user writes only has access to one input. Reduce operations, on the other hand, are many:1. The example you had in your documentation looked like a reduce operation--and we support semantic similarity grouping for reduce operations actually :-)

I think the basic batching that we have now (thanks to you!) is good for limiting parallelism; if there are too many documents in the input, we should not try to process all of them at the same time, so batching is good. But in the future I wonder if it's possible to batch map operations in the same prompt, while ensuring the output still matches that same 1:1 expectation.

@shreyashankar changed the base branch from main to orban-map-batching on September 30, 2024 at 23:08
@shreyashankar (Collaborator) commented:

Merging into another branch so I can create a PR that runs the tests.

@shreyashankar merged commit e533ea2 into ucbepic:orban-map-batching on Sep 30, 2024
0 of 3 checks passed
@orban (Contributor, Author) commented Oct 1, 2024

But in the future, I wonder if it's possible to batch map operations in the same prompt while ensuring the output still matches that same 1:1 expectation.

I’ve been thinking along the same lines. The updates to ParallelMapOperation haven't quite hit the mark yet since we’re still submitting all the futures at once. To handle this correctly, we’d need to batch multiple ParallelMapOperation prompts into a single LLM call.

I’ve started working on this already but paused due to the size of the current PR. If we’re aligned on batching, I’m happy to continue and get this integrated. Let me know if you want me to go ahead or focus elsewhere.

Appreciate the quick merge!

@orban deleted the map-batching branch on October 1, 2024 at 01:01
@shreyashankar (Collaborator) commented:

Sounds good to me! It would be great to also limit the number of concurrent LLM calls for the ParallelMapOperation. The PR should be a lot smaller :-) LMK if any issues come up! Thank you!
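
One generic way to cap the number of in-flight calls (purely illustrative; llm_call is a placeholder for whatever the operation actually invokes):

from concurrent.futures import ThreadPoolExecutor

def run_with_bounded_concurrency(prompts, llm_call, max_workers=4):
    # At most max_workers calls are in flight at any time, instead of
    # submitting a future for every prompt simultaneously.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(llm_call, prompts))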
