perf: [LS-2561] Stream compress multipart runs #1316

angus-langchain · 2024-12-10T00:25:49Z

Purpose

Compresses multipart runs into a buffer and streams the compressed data to the API. Compatible with https://github.com/langchain-ai/langchainplus/pull/7317
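A minimal sketch of the idea described above, not the PR's actual code: each run is encoded as a multipart part and written straight into a streaming compressor that appends to an in-memory buffer, so the full uncompressed payload is never held at once. The PR uses `zstandard`; the stdlib `zlib` stands in here so the sketch runs without extra dependencies. `CompressedBuffer`, `add_part`, `finish`, and the part names are hypothetical.

```python
import io
import json
import uuid
import zlib

BOUNDARY = uuid.uuid4().hex  # assumption: one boundary shared by the process


class CompressedBuffer:
    """Hypothetical helper: multipart parts go in, compressed bytes accumulate."""

    def __init__(self, level: int = 1) -> None:
        self.buffer = io.BytesIO()
        # zlib is a stand-in for the zstandard stream writer used in the PR.
        self._compressor = zlib.compressobj(level)
        self.run_count = 0

    def add_part(self, name: str, payload: dict) -> None:
        """Encode one payload as a multipart part and compress it into the buffer."""
        body = json.dumps(payload).encode("utf-8")
        part = b"".join(
            (
                f"--{BOUNDARY}\r\n".encode(),
                f'Content-Disposition: form-data; name="{name}"\r\n'.encode(),
                b"Content-Type: application/json\r\n",
                f"Content-Length: {len(body)}\r\n\r\n".encode(),
                body,
                b"\r\n",
            )
        )
        # compress() may return b"" while the compressor buffers internally.
        self.buffer.write(self._compressor.compress(part))
        self.run_count += 1

    def finish(self) -> bytes:
        """Write the closing boundary, flush the compressor, return the body."""
        closing = f"--{BOUNDARY}--\r\n".encode()
        self.buffer.write(self._compressor.compress(closing))
        self.buffer.write(self._compressor.flush())
        return self.buffer.getvalue()
```

The bytes returned by `finish()` would be what gets sent as the request body; in the PR the buffer is streamed to the API rather than returned in one piece.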

nfcampos · 2024-12-11T01:08:59Z

python/langsmith/client.py

+        if self.compress_traces:
+            self.boundary = BOUNDARY
+            self.compressed_runs_buffer: io.BytesIO = io.BytesIO()
+            self.compressor_writer: zstd.ZstdCompressionWriter = zstd.ZstdCompressor(level=3).stream_writer(


I'd probably use the lowest level; we want to minimize CPU usage.

With level=1 we only get a compression ratio of 6.7, versus 18+ at the current level=3; level=2 puts us around 12. I'm open to experimenting with this to see whether we still OOM with level=1.
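The trade-off discussed here can be measured with a small script. The sketch below is illustrative only: it uses the stdlib `zlib` rather than `zstandard`, and synthetic JSON rather than real trace data, so its ratios will not match the 6.7 / 12 / 18+ figures quoted in the thread. `compression_ratio` is a hypothetical helper.

```python
import json
import zlib


def compression_ratio(data: bytes, level: int) -> float:
    """Return uncompressed size divided by compressed size at the given level."""
    return len(data) / len(zlib.compress(data, level))


# Synthetic, repetitive JSON standing in for trace payloads (an assumption).
sample = json.dumps(
    [
        {"id": i, "name": "ChatModel", "inputs": {"prompt": "hello world " * 8}}
        for i in range(500)
    ]
).encode("utf-8")

ratios = {level: compression_ratio(sample, level) for level in (1, 3, 6)}
for level, ratio in ratios.items():
    print(f"level={level}: ratio={ratio:.1f}")
```

Timing each level alongside the ratio would give the CPU side of the comparison.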

hinthornw

We will need to copy zstandard's BSD license somewhere:

Copyright (c) 2016, Gregory Szorc
All rights reserved.

Redistribution and use in source and binary forms, with or without modification,
are permitted provided that the following conditions are met:

1. Redistributions of source code must retain the above copyright notice, this
list of conditions and the following disclaimer.

2. Redistributions in binary form must reproduce the above copyright notice,
this list of conditions and the following disclaimer in the documentation
and/or other materials provided with the distribution.

3. Neither the name of the copyright holder nor the names of its contributors
may be used to endorse or promote products derived from this software without
specific prior written permission.

THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND
ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED
WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR
ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES
(INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES;
LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON
ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
(INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS
SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

hinthornw · 2024-12-19T19:35:36Z

python/pyproject.toml

@@ -37,6 +37,7 @@ requests-toolbelt = "^1.0.0"

 # Enabled via `langsmith_pyo3` extra: `pip install langsmith[langsmith_pyo3]`.
 langsmith-pyo3 = { version = "^0.1.0rc2", optional = true }
+zstandard = "^0.23.0"


Looks like they have a conda project so should be good, but we may have to manually update our conda distro for the SDK
https://anaconda.org/conda-forge/langsmith
https://anaconda.org/conda-forge/zstandard

Thanks for the heads up. Would this issue come up when creating a new version of the sdk?

Yeah, upon release. It should be straightforward to support; just noting it!

hinthornw · 2024-12-19T19:48:07Z

python/langsmith/_internal/_background_thread.py

+            )
+
+        if (size_limit_bytes is None or current_size < size_limit_bytes) and (
+            size_limit is None or client._run_count < size_limit


This is the limit that is currently used. It also performed well in flush-time benchmarks.

It can also be overridden by the batch ingest config.
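The condition in the diff above can be read as "the buffer is still under both limits". A minimal sketch of that check as a standalone function follows; `below_limits` is a hypothetical name, and what the caller does when it returns `True` (presumably keep buffering rather than send) is an assumption, since the diff is truncated.

```python
from typing import Optional


def below_limits(
    current_size: int,
    run_count: int,
    size_limit_bytes: Optional[int],
    size_limit: Optional[int],
) -> bool:
    """True while the buffer is under both limits; None disables a limit."""
    under_bytes = size_limit_bytes is None or current_size < size_limit_bytes
    under_count = size_limit is None or run_count < size_limit
    return under_bytes and under_count


# 20_971_520 is the default byte limit shown in _constants.py;
# the run-count limit of 100 is an illustrative value.
print(below_limits(1024, 10, 20_971_520, 100))
print(below_limits(20_971_520, 10, 20_971_520, 100))
```

Because both clauses must hold, reaching either the byte limit or the run-count limit is enough to make the check fail.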

hinthornw · 2024-12-21T01:41:42Z

python/langsmith/_internal/_constants.py

 _SIZE_LIMIT_BYTES = 20_971_520  # 20MB by default
 _AUTO_SCALE_UP_QSIZE_TRIGGER = 200
 _AUTO_SCALE_UP_NTHREADS_LIMIT = 32
 _AUTO_SCALE_DOWN_NEMPTY_TRIGGER = 4
 _BLOCKSIZE_BYTES = 1024 * 1024  # 1MB
+_BOUNDARY = uuid.uuid4().hex


Does this need to be unique every time, or should we hardcode it here?

angus-langchain added 3 commits December 9, 2024 16:10

start of compression

a5efbb8

manually encode

c10a8ad

set boundary

d456776

angus-langchain changed the title ~~Angus/stream compress multipart~~ perf: Stream compress multipart runs Dec 10, 2024

angus-langchain added 2 commits December 9, 2024 17:11

add zstandard

0359235

set limits from config

08bce98

angus-langchain requested a review from agola11 December 10, 2024 01:18

angus-langchain added 8 commits December 9, 2024 17:22

add slots

800472a

lint

d4e45e1

fix mypy

46c8740

implement correct timeouts

a5cba8d

stream instead of read data from buffer

c32cb0c

fix client type

2aafc20

separate compressed buffer from tracing queue

03e4e02

clean up update multipart

c952536

agola11 reviewed Dec 10, 2024

angus-langchain added 8 commits December 10, 2024 09:54

address comments

c3fdd36

just write directly to compressor instead of streaming

d6b186d

set trcing queue to none

f061c0b

send multipart req

8a0a60e

remove flush

b2113ba

remove print

982429a

set compression level

5b58940

remove prints

f2e5ed9

nfcampos reviewed Dec 11, 2024

angus-langchain marked this pull request as ready for review December 11, 2024 01:10

angus-langchain added 7 commits December 12, 2024 15:49

add flush method

5b0cca4

return early

f59f7be

lint

b76e662

wait

efa4bd6

signal bg threads data is available instead of sleeping

0e06bde

improve buffer checks

f997895

mypy

fbc217f

angus-langchain changed the title ~~perf: Stream compress multipart runs~~ perf: [LS-2561] Stream compress multipart runs Dec 13, 2024

angus-langchain added 4 commits December 18, 2024 14:06

Use more threads for backend requests

7b6c201

fix futures waiting

35d46ed

remove unused slot

dbef2ec

Flush background threads

6fad596

hinthornw reviewed Dec 19, 2024

angus-langchain added 9 commits December 19, 2024 13:04

make boundary constant

5cc947a

Remove slot for bool val

d4b2aa4

Use a single join() rather than copying the header strings

63e55f7

Add zstandard license

874c748

lint

0b3d6b8

Create compressed runs object

7739939

Make zstd optional

3ec9b6e

Make zstandard level configurable

f9aac67

mypy ignore optional imports

2ac7a35

angus-langchain force-pushed the angus/stream-compress-multipart branch from 856e4a4 to 2ac7a35 Compare December 20, 2024 16:50

angus-langchain added 2 commits December 20, 2024 11:51

lint

3b291c3

poetry lock

c64fb92

hinthornw reviewed Dec 21, 2024

hinthornw approved these changes Dec 21, 2024

angus-langchain merged commit 0243f79 into main Dec 21, 2024
9 checks passed

angus-langchain deleted the angus/stream-compress-multipart branch December 21, 2024 15:51
