
RFC Model deployment #839

Closed
jjerphan opened this issue Aug 28, 2023 · 5 comments
Labels
ux ux design & implementation

Comments

@jjerphan
Contributor

jjerphan commented Aug 28, 2023

Context, uses-cases and motivations

This RFC stems from discussions in #823.

Model deployment is a common necessity in many programmatic workflows. The major frameworks all provide interfaces to easily download and use pre-trained models.

Hugging Face pipelines are higher-level abstractions that make it easy to use pre-trained models which have already been trained for a variety of tasks. They allow using state-of-the-art Machine Learning models in a couple of lines, be it on CPUs, on a single GPU, or on several GPUs, and they support various frameworks as back-ends.

Due to the kind of problems Flojoy aims to tackle, such pipelines might be appropriate for the problems Flojoy users face. In particular, Hugging Face pipelines exist for several relevant tasks.
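To make the "couple of lines" claim concrete, here is a minimal sketch of what a pipeline-backed node could wrap. It assumes the `transformers` package; the wrapper function is illustrative only, not Flojoy's actual node implementation:

```python
# Sketch: a Hugging Face pipeline hides model loading, preprocessing and
# post-processing behind a task name. Assumes `transformers` is installed.
def make_pipeline(task: str, device: int = -1):
    """Build a pipeline for `task`, e.g. "image-classification",
    "image-segmentation" or "audio-classification".
    device=-1 runs on CPU, 0 targets the first GPU."""
    from transformers import pipeline  # lazy import; downloads a default model on first use
    return pipeline(task, device=device)

# Usage (downloads a default pre-trained model on first run):
#   classifier = make_pipeline("image-classification")
#   classifier("cat.jpg")  # list of {"label": ..., "score": ...} dicts
```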

We might want to focus first on providing an appropriate UX for those problems, and on doing it really well.

We can later study how to configure GPU and CPU usage, and specifically how to process batches of data for efficient training and inference.

Proposed scope: integrate the most useful Hugging Face pipelines

  • Create a HUGGING_FACE_PIPELINE node for Image classification
  • Create a HUGGING_FACE_PIPELINE node for Image segmentation
  • Create a HUGGING_FACE_PIPELINE node for Audio classification

For all new nodes:

  • allow the specification of the model and the model revision to use, for reproducible results
  • point to the online documentation of models so that users can correctly understand the training procedure of those models, their intended uses, limitations, and potential biases
  • make sure to cache serialized pipelines in the Flojoy cache
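A hedged sketch of the per-node requirements above: pinning both model and revision, and deriving a deterministic cache location. The `~/.flojoy` layout and the helper names are assumptions for discussion, not Flojoy's real code:

```python
from pathlib import Path

def flojoy_cache_dir(model_id: str, revision: str) -> Path:
    """Deterministic on-disk location for a serialized pipeline.
    The directory layout is an illustrative assumption."""
    safe_id = model_id.replace("/", "--")  # flatten "org/name" for the filesystem
    return Path.home() / ".flojoy" / "models" / safe_id / revision

def make_pinned_pipeline(task: str, model_id: str, revision: str):
    """Pin both the model and its revision so results are reproducible.
    Requires the `transformers` package (imported lazily)."""
    from transformers import pipeline
    return pipeline(task, model=model_id, revision=revision)

print(flojoy_cache_dir("google/vit-base-patch16-224", "main").as_posix())
```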

References

@Roulbac
Contributor

Roulbac commented Aug 28, 2023

@jjerphan Thanks for opening this RFC!
We have a few facilities to pull models from the HF hub and cache them under ~/.flojoy, example. Happy to work together on this.

I am curious what your thoughts are about a few "bring-your-own-model" generic nodes that support .torchscript and .onnx formats for classification, dense segmentation and audio classification.
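For reference, a "bring-your-own-model" node for the `.onnx` format could be as thin as the sketch below. It assumes the `onnxruntime` package; the function name and I/O convention are illustrative, not an agreed design:

```python
def run_onnx_model(model_path: str, feed: dict) -> list:
    """Run one inference pass over a serialized ONNX model.
    `feed` maps the model's input names to numpy arrays.
    Requires the `onnxruntime` package (imported lazily)."""
    import onnxruntime as ort
    session = ort.InferenceSession(model_path)
    return session.run(None, feed)  # None = return all outputs
```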

@jjerphan
Contributor Author

jjerphan commented Aug 28, 2023

We have a few facilities to pull models from the HF hub and cache them under ~/.flojoy, example. Happy to work together on this.

Good to know! Definitely, I am interested in joining efforts so that we keep consistency in Flojoy's development.

I think this approach is simple yet has good granularity:

  • users do not need to know the technicalities of ML models to use this node, but can dig in if they want to
  • Hugging Face pipelines handle a lot of complexity for us, and everything is already documented, so maintaining them as Nodes should come at little cost

On my side, I started experimenting with flojoy-ai/nodes#253 this morning to have a simple prototype to discuss. I am waiting for critical reviews.

I am curious what your thoughts are about a few "bring-your-own-model" generic nodes that support .torchscript and .onnx formats for classification, dense segmentation and audio classification.

I think everything depends on Flojoy's users. I do not know the Flojoy user base, but I assume that Flojoy users might not be programmers and might just want to use a flow to solve their problem effectively with Nodes that are available off the shelf. In this case, supporting lower-level specifications and tools (like TorchScript's and ONNX's formats) might have little added value for end users now, though it might in the future. If I were to choose, I would not preemptively work on supporting those formats; yet I also think users who are developers can easily create their own Nodes, or even contribute support for those formats back to Flojoy.

What do you think?


Feel free to propose additions or rectifications to this RFC, and I will update it accordingly. 🙂

@jackparmer
Collaborator

I think everything depends on Flojoy's users. I do not know the Flojoy user base, but I assume that Flojoy users might not be programmers and might just want to use a flow to solve their problem effectively with Nodes that are available off the shelf.

💯 Think biologists or mechanical engineers who have likely heard of Python but don't do any coding or even command-line work themselves.

@jjerphan
Contributor Author

jjerphan commented Aug 31, 2023

Batching is not recommended when using Hugging Face Transformers' pipeline, especially on CPUs.

Processing batches of files can be done using the BATCH_PROCESSOR. For instance, one can easily classify cats and upload the predictions to Flojoy Cloud. 🐈‍⬛

[Screenshot from 2023-08-31 18-01-27]
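Since pipeline-level batching is discouraged on CPUs, a batch of files can simply be processed one inference call at a time. A minimal sketch, where `classifier` stands for any Hugging Face pipeline (or anything callable on a file path); the stand-in classifier in the demo is purely illustrative:

```python
def classify_files(classifier, paths):
    """Run `classifier` once per file, avoiding pipeline-level batching."""
    return {path: classifier(path) for path in paths}

# Demo with a stand-in classifier so the sketch runs without any model:
fake_classifier = lambda p: [{"label": "cat", "score": 0.99}]
print(classify_files(fake_classifier, ["a.jpg", "b.jpg"]))
```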

@jjerphan
Contributor Author

jjerphan commented Sep 6, 2023

Although slightly more advanced, the approaches proposed in flojoy-ai/nodes#234 are also relevant to model deployment, since TorchScript models are also used as an interchange format.

Generally, we need to think about the UX for using models which have been serialized using:

  • TorchScript
  • TFLite
  • ONNX
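The UX question above starts from how each format is loaded, which the respective runtimes keep quite uniform. A hedged sketch with one lazy loader per format (each requires its heavy package only when called; function names are illustrative):

```python
def load_torchscript(path: str):
    """Load a TorchScript model; requires PyTorch."""
    import torch
    return torch.jit.load(path)  # returns a callable ScriptModule

def load_onnx_session(path: str):
    """Create an inference session for an ONNX model; requires onnxruntime."""
    import onnxruntime as ort
    return ort.InferenceSession(path)

def load_tflite(path: str):
    """Load a TFLite model; requires TensorFlow."""
    import tensorflow as tf
    interpreter = tf.lite.Interpreter(model_path=path)
    interpreter.allocate_tensors()  # must be called before inference
    return interpreter
```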

@jjerphan jjerphan closed this as not planned on Nov 30, 2024