-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RFC Model deployment #839
Comments
@jjerphan Thanks for opening this RFC! I am curious what your thoughts are about a few "bring-your-own-model" generic nodes that support |
Good to know! Definitely, I am interested to join efforts so that we keep consistency in Flojoy's development. I think this is approach is simple yet has a good granularity:
On my side, I started experimenting with flojoy-ai/nodes#253 this morning to have a simple prototype to discuss. I am waiting for critical reviews.
I think everything depends on Flojoy users. I do not know Flojoy user-base, but I assume that Flojoy users might not be programmer and might just want to use a flow to solve their problem effectively with Nodes that are available of the shelf. In this case, supporting lower-level specifications and tools (like Torchscripts' and ONNX') might potentially have little final added value for end users now, but it might in the future. If I were to choose, I would not preemptively work on supporting those formats, yet I also think users which are developers can easily create their Nodes or even contribute back to Flojoy for the support of those formats. What do you think? Feel free to propose additions or rectifications to this RFC, and I will update it accordingly. 🙂 |
💯 Think biologists or mechanical engineers who have likely heard of Python but don't do any coding or even command-line work themselves. |
Batching is not recommended when using Hugging Face Transformers' pipeline, especially on CPUs. Processing batches of files can be done using the |
Although slightly more advanced, approaches proposed in flojoy-ai/nodes#234 also are of relevance for model deployment since TorchScript model also are being used as interchangeable format. Generally, we need to think about the UX for using models which have been serialized using:
|
Context, uses-cases and motivations
Model deployment is a common necessity in many programmatic workflow. The main frameworks all have interfaces to easily download and use pre-trained models.
Hugging Face pipelines are higher levels abstractions which allow using several pre-trained models which already have been trained for a variety of tasks. They allow using the SOTA of Machine Learning models in a couple of lines, be in on CPUs, on a GPU or several of them, supporting various frameworks as back-end.
Due to the kind of problems Flojoy aims to tackle, such pipelines might be appropriate for problems users of Flojoy face. In particular, there exist Hugging Face pipelines for:
We might want to focus on providing an appropriate UX for those problems really well first.
We can then latter study how to configure GPU and CPU and usage and specifically how to process batch of data and have efficient training and inference.
Proposed scope: integrate the most useful Hugging Face pipelines
HUGGING_FACE_PIPELINE
node for Image classificationHUGGING_FACE_PIPELINE
node for Image segmentationHUGGING_FACE_PIPELINE
node for Audio classificationFor all new nodes:
References
The text was updated successfully, but these errors were encountered: