How can the model support parallel inference?
Even a small number of concurrent requests to a single model instance significantly increases response time. From what I can tell, the model accepts only one text input at a time and returns the corresponding audio. Is it possible to make some simple changes so it can process multiple texts concurrently without this slowdown?
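One common workaround, while waiting for native batching support, is to keep several independent model instances and hand each incoming request to a free one. This is a minimal sketch only: `DummyTTSModel` and its `synthesize()` method are hypothetical stand-ins for the real model, whose actual class name and API this issue does not specify.

```python
import queue
import threading

# Hypothetical stand-in for the real TTS model; the actual class and
# synthesize() signature are assumptions, not taken from this repository.
class DummyTTSModel:
    def synthesize(self, text: str) -> bytes:
        return f"audio({text})".encode()

class ModelPool:
    """Pool of independent model instances.

    If the model holds no shared mutable state across instances, N
    instances can serve N requests concurrently instead of queuing
    behind a single one.
    """
    def __init__(self, factory, size: int = 2):
        self._idle = queue.Queue()
        for _ in range(size):
            self._idle.put(factory())

    def synthesize(self, text: str) -> bytes:
        model = self._idle.get()       # block until an instance is free
        try:
            return model.synthesize(text)
        finally:
            self._idle.put(model)      # hand the instance back to the pool

pool = ModelPool(DummyTTSModel, size=2)

results = {}
def handle_request(text):
    results[text] = pool.synthesize(text)

threads = [threading.Thread(target=handle_request, args=(t,))
           for t in ("hello", "world")]
for t in threads:
    t.start()
for t in threads:
    t.join()
```

Note that if inference is CPU-bound in pure Python, the GIL limits true parallelism, so the same pool pattern is often run with separate processes (or the pool simply guards access to GPU instances). True throughput gains usually come from batching multiple texts into one forward pass, which requires model-side changes rather than this wrapper.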