speed up phi-3 inference? #428

CHNtentes · 2024-05-16T11:45:33Z

Hi, I tried your phi-3 example on Android. I wonder if it can run on GPU/Qualcomm HTP to further increase speed? Currently I suppose it only uses CPU.

salykova · 2024-05-16T12:53:07Z

I am interested in this too

CHNtentes · 2024-05-17T01:15:29Z

I suppose for now they probably cannnot do it, like executorch only small models can run on gpu/htp.

varunchariArm · 2024-07-10T16:40:27Z

I am interested too, also may I know what Execution Provider is being used in the provided 'libonnxruntime.so'?
Thans

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speed up phi-3 inference? #428

speed up phi-3 inference? #428

CHNtentes commented May 16, 2024 •

edited

Loading

salykova commented May 16, 2024 •

edited

Loading

CHNtentes commented May 17, 2024

varunchariArm commented Jul 10, 2024

speed up phi-3 inference? #428

speed up phi-3 inference? #428

Comments

CHNtentes commented May 16, 2024 • edited Loading

salykova commented May 16, 2024 • edited Loading

CHNtentes commented May 17, 2024

varunchariArm commented Jul 10, 2024

CHNtentes commented May 16, 2024 •

edited

Loading

salykova commented May 16, 2024 •

edited

Loading