Currently, we are using multi-qa-MiniLM-L6-cos-v1, which encodes roughly 14,200 sentences/sec on a single V100 GPU and has a model size of 80 MB. We should benchmark other models to see whether any of them gives us better retrieval quality, faster encoding, or both.
We could also experiment with different tokenizers. A rough benchmarking sketch follows below.
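As a starting point, here is a minimal throughput-comparison sketch using the sentence-transformers library. The candidate model names are illustrative picks from the SBERT pretrained-model list (not a decided shortlist), and the sample corpus is a placeholder rather than our real data:

```python
import time

from sentence_transformers import SentenceTransformer

# Illustrative candidates; swap in whichever models we want to evaluate.
CANDIDATES = [
    "multi-qa-MiniLM-L6-cos-v1",   # current baseline
    "all-MiniLM-L6-v2",
    "multi-qa-mpnet-base-dot-v1",
]

# Placeholder corpus; a real benchmark should use a representative
# sample of our own queries/documents.
sentences = ["How do I reset my password?"] * 10_000

for name in CANDIDATES:
    model = SentenceTransformer(name)
    start = time.perf_counter()
    embeddings = model.encode(sentences, batch_size=256, show_progress_bar=False)
    elapsed = time.perf_counter() - start
    print(f"{name}: {len(sentences) / elapsed:,.0f} sentences/sec, "
          f"dim={embeddings.shape[1]}")
```

Raw throughput alone won't settle the choice; we would also want to compare retrieval quality on a held-out set of our own queries before switching models.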
Further reading: