Compare with proprietary models #2

IonMich · 2024-11-27T08:55:22Z

We should add an inference endpoint that calls OpenAI's 4o and 4o1 variants, to get an idea of what is the state of the art in DocVQA. Similarly for Anthropic's Claude API and Google's Gemini.

Notably, these models should be able to extract all relevant information from the page in one-shot, including problem statement and handwritten attempted solution.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Compare with proprietary models #2

Compare with proprietary models #2

IonMich commented Nov 27, 2024

Compare with proprietary models #2

Compare with proprietary models #2

Comments

IonMich commented Nov 27, 2024