
[Draft]Add Multimodal RAG notebook #2497

Open
wants to merge 18 commits into base: latest
Conversation

openvino-dev-samples
Collaborator

@openvino-dev-samples openvino-dev-samples commented Nov 1, 2024



@openvino-dev-samples openvino-dev-samples changed the title [Draft]Add Multimodal RAG [Draft]Add Multimodal RAG notebook Nov 4, 2024
transfer to optimum-intel

transfer to optimum-intel
@eaidova
Collaborator

eaidova commented Nov 18, 2024

@openvino-dev-samples for me everything looks good, thanks.

A couple of comments:
Possibly it is better to move to accuracy-aware quantization for the VLM using optimum-cli; you need to provide the --weight-format int4 --dataset contextual options for that (fyi @nikita-savelyevv)

Are there any plans to integrate OV visual language models directly into llama-index?

@openvino-dev-samples
Collaborator Author

openvino-dev-samples commented Nov 18, 2024

@openvino-dev-samples for me everything looks good, thanks.

A couple of comments: Possibly it is better to move to accuracy-aware quantization for the VLM using optimum-cli; you need to provide the --weight-format int4 --dataset contextual options for that (fyi @nikita-savelyevv)

Are there any plans to integrate OV visual language models directly into llama-index?

Thanks for your review; the integration is already done in llama-index:

https://docs.llamaindex.ai/en/stable/examples/multi_modal/openvino_multimodal/

BTW, is there an example of accuracy-aware quantization for phi3-vision?

@nikita-savelyevv
Collaborator

A couple of comments: Possibly it is better to move to accuracy-aware quantization for the VLM using optimum-cli; you need to provide the --weight-format int4 --dataset contextual options for that (fyi @nikita-savelyevv)

I'll add that the algorithm itself needs to be specified, e.g. --weight-format int4 --dataset contextual --awq.

Also, the default number of samples (128) might be too large; it can be reduced with --num-samples 32.

@openvino-dev-samples
Collaborator Author

A couple of comments: Possibly it is better to move to accuracy-aware quantization for the VLM using optimum-cli; you need to provide the --weight-format int4 --dataset contextual options for that (fyi @nikita-savelyevv)

I'll add that the algorithm itself needs to be specified, e.g. --weight-format int4 --dataset contextual --awq.

Also, the default number of samples (128) might be too large; it can be reduced with --num-samples 32.

Hi, in my testing, the accuracy with this configuration is not satisfactory:
optimum-cli export openvino --model {vlm_model_id} {vlm_model_path} --trust-remote-code --weight-format int4 --dataset contextual --awq --num-samples 32
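For reference, the flags under discussion can be collected in one place. The sketch below is a hypothetical helper (the `build_awq_export_cmd` name and the example model/output paths are mine, not part of optimum-intel); only the flags themselves come from this thread.

```python
import shlex


def build_awq_export_cmd(model_id: str, output_dir: str, num_samples: int = 32) -> str:
    """Assemble the optimum-cli export command line discussed in this thread.

    Hypothetical helper for illustration; the flag set mirrors the command
    tried above (--weight-format int4 --dataset contextual --awq --num-samples N).
    """
    args = [
        "optimum-cli", "export", "openvino",
        "--model", model_id,
        output_dir,
        "--trust-remote-code",
        "--weight-format", "int4",          # 4-bit weight compression
        "--dataset", "contextual",          # calibration set for data-aware compression
        "--awq",                            # the algorithm must be named explicitly
        "--num-samples", str(num_samples),  # the default of 128 may be too large
    ]
    return shlex.join(args)


# Example model id and output directory are placeholders, not from the thread.
print(build_awq_export_cmd("microsoft/Phi-3-vision-128k-instruct", "phi3v_ov_int4"))
```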

add load image function
@nikita-savelyevv
Collaborator

nikita-savelyevv commented Nov 19, 2024

A couple of comments: Possibly it is better to move to accuracy-aware quantization for the VLM using optimum-cli; you need to provide the --weight-format int4 --dataset contextual options for that (fyi @nikita-savelyevv)

I'll add that the algorithm itself needs to be specified, e.g. --weight-format int4 --dataset contextual --awq.
Also, the default number of samples (128) might be too large; it can be reduced with --num-samples 32.

Hi, in my testing, the accuracy with this configuration is not satisfactory: optimum-cli export openvino --model {vlm_model_id} {vlm_model_path} --trust-remote-code --weight-format int4 --dataset contextual --awq --num-samples 32

Thanks for the information! Have you compared it against the configuration below?

compression_config = {
    "mode": nncf.CompressWeightsMode.INT4_SYM,
    "group_size": 64,
    "ratio": 0.6,
}

Yes, this configuration gives more reasonable responses than the optimum-cli one.
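For context, the NNCF configuration discussed above can be annotated key by key. This is a dependency-free sketch: the values mirror the snippet in the comment, the annotations are my reading of the usual NNCF weight-compression semantics (not verified in this thread), and the `ov_model` name in the commented call is a placeholder.

```python
# The weight-compression settings from the comment above, annotated.
compression_config = {
    "mode": "INT4_SYM",  # nncf.CompressWeightsMode.INT4_SYM: symmetric 4-bit weights
    "group_size": 64,    # weights quantized in groups of 64 channels
    "ratio": 0.6,        # roughly 60% of weights go to INT4; the rest stay at 8-bit
}

# With nncf installed and the mode given as the real enum, this would
# typically be applied to an OpenVINO model roughly as:
#   import nncf
#   compressed = nncf.compress_weights(ov_model,
#                                      mode=nncf.CompressWeightsMode.INT4_SYM,
#                                      group_size=64, ratio=0.6)
print(compression_config["ratio"])
```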

update the method of audio extraction
@openvino-dev-samples
Collaborator Author

Hi @eaidova, it looks like the CI is out of resources to validate this notebook. Any suggestions? Thanks.

@eaidova
Collaborator

eaidova commented Nov 21, 2024

@openvino-dev-samples I can suggest trying internvl2-1b-instruct or nano-llava for CI testing; they are small enough to fit into precommit.

fix ci issues
@openvino-dev-samples
Collaborator Author

--trust-remote-code --weight-format int4 --dataset contextual --awq --num-samples 32

Sorry, I made a mistake earlier. The result looks good with this accuracy-aware config now.

update with accuracy aware quantization
4 participants