fill prompt for sampler analysis with real tokens in VLM pipeline #1247

sbalandi · 2024-11-22T10:43:35Z

No description provided.

ilya-lavrenov · 2024-11-27T15:49:08Z

src/cpp/src/visual_language/pipeline.cpp

+        auto chat_history = m_inputs_embedder->get_tokenized_chat_history();
+        size_t chat_history_size = std::max(chat_history.get_shape().at(1), history_size + inputs_embeds_size);
+        ov::Tensor prompt_ids(ov::element::i64, { chat_history_size });
+        std::fill_n(prompt_ids.data<int64_t>(), prompt_ids.get_size(), 1);


why 1 is used as default value? maybe pad_token ?

ilya-lavrenov · 2024-11-27T15:50:56Z

src/cpp/src/visual_language/pipeline.cpp

-        std::fill_n(prompt_ids.data<int64_t>(), prompt_ids.get_size(), 0);
+
+        auto chat_history = m_inputs_embedder->get_tokenized_chat_history();
+        size_t chat_history_size = std::max(chat_history.get_shape().at(1), history_size + inputs_embeds_size);


looks like we have the same case as for LLMs, when decode ( encode ( X ) ) provides smaller value than X ?
in this case we need to partially re-compute the history.

in general, I would consider merging VLM and LLM pipelines generate functions to keep all this magic with history in one place.
Or at least to create helper function similar to get_lm_encoded_results

github-actions bot added the category: visual language Visual language pipeline label Nov 22, 2024

fill prompt for sampler analysis with real tokens in VLM pipeline

15fdc3c

ilya-lavrenov assigned ilya-lavrenov and Wovchena Nov 26, 2024

ilya-lavrenov reviewed Nov 27, 2024

View reviewed changes

ilya-lavrenov added this to the 2025.0 milestone Nov 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fill prompt for sampler analysis with real tokens in VLM pipeline #1247

fill prompt for sampler analysis with real tokens in VLM pipeline #1247

sbalandi commented Nov 22, 2024

ilya-lavrenov Nov 27, 2024

ilya-lavrenov Nov 27, 2024

ilya-lavrenov Nov 27, 2024

fill prompt for sampler analysis with real tokens in VLM pipeline #1247

Are you sure you want to change the base?

fill prompt for sampler analysis with real tokens in VLM pipeline #1247

Conversation

sbalandi commented Nov 22, 2024

ilya-lavrenov Nov 27, 2024

Choose a reason for hiding this comment

ilya-lavrenov Nov 27, 2024

Choose a reason for hiding this comment

ilya-lavrenov Nov 27, 2024

Choose a reason for hiding this comment