
Models to port to MLX-VLM #39

Open
12 of 26 tasks
Blaizzy opened this issue Jun 11, 2024 · 25 comments
Labels
good first issue Good for newcomers

Comments

@Blaizzy
Owner

Blaizzy commented Jun 11, 2024

  • MiniCPM-Llama3-V-2_5
  • Florence 2
  • Phi-3-vision
  • Bunny
  • Dolphin-vision-72b
  • Llava Next
  • Qwen2-VL
  • Pixtral
  • Llama-3.2
  • Llava Interleave
  • Idefics 3
  • OmniParser
  • Llava onevision
  • internlm-xcomposer2d5-7b
  • InternVL
  • CogVLM2
  • ColPali
  • MoonDream2
  • Yi-VL
  • CuMo
  • Kosmos-2.5
  • Molmo
  • Ovis Gemma
  • Aria
  • NVIDIA NVLM
  • GOT

Instructions:

  1. Select a model and comment below with your selection.
  2. Create a draft PR titled "Add support for X".
  3. Read the Contribution guide.
  4. Check the existing models.
  5. Tag @Blaizzy for code reviews and questions.

If the model you want is not listed, please suggest it and I will add it.

@Blaizzy
Owner Author

Blaizzy commented Jun 22, 2024

For the next release of Llava-Next:

TODO:
Update the text config defaults to avoid errors with Llava-v1.6-vicuna:

from dataclasses import dataclass
from typing import Dict, Optional, Union

@dataclass
class TextConfig:
    model_type: str
    hidden_size: int = 4096
    num_hidden_layers: int = 32
    intermediate_size: int = 11008
    num_attention_heads: int = 32
    rms_norm_eps: float = 1e-05
    vocab_size: int = 32064
    num_key_value_heads: int = 32
    rope_theta: float = 1000000
    rope_traditional: bool = False
    rope_scaling: Optional[Dict[str, Union[float, str]]] = None
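A dataclass with defaults only prevents these errors if unknown keys from a model's config.json are also filtered out before construction. A minimal sketch of that filtering pattern, common in the MLX example repos (the `from_dict` helper and reduced field set here are illustrative, not the project's exact code):

```python
import inspect
from dataclasses import dataclass
from typing import Dict, Optional, Union


@dataclass
class TextConfig:
    model_type: str
    hidden_size: int = 4096
    num_hidden_layers: int = 32
    rope_theta: float = 1000000
    rope_scaling: Optional[Dict[str, Union[float, str]]] = None

    @classmethod
    def from_dict(cls, params: dict) -> "TextConfig":
        # Keep only keys that match a declared field, so extra entries in a
        # model's config.json (common across Llava variants) don't raise
        # TypeError, and missing entries fall back to the defaults above.
        known = inspect.signature(cls).parameters
        return cls(**{k: v for k, v in params.items() if k in known})


# A config with one unknown key and several missing ones still loads cleanly.
cfg = TextConfig.from_dict({"model_type": "llama", "unknown_key": 1})
```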

@BoltzmannEntropy

Thanks for the great repo. This should also be on the list: https://github.com/THUDM/CogVLM2
I am now just reading the code, and trying to free some time for the conversion routine.

@jrp2014

jrp2014 commented Aug 8, 2024

@Blaizzy
Owner Author

Blaizzy commented Aug 8, 2024

Hey @BoltzmannEntropy and @jrp2014,

Thanks for the suggestions!

I have added them to the backlog.

@jrp2014

jrp2014 commented Aug 27, 2024

MiniCPM-V v2.6


@s-smits

s-smits commented Sep 7, 2024

Do you have a link to Florence-2?

@ChristianWeyer

Is the above list the ultimate and up-to-date list of supported models @Blaizzy? Thanks for your hard work!

@Blaizzy
Owner Author

Blaizzy commented Sep 10, 2024

Hey @ChristianWeyer
It's mostly up-to-date, just missing Qwen2-VL.

@Blaizzy
Owner Author

Blaizzy commented Sep 10, 2024

@s-smits here you go:

https://huggingface.co/microsoft/Florence-2-large/blob/main/modeling_florence2.py

@ChristianWeyer

[x] Phi-3-vision

Thanks!
I guess Phi-3-vision includes 3.5?

@Blaizzy
Owner Author

Blaizzy commented Sep 10, 2024

Yes, they have the same arch so there are no changes needed :)

@pulkitjindal88

Hey @Blaizzy, thanks for this great framework. Is there any priority for InternVL? I can see it is on your list; I just wanted to know if it is planned in the near term. I want to run the model on my MacBook, and mlx-vlm looks like the best way to do that.

@chigkim

chigkim commented Sep 21, 2024

Qwen2-VL-72B would be amazing!

@simonw

simonw commented Sep 29, 2024

This recipe seems to work for Qwen2-VL-2B-Instruct:

python -m mlx_vlm.generate \
  --model Qwen/Qwen2-VL-2B-Instruct \
  --max-tokens 100 \
  --temp 0.0 \
  --image django-roadmap.png \
  --prompt "Describe image in detail, include all text"

My results here: https://gist.github.com/simonw/9e02d425cacb902260ec1307e0671e17

@chigkim

chigkim commented Sep 30, 2024

Yep, they just merged Qwen2-VL support this weekend.

@xSNYPSx

xSNYPSx commented Oct 2, 2024

Molmo please

@chigkim

chigkim commented Oct 2, 2024

Nvidia just dropped the multimodal NVLM-D-72B. The benchmarks look pretty good.

https://huggingface.co/nvidia/NVLM-D-72B

@Blaizzy
Owner Author

Blaizzy commented Oct 2, 2024

Yep, that's a pretty awesome model!
It's on my radar because we can run it in a 4-bit quant.

@chigkim

chigkim commented Oct 25, 2024

Pixtral-12B now has a base model.
https://huggingface.co/mistralai/Pixtral-12B-Base-2409

@Benjoyo

Benjoyo commented Nov 22, 2024

Hey @Blaizzy, could you add ColQwen support? Since Qwen2-VL is already supported and ColQwen is just an additional linear layer on top, this seems like low-hanging fruit, especially since Col* models are a really hot topic right now.

I could really use this for my projects (e.g. local private document search + qa) 😊
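For context, ColQwen/ColPali-style retrieval scores a page with late interaction (MaxSim): every query-token embedding is compared against every document-token embedding, the best match per query token is kept, and the maxima are summed. A pure-Python sketch of just the scoring step, using toy 2-D vectors (real ColQwen projects each token to roughly 128 dimensions; the `maxsim` name and shapes here are illustrative):

```python
def maxsim(query_emb, doc_emb):
    """Late-interaction (MaxSim) score: for each query token vector, take
    the best dot product over all document token vectors, then sum."""
    return sum(
        max(sum(q * d for q, d in zip(q_vec, d_vec)) for d_vec in doc_emb)
        for q_vec in query_emb
    )


# Toy per-token embeddings for one query and two candidate documents.
query = [[1.0, 0.0], [0.0, 1.0]]
doc_a = [[1.0, 0.0], [0.5, 0.5]]    # aligns well with the query tokens
doc_b = [[-1.0, 0.0], [0.0, -1.0]]  # points away from the query tokens

score_a = maxsim(query, doc_a)  # 1.0 + 0.5 = 1.5
score_b = maxsim(query, doc_b)  # 0.0 + 0.0 = 0.0
```

The extra linear layer mentioned above would sit between the VLM's per-token hidden states and these embeddings; the rest of the retrieval pipeline is ranking documents by this score.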

@pcuenca
Contributor

pcuenca commented Nov 26, 2024

Working on Idefics 3 here: #124

@Blaizzy
Owner Author

Blaizzy commented Nov 26, 2024

@Benjoyo, ColQwen and ColPali are awesome models.

At the moment, I'm working on refactoring and some optimisations, so new model ports by me are on hold.

However, I appreciate any PRs. I'm here to review and help when needed.

@Blaizzy
Owner Author

Blaizzy commented Nov 26, 2024

Thank you very much, @pcuenca!

It means a lot 🚀

I left a few comments.

@kukeshajanth

Is it possible to bring this model under MLX-VLM?

https://huggingface.co/showlab/ShowUI-2B
