
Ollama starcoder2:15b inline completions model does not work #896

Open
pedrogutobjj opened this issue Jul 14, 2024 · 32 comments
Labels
bug Something isn't working

Comments

@pedrogutobjj

Hi everyone, I previously posted about this "error"; could anyone help me? When I try to use the inline completion model, nothing happens: GPU usage goes to practically 100%, but no suggestion appears on the code line. Below are the inline completion configuration screens and my Jupyter screens.

image

image

image

@pedrogutobjj pedrogutobjj added the bug Something isn't working label Jul 14, 2024
@krassowski
Member

  1. Does chat work with the same model?
  2. Does completion work with a different, non-local model?
  3. Does setting completion streaming to "always" help?

My first guess would be that the model just takes too long on your machine. Until you can confirm that it works in chat but not with completion, that remains a reasonable assumption.

@krassowski
Member

Also, are you using the latest JupyterLab version?

@pedrogutobjj
Author

  1. For chat I'm using gemma2, and it works fine.

image

  2. I tested some Hugging Face models a few weeks ago and they worked normally.

  3. Nothing happens when I select the option; the code suggestions still do not appear.

image

@krassowski
Member

From your pictures it looks like you are expecting the completion to appear on a new line (based on the position of your cursor). Currently it works by completing the text you start typing on an existing line: you need to type 2 or 3 characters and wait, as illustrated below. What exact version of JupyterLab are you using?
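A hypothetical illustration of that trigger behavior (the `df.groupby` example is mine, not from the screenshots):

```python
# 1. Start typing on an existing line, e.g.:
#
#        df.gro
#
# 2. Pause for a moment. The suggestion should appear as greyed-out
#    ghost text after the cursor, completing the line in place:
#
#        df.groupby("a").sum()
#
# Placing the cursor on an empty new line and waiting, as in the
# screenshots above, will generally not trigger a suggestion.
```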

@krassowski
Member

My question is whether chat works with the same exact model, not with a different one. Conversely, if you try using Gemma for inline completion, does it work for you?

@pedrogutobjj
Author

pedrogutobjj commented Jul 14, 2024

I actually wait a bit to see if any code suggestions appear, but nothing happens.

My JupyterLab version:

image

If I use gemma2 for both models (the completion model and the inline completion model), it works fine!

image

@pedrogutobjj
Author

image

@pedrogutobjj
Author

However, starcoder is much better than gemma2 for code suggestions.

@pedrogutobjj
Author

Another situation: when I wait for some code to "auto complete", it "resets" the code from the beginning, starting from the import, and a ```python fence appears. Is this normal? Can't we make this more fluid?

image

@richlysakowski

richlysakowski commented Jul 14, 2024 via email

@krassowski
Member

If other models work but starcoder does not, it is likely a problem with the model, or possibly your GPU having less memory than required to run it fast enough to be useful (but I see you managed to run deepseek-coder v2:16b, so unless deepseek-coder is quantized and starcoder is not, this would not align with the hardware theory).

@richlysakowski on the main page of the repository (https://github.com/jupyterlab/jupyter-ai) you will see an "Unwatch" button, with options to watch only releases.

@krassowski krassowski changed the title Inline completions model doesn't work with me. Ollama starcoder2:15b inline completions model does not work Jul 15, 2024
@pedrogutobjj
Author

@krassowski

When I insert part of the code, the model returns several explanations plus the complete code, when I want it to complete only the missing part. For example, if I insert "import pandas as", I expect the model to complete it with "pd", but instead it repeats "import pandas as pd" and adds some random explanations. I just want it to complete what I'm writing, not rewrite everything. I don't know if my question is clear, or whether this is configurable.
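To illustrate the reported behavior (this mock transcript is mine, based on the description above, not copied from the screenshots):

```python
# Typed so far:
#     import pandas as
#
# Expected ghost text (continuation only):
#     " pd"
#
# Actual model output (repeats the prefix and adds prose):
#     "import pandas as pd\n\nThis imports the pandas library..."
```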

@krassowski
Member

If you use streaming mode, this should have been fixed by #879 (which will be included in the 2.19 release).

@pedrogutobjj
Author

Thanks very much!

Is this release launching today?

@pedrogutobjj
Author

I'm using the new release, 2.19... same problem with completions: the ```python fence still appears. Has the "bug" not been fixed?

image

@pedrogutobjj
Author

image

Using codegemma...

Is there any Ollama model that has been tested and confirmed to work for completing lines?

@krassowski
Member

Thanks for testing. The Ollama provider is experimental, so there may be issues to iron out. Things I would suspect:

  • a) the models may not be very good at respecting instructions and generating the expected output (especially given that some of the models you listed above are rather small, 9b or 15b), or
  • b) there is some issue with Windows-style vs Unix-style new-line endings (see the sketch at the end of this comment).

> Is there any Ollama model that has been tested and confirmed to work for completing lines?

You already know the answer, as it was provided in #646 (comment). Otherwise, there are no systematic tests for individual models.
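A minimal sketch of the normalization that suspicion (b) refers to, assuming the suggestion arrives as a raw string; this is illustrative, not Jupyter AI's actual code:

```python
def normalize_newlines(suggestion: str) -> str:
    """Convert Windows-style (CRLF) and old-Mac (CR) line endings to Unix (LF).

    If the client splits or compares suggestions on LF only, stray
    carriage returns can break ghost-text rendering and prefix matching.
    """
    return suggestion.replace("\r\n", "\n").replace("\r", "\n")


assert normalize_newlines("import os\r\nprint(os.name)") == "import os\nprint(os.name)"
```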

@krassowski
Member

I can reproduce the prefix trimming issue with all providers in 2.19.0, whether streaming or not.
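For readers following along: "prefix trimming" here means removing the part of the model's output that repeats what is already in the editor. A minimal sketch of the idea (illustrative only, not the actual Jupyter AI implementation):

```python
def trim_prefix(existing_code: str, suggestion: str) -> str:
    """Drop the longest trailing chunk of existing_code that the
    suggestion redundantly repeats at its start."""
    for length in range(min(len(existing_code), len(suggestion)), 0, -1):
        if suggestion.startswith(existing_code[-length:]):
            return suggestion[length:]
    return suggestion


# The editor already contains "import pandas as"; the model echoed it back.
print(repr(trim_prefix("import pandas as", "import pandas as pd")))  # ' pd'
```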

@krassowski
Member

For some reason, in 2.19.0 the suggestion includes an extra space. This is logging from GPT-4 without streaming (so logic that should not have changed since 2.18):

image

@krassowski
Member

Ah no, this was still Ollama with phi, not GPT-4. So it looks like Ollama output parsing may be off by a spurious whitespace at the beginning.
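One plausible guard against that spurious leading space (an assumption about the kind of fix needed, not the actual patch):

```python
def strip_spurious_space(prefix: str, suggestion: str) -> str:
    """Drop a single leading space from the suggestion when the text
    before the cursor already ends with whitespace, so that
    "import pandas as " + " pd" does not render with a double space."""
    if suggestion.startswith(" ") and prefix.endswith((" ", "\t", "\n")):
        return suggestion[1:]
    return suggestion


print(repr(strip_spurious_space("import pandas as ", " pd")))  # 'pd'
```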

@pedrogutobjj
Author

> Thanks for testing. The Ollama provider is experimental, so there may be issues to iron out. Things I would suspect: a) the models may not be very good at respecting instructions and generating the expected output (especially given that some of the models you listed above are rather small, 9b or 15b), or b) there is some issue with Windows-style vs Unix-style new-line endings.
>
> You already know the answer, as it was provided in #646 (comment). Otherwise, there are no systematic tests for individual models.

I liked some of the results the gemma2:9b model gave me; we could look more in-depth at this model. I test some of the most popular models daily and analyze their responses; as I test them I will report here or in a dedicated topic.

@krassowski
Member

@pedrogutobjj did you have a chance to test the latest release, v2.19.1, which includes #900? Is it any better?

@pedrogutobjj
Author

Hey @krassowski, morning!

I tested some lines of code this morning; here are the results.

image

@krassowski
Member

Is this what you would expect or not? To me it looks like a syntactically valid response. Of course it is a bit useless, but that is down to the ability of the model you use.

@pedrogutobjj
Author

The logic is correct; I mean the overlaps, I don't know if I was clear about this. For example: I inserted "def sum_matrizes(matrix1, matrix2):", then comes the autocomplete part, and it repeats "def sum_matrizes" again as a suggestion, even though I already inserted it at the beginning of the code. I don't know if I managed to make my point clear.

@krassowski
Member

I see that, but it looks like the model is at fault here. It first inserted "import numpy as" and only then started the "def sum_matrizes(matrix1, matrix2):" part again.

Previously you were testing with deepseek-coder and codegemma, but now you posted a result from llama3.1. If we want to see whether the changes helped with the issue you reported back then, can you test with the same models?

@pedrogutobjj
Author

With deepseek-coder:

image

With codegemma:7b:

image

@krassowski
Member

Thanks! Just to help me reproduce: where was your cursor when you invoked the inline completer?

@pedrogutobjj
Author

At the top of the cell.

@krassowski
Member

Do you mean that your cursor was at the end of the first line here:

```
def soma_matrices(matriz1, matriz2):|
```

or on the new line:

```
def soma_matrices(matriz1, matriz2):
|
```

or on the new line after a tab:

```
def soma_matrices(matriz1, matriz2):
    |
```

@pedrogutobjj
Author

image

@pedrogutobjj
Author

image
