Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

bug: Local models cannot recognize uploaded files #1740

Open
1 of 3 tasks
imtuyethan opened this issue Nov 20, 2024 · 2 comments
Open
1 of 3 tasks

bug: Local models cannot recognize uploaded files #1740

imtuyethan opened this issue Nov 20, 2024 · 2 comments
Assignees
Labels
category: tools RAG, function calling, etc type: bug Something isn't working

Comments

@imtuyethan
Copy link
Contributor

imtuyethan commented Nov 20, 2024

Jan version

0.5.8-731

Describe the Bug

I tried the entire Llama family. These models seem like they're unable to detect or process uploaded PDF files, responding as if no file was provided despite the UI showing successful file uploads.

1
2
Screenshot 2024-11-20 at 7 29 07 PM

File Type: PDF
File Size: 169.56KB

Steps to Reproduce

  1. Upload a PDF file
  2. Requests to "summarize this"
  3. Model responds that there is no text to summarize

Screenshots / Logs

OS: macOS (Darwin Kernel Version 23.2.0)
Hardware: Apple M2
Jan Version: v0.5.8-731
Memory: 16GB Total
Cortex Version: v1.0.3-rc5

What is your OS?

  • MacOS
  • Windows
  • Linux
@imtuyethan imtuyethan added the type: bug Something isn't working label Nov 20, 2024
@louis-jan
Copy link
Contributor

As I remember, we don't support to change model mid-thread for documents uploads. There would be an enhancement where we fix an embedding model to address this issue from cortex.cpp.

@imtuyethan imtuyethan added the category: tools RAG, function calling, etc label Nov 21, 2024
@sebastian-lempert
Copy link

Same for me:

Screenshots

Bildschirmfoto 2024-11-21 um 17 09 29
Bildschirmfoto 2024-11-21 um 17 10 43
Bildschirmfoto 2024-11-21 um 17 10 58

PDF file used

Downloaded from Wikipedia: https://en.wikipedia.org/w/index.php?title=Special:DownloadAsPdf&page=Bitcoin&action=show-download-screen

System

  • OS: macOS, ProductVersion: 15.1.1, BuildVersion: 24B91
  • CPU: 2,6 GHz 6-Core Intel Core i7
  • GPUs (I guess none of thes should / can be used for AI):
    1. AMD Radeon Pro 5300M 4 GB
    2. Intel UHD Graphics 630 1536 MB
  • Jan: v0.5.8
  • Memory: 16GB Total
  • Cortex: v1.0.2 (according to cortex.log)
  • Model: Llama 3.1 8B Instruct Q4

Steps to reproduce

  1. Start Jan
  2. Install model "Llama 3.1 8B Instruct Q4"
  3. Follow these steps in order to be able to chat with PDFs using RAG: https://jan.ai/docs/tools/retrieval
  4. Start a new Thread
  5. Select model "Llama 3.1 8B Instruct Q4"
  6. Upload PDF
  7. Ask "Please summarize the text in this file" --> this will take forever
  8. Ask "Please summarize the text in this file" again --> this will result in an answer like "You haven't provided a file for me to access or read from. If you'd like, you can copy and paste the contents of the file into this chat window, and I'll be happy to help summarize it for you!"

@imtuyethan imtuyethan transferred this issue from janhq/jan Nov 28, 2024
@github-project-automation github-project-automation bot moved this to Investigating in Jan & Cortex Nov 28, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
category: tools RAG, function calling, etc type: bug Something isn't working
Projects
Status: Investigating
Development

No branches or pull requests

4 participants